Compact, evolving community taxonomies using ... - Camille Roth

set of subjects, concepts, notions, issues; sharing a common goal of knowledge creation — Haas (1992). Definition here: “an epistemic community is the largest ...
173KB taille 1 téléchargements 370 vues
Compact, evolving community taxonomies using concept lattices Camille Roth Dept. Social and Cognitive Sciences, Univ. Modena and CREA — CNRS/Ecole Polytechnique, Paris

ICCS 14 — July 17-21, 2006; Aalborg, DK

Building taxonomies

Rationale Describe the taxonomy of a knowledge community, in particular scientific communities, that matches high-level descriptions. Epistemic communities Epistemic Community: group of agents sharing a common set of subjects, concepts, notions, issues; sharing a common goal of knowledge creation — Haas (1992) Definition here: “an epistemic community is the largest set of agents sharing a given set of notions” ∼ structural equivalence

Building taxonomies

Rationale Describe the taxonomy of a knowledge community, in particular scientific communities, that matches high-level descriptions. Epistemic communities Epistemic Community: group of agents sharing a common set of subjects, concepts, notions, issues; sharing a common goal of knowledge creation — Haas (1992) Definition here: “an epistemic community is the largest set of agents sharing a given set of notions” ∼ structural equivalence

Building taxonomies

Rationale Describe the taxonomy of a knowledge community, in particular scientific communities, that matches high-level descriptions. Epistemic communities Epistemic Community: group of agents sharing a common set of subjects, concepts, notions, issues; sharing a common goal of knowledge creation — Haas (1992) Definition here: “an epistemic community is the largest set of agents sharing a given set of notions” ∼ structural equivalence

Building taxonomies

Notions (N)

Formal framework Binary relation R between agents & notions

s1

Intent S ∧ of an agent set S: all notions used by every agent in S

s2

Prs

Lng

Extent N ? of a notion set N (S, N) is closed iff N = S ∧ and S = N ? Epistemic community: the extent of the intent of an agent set

NS

s3 s4

Agents (S)

Building taxonomies

Notions (N)

Formal framework Binary relation R between agents & notions

s1

Intent S ∧ of an agent set S: all notions used by every agent in S

s2

Prs

Lng

Extent N ? of a notion set N (S, N) is closed iff N = S ∧ and S = N ? Epistemic community: the extent of the intent of an agent set

NS

s3 s4

Agents (S)

Building taxonomies

Representing epistemic communities 1

structured into fields, with common concerns,

2

hierarchically: generalization / specialization,

3

overlapping.

From trees to lattices

Building taxonomies

Representing epistemic communities 1

structured into fields, with common concerns,

2

hierarchically: generalization / specialization,

3

overlapping.

From trees to lattices inspiration

application

ICCS

ICCS tree

inspiration

application

ICCS lattice

Building taxonomies Concept (Galois) lattices Consider the partially-ordered set of all epistemic communities {(S ∧? , S)|S ⊆ S} under (X , X ∧ ) < (X 0 , X 0∧ ) ⇔ X ⊂ X 0

( s 1 s 2 s 3s 4 ; ∅ ) ( s 1 s 2 s 3 ; Lng ) (s 1s 2 ; Lng Prs )

GL ( s 2 s 3 s 4 ; NS )

( s 2s 3 ; Lng NS )

( s 2 ; Lng Prs NS )

Managing taxonomies

Taxonomy selection & extraction Which ECs should we extract from the lattice? Given the assumptions, use agent set size — yet small isolated ECs could be interesting too. Create a partial taxonomy, with selection heuristics: partially-ordered set overlaying the lattice: “epistemic hypergraph”

( s 1 s 2 s 3s 4 ; ∅ ) ( s 1 s 2 s 3 ; Lng ) (s 1s 2 ; Lng Prs )

GL ( s 2 s 3 s 4 ; NS )

( s 2s 3 ; Lng NS )

( s 2 ; Lng Prs NS )

poset ( s 1 s 2 s 3s 4 ; ∅ ) ( s 1 s 2 s 3 ; Lng )

( s 2 s 3 s 4 ; NS )

Managing taxonomies

Taxonomy selection & extraction Which ECs should we extract from the lattice? Given the assumptions, use agent set size — yet small isolated ECs could be interesting too. Create a partial taxonomy, with selection heuristics: partially-ordered set overlaying the lattice: “epistemic hypergraph”

( s 1 s 2 s 3s 4 ; ∅ ) ( s 1 s 2 s 3 ; Lng ) (s 1s 2 ; Lng Prs )

GL ( s 2 s 3 s 4 ; NS )

( s 2s 3 ; Lng NS )

( s 2 ; Lng Prs NS )

poset ( s 1 s 2 s 3s 4 ; ∅ ) ( s 1 s 2 s 3 ; Lng )

( s 2 s 3 s 4 ; NS )

Managing taxonomies

Taxonomy selection & extraction Which ECs should we extract from the lattice? Given the assumptions, use agent set size — yet small isolated ECs could be interesting too. Create a partial taxonomy, with selection heuristics: partially-ordered set overlaying the lattice: “epistemic hypergraph”

( s 1 s 2 s 3s 4 ; ∅ ) ( s 1 s 2 s 3 ; Lng ) (s 1s 2 ; Lng Prs )

GL ( s 2 s 3 s 4 ; NS )

( s 2s 3 ; Lng NS )

( s 2 ; Lng Prs NS )

poset ( s 1 s 2 s 3s 4 ; ∅ ) ( s 1 s 2 s 3 ; Lng )

( s 2 s 3 s 4 ; NS )

Managing taxonomies Taxonomy evolution growth

1

Progress or decline of a field

(S1,N)

decrease

merging

2

Merging or scission of a field

(S2 ,N)

(S,N)

(S’,N’)

^

(S ∩ S’,(S ∩ S’) )

scission

(S2 ,N)

Empirical protocol

The zebrafish community MedLine abstracts including the term “zebrafish” over the period 1990-2003 each lemmatized term is a notion within an expert-based dictionary

Empirical results Hierarchical epistemic hypergraph 1990-1995 All (255)

Dev (168) Hom (67)

Mou (92)

Hum (34)

Brn (102)

Ver (75)

Pat (99) Spi (30)

Ven (50) Dor (49)

Gro (44) Sig (53)

Mou Dev (72) Hom Mou (40)

Dev Brn (81) Mou Hum (18)

Hom Hum (11)

Dev Pat (77) Ver Dev (68) Mou Ver (30)

Ven Dor (34) Brn Pat (62) Ver Pat (42)

Brn Ven (43) Brn Spi Crd (29)

Brn Dor (38)

Brn Ven Dor (30) Brn Spi Crd Ven (15)

Pwy (38)

Empirical results Hierarchical epistemic hypergraph 1998-2003 All (255)

Dev (150) Hom (57)

Mou (100)

Hum (100)

Brn (82) Ver (86)

Pat (90)

Dev Brn (62)

Mou Hum (58)

Hom Hum (38)

Gro (67)

Sig (133)

Mou Dev (71)

Hum Ver (44) Hom Mou (35)

Ven (40) Dor (40) Spi Crd (18)

Dev Pat (78) Ver Dev (70)

Sig Pwy (84) Gro Sig (51) Ven Dor (24)

Sig Rec (48) Gro Pwy (42)

Pat Brn (47)

Mou Ver (48)

Rec (67) Pwy (93)

Pwy Rec (34)

Ver Pat (58)

Gro Sig Pwy (39) Sig Pwy Rec (31)

Empirical results

Historical description 1

Research on brain and spinal cord depreciated,

2

The community started to enquire relationships between signal, pathway, and receptors,

3

Mouse-related research is stable, yet significant stress on human-related topics & new relationship to homologous genes and vertebrates: growing focus on differential studies.

Compared to expert-based descriptions

Empirical results

Historical description 1

Research on brain and spinal cord depreciated,

2

The community started to enquire relationships between signal, pathway, and receptors,

3

Mouse-related research is stable, yet significant stress on human-related topics & new relationship to homologous genes and vertebrates: growing focus on differential studies.

Compared to expert-based descriptions

Describing the structure of epistemic networks

Regarding SNA, applied epistemology, scientometrics... 1

hierarchically organized, overlapping communities

2

evolving partial taxonomies

3

binding automatically semantic content to agents,

4

underlining the importance of non-single-network properties