slides

Aug 29, 2009 - (“health insurance”, “climate change”, “national security”, “super Tuesday” .... common “resource” and a set of citation links between blogs.
1MB taille 6 téléchargements 371 vues
The “blogosphere” as a socio-semantic network

Link creation dynamics

Diffusion dynamics

Socio-semantic dynamics in a blog network

Jean-Philippe Cointet CREA (CNRS/EP, France)

AND

Camille Roth CAMS (CNRS/EHESS, France)

IEEE S OCIAL C OM 09, VANCOUVER , BC – AUG 29–31, 2009

The “blogosphere” as a socio-semantic network

Link creation dynamics

Diffusion dynamics

A SOCIAL network Three kinds of links for each blog... citation: post citation links

citation link

interaction: comment links affiliation: blogroll links ...where contents circulate in terms of topics (W) in terms of cultural items (U)

Dataset: US blogosphere scope: 4 months of ’08 campaign network: citations nodes: 1, 066 blogs (RTGI)

comment link

blogroll link

The “blogosphere” as a socio-semantic network

Link creation dynamics

Diffusion dynamics

A socio-SEMANTIC network Three kinds of links for each blog... citation: post citation links interaction: comment links affiliation: blogroll links ...where contents circulate in terms of topics (W) in terms of cultural items (U)

Dataset: US blogosphere

semantic characterization

scope: 4 months of ’08 campaign

“relevant” syntagms

network: citations

(“health insurance”, “climate change”, “national

nodes: 1, 066 blogs (RTGI)

security”, “super Tuesday”, “human rights”...)

urls:

“www.youtube.com/x1hqwkeac”, etc.

The “blogosphere” as a socio-semantic network

Link creation dynamics

Diffusion dynamics

A DYNAMIC socio-semantic network Three kinds of links for each blog... citation: post citation links interaction: comment links affiliation: blogroll links ...where contents circulate in terms of topics (W) in terms of cultural items (U)

Dataset: US blogosphere scope: 4 months of ’08 campaign network: citations nodes: 1, 066 blogs (RTGI)

http://presidentialwatch08.com/

The “blogosphere” as a socio-semantic network

Link creation dynamics

Diffusion dynamics

Socio-semantic configuration 0.06 super tuesday michigan california huckabee

frequency of terms

0.05

0.04

0.03

0.02

0.01

0

Jan 1

Michigan Primary

time

a

20/02 20/02 19/02 c

b

Super Tuesday

26/02

d

feb 17

The “blogosphere” as a socio-semantic network

Link creation dynamics

Diffusion dynamics

Socio-semantic configuration

semantic profile of a blog i:

0.2 0.18

Wi (w) ˆ i (w) := W P|W| Wi (w) w=1

0.14

|B|

|{j, Wj (w) > 0}|

0.12

P (δ)

· log

0.16

0.1 0.08 0.06 0.04 0.02

semantic distance between blogs i and j: δ(i, j) = 1 −

0

[0;.1[ [.1;.2[ [.2;.3[ [.3;.4[ [.4;.5[ [.5;.6[ [.6;.7[ [.7;.8[ [.8;.9[ [.9;1]

δ

ˆ i ·W ˆj W ˆ ˆ j" "Wi ""W

Semantic distance distributions. Triangles: computed over the whole set of possible blog pairs. Crosses: distribution computed on linked blogs.

The “blogosphere” as a socio-semantic network

Link creation dynamics

Diffusion dynamics

Computing link creation propensity

→ estimate the “propensity of interaction” ...that it is more or less likely for a node (or a dyad) with property “m” to receive a link ...which may be simply estimated by: ˆf (m) = ν(m) N(m) ν(m) = number of links pointing towards an agent of type m

(resp. number of new dyads of type m) during a time period,

N(m) = number of agents (resp. of dyads) of type m.

The “blogosphere” as a socio-semantic network

Link creation dynamics

Diffusion dynamics

Computing link creation propensity

ˆf (m) = ν(m) N(m)

1

10

fˆ(k)

→ estimate the “propensity of interaction” ...that it is more or less likely for a node (or a dyad) with property “m” to receive a link ...which may be simply estimated by:

0

10

0

50

100

k

150

ν(m) = number of links pointing towards an agent of type m

(resp. number of new dyads of type m) during a time period,

N(m) = number of agents (resp. of dyads) of type m.

200

The “blogosphere” as a socio-semantic network

Link creation dynamics

Diffusion dynamics

Dynamics of the social network

in-degree effects

→ increasing, plateauing topological distance effects

→ strong trend to repetition and local semantic distance

1

10

fˆ(k)

interaction

0

10

→ strong trend to homophily 0

primarily “social”? social distance and degree

50

100

k

150

200

The “blogosphere” as a socio-semantic network

Link creation dynamics

Diffusion dynamics

Dynamics of the social network

in-degree effects

→ increasing, plateauing

2

10

topological distance effects interaction

ˆ f (d)

→ strong trend to repetition and local

1

10

0

10

semantic distance

→ strong trend to homophily primarily “social”? social distance and degree

−1

10

−2

10

1

2

3

d

4

>4

The “blogosphere” as a socio-semantic network

Link creation dynamics

Diffusion dynamics

Dynamics of the social network

in-degree effects

→ increasing, plateauing

2

10

topological distance effects interaction

semantic distance

1

10

ˆ g(δ)

→ strong trend to repetition and local

0

10

→ strong trend to homophily primarily “social”? social distance and degree

−1

10

[0;.1] ].1;.2] ].2;.3] ].3;.4] ].4;.5] ].5;.6] ].6;.7] ].7;.8] ].8;.9] ].9;1]

δ

The “blogosphere” as a socio-semantic network

Link creation dynamics

Diffusion dynamics

Dynamics of the social network

in-degree effects

→ increasing, plateauing 0

10

topological distance effects interaction

semantic distance

→ strong trend to homophily primarily “social”? social distance and degree

−1

propension p(d, δ)

→ strong trend to repetition and local

10

−2

10

−3

10

−4

10

1 2 3 social distance d

>3

[0;0.2[

[0.8;1] [0.2;0.4[ [0.4;0.6[ [0.6;0.8[ semantic distance δ

The “blogosphere” as a socio-semantic network

Link creation dynamics

Diffusion dynamics

Dynamics of the social network

in-degree effects

→ increasing, plateauing 2

topological distance effects interaction

semantic distance

→ strong trend to homophily

propension p(d, k)

→ strong trend to repetition and local

10

1

10

0

10

−1

10

−2

10

1 2 3

primarily “social”? social distance and degree

>3 social distance d

0

50

100

social capital k

The “blogosphere” as a socio-semantic network

Link creation dynamics

Diffusion dynamics

Information flows: measures on the post network a

Dyadic measures: raw, weighted network, aggregated on 4 months

1

b

2

1 2

attentional matrix a... → and total attention αa = 5/6 detachment matrix

1

3

c

2

3 1 1 f 2

“edge range”: quantifying shortcuts

1 d

3

e

The “blogosphere” as a socio-semantic network

Link creation dynamics

Diffusion dynamics

Information flows: measures on the post network a

Dyadic measures: raw, weighted network, aggregated on 4 months

1/6 3/4 b

attentional matrix a... → and total attention αa = 5/6

1/4

2/3

1/3 2/6 c

1/5

detachment matrix 3/6

2/2 1/5 f 2/3

“edge range”: quantifying shortcuts

1/3 d

3/5

e

The “blogosphere” as a socio-semantic network

Link creation dynamics

Diffusion dynamics

Information flows: measures on the post network a

Dyadic measures:

6 4/3

raw, weighted network, aggregated on 4 months

b

4

3/2

3 3

attentional matrix a... → and total attention αa = 5/6

c

5

detachment matrix 2

1 5 f 3/2

“edge range”: quantifying shortcuts

3 d

5/3

e

The “blogosphere” as a socio-semantic network

Link creation dynamics

Diffusion dynamics

Information flows: measures on the post network a

6

Dyadic measures: raw, weighted network, aggregated on 4 months

4/3

4

3/2

b 3

attentional matrix a... → and total attention αa = 5/6

c 1

f

detachment matrix 2

3/2

5

3

5 e

“edge range”: quantifying shortcuts d

5/3

The “blogosphere” as a socio-semantic network

Link creation dynamics

Diffusion dynamics

Information cascade Diffusion subgraphs 4

10

a

20/02

b

26/02

d

20/02 19/02 c

An example of diffusion subgraph, a common “resource” and a set of citation links between blogs

number of diffusion subgraphs

links nodes

3

10

2

10

1

10

0

10 0 10

1

10

size of diffusion subgraphs

⇒ heterogeneous cascade sizes

2

10

The “blogosphere” as a socio-semantic network

Link creation dynamics

Diffusion dynamics

An ego-centered perspective 









 

role of the total attention on the number of diffusion links



2

a bit more “global”...

Number of tranmissions

10

We focus on the total number of “transmissions” generated by blogs with a given total attention α

1

10

0

10

−1

10

−2

10

second transmissions: we focus on “later transmissions”, i.e. after a first transmission event

−1

10

0

10

Total Attention α

1

10

2

10

Larger active readership => larger number of diffusion links, yet not linearly

The “blogosphere” as a socio-semantic network

Link creation dynamics

Diffusion dynamics

An ego-centered perspective 









 

role of the total attention on the number of diffusion links



2

a bit more “global”... 









Number of tranmissions

10

We focus on the total number of “transmissions” generated by blogs with a given total attention α

1

10

0

10

−1

10  

−2

10

−1

10

0

10

Total Attention α

1

10

2

10



second transmissions: we focus on “later transmissions”, i.e. after a first transmission event

Larger active readership => larger number of diffusion links, yet not linearly

The “blogosphere” as a socio-semantic network

Link creation dynamics

Diffusion dynamics

→ role of edge-range on the number of grand-children We focus again on transmissions occurring after a first transmission event

a

20/02 20/02 19/02 c

b

26/02

d

Mean number of 2n d transmissions

A more global perspective 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0

0

10

20

30

40

50

60

edge range r

70

80

An information which has been transmitted through a “median” link generates a larger number of grandchildren

90

100

The “blogosphere” as a socio-semantic network

Link creation dynamics

Concluding remarks

Co-evolution of content and relationships Patterns not necessarily linked to authority only Patterns not necessarily ego-centered only → divergent from the “neighbor-based-influence” perspective

Diffusion dynamics

The “blogosphere” as a socio-semantic network

Link creation dynamics

Concluding remarks

Co-evolution of content and relationships Patterns not necessarily linked to authority only Patterns not necessarily ego-centered only → divergent from the “neighbor-based-influence” perspective Thank you! [email protected] & [email protected]

Diffusion dynamics