Open positions
We have filled all the positions for this quarter. More info.

Social circles: Twitter

Dataset information

This dataset consists of 'circles' (or 'lists') from Twitter. Twitter data was crawled from public sources. The dataset includes node features (profiles), circles, and ego networks.

Data is also available from Facebook and Google+.

Dataset statistics
Nodes 81306
Edges 1768149
Nodes in largest WCC 81306 (1.000)
Edges in largest WCC 1768149 (1.000)
Nodes in largest SCC 68413 (0.841)
Edges in largest SCC 1685163 (0.953)
Average clustering coefficient 0.5653
Number of triangles 13082506
Fraction of closed triangles 0.06415
Diameter (longest shortest path) 7
90-percentile effective diameter 4.5

Source (citation)


File Description
twitter.tar.gz Twitter data (973 networks)
twitter_combined.txt.gz Edges from all egonets combined
readme-Ego.txt Description of files