Open positions
Our group has an open research position for the summer quarter. More info here.

Patent citation network

Dataset information

U.S. patent dataset is maintained by the National Bureau of Economic Research. The data set spans 37 years (January 1, 1963 to December 30, 1999), and includes all the utility patents granted during that period, totaling 3,923,922 patents. The citation graph includes all citations made by patents granted between 1975 and 1999, totaling 16,522,438 citations. For the patents dataset there are 1,803,511 nodes for which we have no information about their citations (we only have the in-links).

The data was originally released by NBER.

Dataset statistics
Nodes 3774768
Edges 16518948
Nodes in largest WCC 3764117 (0.997)
Edges in largest WCC 16511741 (1.000)
Nodes in largest SCC 1 (0.000)
Edges in largest SCC 0 (0.000)
Average clustering coefficient 0.0757
Number of triangles 7515023
Fraction of closed triangles 0.02343
Diameter (longest shortest path) 22
90-percentile effective diameter 9.4

Source (citation)


Files

File Description
cit-Patents.txt.gz US Patent citation network 1975-1999
NBER Patents Complete US Patent data (includes time, classification, and patent invernetor data)