U.S. patent dataset is maintained by the National Bureau of Economic Research. The data set spans 37 years (January 1, 1963 to December 30, 1999), and includes all the utility patents granted during that period, totaling 3,923,922 patents. The citation graph includes all citations made by patents granted between 1975 and 1999, totaling 16,522,438 citations. For the patents dataset there are 1,803,511 nodes for which we have no information about their citations (we only have the in-links).
The data was originally released by NBER.
Dataset statistics | |
---|---|
Nodes | 3774768 |
Edges | 16518948 |
Nodes in largest WCC | 3764117 (0.997) |
Edges in largest WCC | 16511741 (1.000) |
Nodes in largest SCC | 1 (0.000) |
Edges in largest SCC | 0 (0.000) |
Average clustering coefficient | 0.0757 |
Number of triangles | 7515023 |
Fraction of closed triangles | 0.02343 |
Diameter (longest shortest path) | 22 |
90-percentile effective diameter | 9.4 |
File | Description |
---|---|
cit-Patents.txt.gz | US Patent citation network 1975-1999 |
NBER Patents | Complete US Patent data (includes time, classification, and patent invernetor data) |