Open positions
Open research positions in SNAP group are available here.

Higher-order network structure of genes

Dataset information

This is a dataset of network structural features describing genes. The dataset contains information on network motif counts and on significance analysis of the motifs.

Network motifs are subgraphs that recur within disease pathways. This dataset contains information on graphlets, connected non-isomorphic induced subgraphs. There are 30 possible graphlets of size 2 to 5 nodes. The simplest graphlet is just two nodes connected by an edge, and the most complex graphlet is a clique of size 5. By taking into account the symmetries between nodes in a graphlet, there are 73 different positions or orbits for 2-5-node graphlets.

References

Files

File Size Description
G-MtfPathways_gene-motifs.csv.gz 7.8MB Network motifs of genes (feature table)