MAPPR (Motif-based Approximate Personalized PageRank) is an algorithmic framework for local higher-order clustering. Given any graph, a motif of interest, and a target node, it can find a local cluster around this node with minimal motif conductance.
Local graph clustering methods are attractive because they enable targeted clustering around a given seed node and are faster than traditional global graph clustering methods because their runtime does not depend on the size of the input graph. However, current local graph partitioning methods are not designed to account for the higher-order structures crucial to the network, nor can they effectively handle directed networks. In this project, we introduce a new class of local graph clustering methods that address these issues by incorporating higher-order network information captured by small subgraphs, also called network motifs. We develop the Motif-based Approximate Personalized PageRank (MAPPR) algorithm that finds clusters containing a seed node with minimal motif conductance. We also generalize existing theory to prove the fast running time (independent of the size of the graph) and obtain theoretical guarantees on the cluster quality.
Local Higher-Order Graph Clustering.
Hao Yin, Austin R. Benson, Jure Leskovec, and David F. Gleich.
To appear in KDD, 2017.
Our goal is to find a subset of nodes around the seed node with small motif conductance. The
motif conductances of a set S of nodes with respect to a motif M is
φM(S) = cutM(S, S) / min(volM(S),
volM(S)), where
cutM(S, S) is the
number of instances of motif M with at least one node in S and one in
S,
and volM(S) is the number of nodes in instances of M that reside in S.
Our MAPPR method for finding sets S with small motif conductance is: