TriFMatch: a flash subgraph matching algorithm with effective filtering techniques
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Subgraph matching, as a fundamental operation for graph analysis, aims to extract all subgraphs in a data graph that are identical to a query graph. This task is computationally expensive due to its vast search space. However, existing optimizations face limitations, including insufficient branch pruning, low compression ratios, and over-pruning risks. To overcome these limitations, we propose TriFMatch, featuring three optimizations: (1) a novel query compression technique using successor equivalency, which generalizes existing methods based on neighbor equivalency; (2) a failure-driven filter; and (3) a containment-driven filter. These filters leverage dynamically computed candidates to maximize pruning efficiency. We theoretically and experimentally prove the correctness of all our methods. Experimental results reveal that TriFMatch outperforms state-of-the-art methods on both large query and data graphs, achieving speedups of up to $660\times$. Additionally, on standard datatsets from prior studies, TriFMatch successfully solves all queries within a limited time frame.