Counting Simplices in Hypergraph Streams

12/21/2021
by   Amit Chakrabarti, et al.
0

We consider the problem of space-efficiently estimating the number of simplices in a hypergraph stream. This is the most natural hypergraph generalization of the highly-studied problem of estimating the number of triangles in a graph stream. Our input is a k-uniform hypergraph H with n vertices and m hyperedges. A k-simplex in H is a subhypergraph on k+1 vertices X such that all k+1 possible hyperedges among X exist in H. The goal is to process a stream of hyperedges of H and compute a good estimate of T_k(H), the number of k-simplices in H. We design a suite of algorithms for this problem. Under a promise that T_k(H) ≥ T, our algorithms use at most four passes and together imply a space bound of O( ϵ^-2logδ^-1polylog n ·min{ m^1+1/k/T, m/T^2/(k+1)} ) for each fixed k ≥ 3, in order to guarantee an estimate within (1±ϵ)T_k(H) with probability at least 1-δ. We also give a simpler 1-pass algorithm that achieves O(ϵ^-2logδ^-1log n· (m/T) ( Δ_E + Δ_V^1-1/k )) space, where Δ_E (respectively, Δ_V) denotes the maximum number of k-simplices that share a hyperedge (respectively, a vertex). We complement these algorithmic results with space lower bounds of the form Ω(ϵ^-2), Ω(m^1+1/k/T), Ω(m/T^1-1/k) and Ω(mΔ_V^1/k/T) for multi-pass algorithms and Ω(mΔ_E/T) for 1-pass algorithms, which show that some of the dependencies on parameters in our upper bounds are nearly tight. Our techniques extend and generalize several different ideas previously developed for triangle counting in graphs, using appropriate innovations to handle the more complicated combinatorics of hypergraphs.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset