Fast Graph Sampling for Short Video Summarization using Gershgorin Disc Alignment

10/21/2021
βˆ™
by   Sadid Sahami, et al.
βˆ™
0
βˆ™

We study the problem of efficiently summarizing a short video into several keyframes, leveraging recent progress in fast graph sampling. Specifically, we first construct a similarity path graph (SPG) 𝒒, represented by graph Laplacian matrix 𝐋, where the similarities between adjacent frames are encoded as positive edge weights. We show that maximizing the smallest eigenvalue Ξ»_min(𝐁) of a coefficient matrix 𝐁 = diag(𝐚) + μ𝐋, where 𝐚 is the binary keyframe selection vector, is equivalent to minimizing a worst-case signal reconstruction error. We prove that, after partitioning 𝒒 into Q sub-graphs {𝒒^q}^Q_q=1, the smallest Gershgorin circle theorem (GCT) lower bound of Q corresponding coefficient matrices – min_q Ξ»^-_min(𝐁^q) – is a lower bound for Ξ»_min(𝐁). This inspires a fast graph sampling algorithm to iteratively partition 𝒒 into Q sub-graphs using Q samples (keyframes), while maximizing Ξ»^-_min(𝐁^q) for each sub-graph 𝒒^q. Experimental results show that our algorithm achieves comparable video summarization performance as state-of-the-art methods, at a substantially reduced complexity.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset