Algorithms for Discrepancy, Matchings, and Approximations: Fast, Simple, and Practical
We study one of the key tools in data approximation and optimization: low-discrepancy colorings. Formally, given a finite set system (X,đŽ), the discrepancy of a two-coloring Ī:Xâ{-1,1} is defined as max_S âđŽ|Ī(S)|, where Ī(S)=â_x â SĪ(x). We propose a randomized algorithm which, for any d>0 and (X,đŽ) with dual shatter function Ī^*(k)=O(k^d), returns a coloring with expected discrepancy O(â(|X|^1-1/dlog|đŽ|)) (this bound is tight) in time Ã(|đŽ|¡|X|^1/d+|X|^2+1/d), improving upon the previous-best time of O(|đŽ|¡|X|^3) by at least a factor of |X|^2-1/d when |đŽ|âĨ|X|. This setup includes many geometric classes, families of bounded dual VC-dimension, and others. As an immediate consequence, we obtain an improved algorithm to construct Îĩ-approximations of sub-quadratic size. Our method uses primal-dual reweighing with an improved analysis of randomly updated weights and exploits the structural properties of the set system via matchings with low crossing number â a fundamental structure in computational geometry. In particular, we get the same |X|^2-1/d factor speed-up on the construction time of matchings with crossing number O(|X|^1-1/d), which is the first improvement since the 1980s. The proposed algorithms are very simple, which makes it possible, for the first time, to compute colorings with near-optimal discrepancies and near-optimal sized approximations for abstract and geometric set systems in dimensions higher than 2.
READ FULL TEXT