Correlation Clustering with Asymmetric Classification Errors

08/11/2021
āˆ™
by   Jafar Jafarov, et al.
āˆ™
0
āˆ™

In the Correlation Clustering problem, we are given a weighted graph G with its edges labeled as "similar" or "dissimilar" by a binary classifier. The goal is to produce a clustering that minimizes the weight of "disagreements": the sum of the weights of "similar" edges across clusters and "dissimilar" edges within clusters. We study the correlation clustering problem under the following assumption: Every "similar" edge e has weight š°_eāˆˆ[Ī±š°, š°] and every "dissimilar" edge e has weight š°_eā‰„Ī±š° (where Ī±ā‰¤ 1 and š°>0 is a scaling parameter). We give a (3 + 2 log_e (1/Ī±)) approximation algorithm for this problem. This assumption captures well the scenario when classification errors are asymmetric. Additionally, we show an asymptotically matching Linear Programming integrality gap of Ī©(log 1/Ī±).

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset