Clustering With Pairwise Relationships: A Generative Approach

05/06/2018
by   Yen-Yun Yu, et al.
0

Semi-supervised learning (SSL) has become important in current data analysis applications, where the amount of unlabeled data is growing exponentially and user input remains limited by logistics and expense. Constrained clustering, as a subclass of SSL, makes use of user input in the form of relationships between data points (e.g., pairs of data points belonging to the same class or different classes) and can remarkably improve the performance of unsupervised clustering in order to reflect user-defined knowledge of the relationships between particular data points. Existing algorithms incorporate such user input, heuristically, as either hard constraints or soft penalties, which are separate from any generative or statistical aspect of the clustering model; this results in formulations that are suboptimal and not sufficiently general. In this paper, we propose a principled, generative approach to probabilistically model, without ad hoc penalties, the joint distribution given by user-defined pairwise relations. The proposed model accounts for general underlying distributions without assuming a specific form and relies on expectation-maximization for model fitting. For distributions in a standard form, the proposed approach results in a closed-form solution for updated parameters.

READ FULL TEXT
research
09/23/2016

Constraint-Based Clustering Selection

Semi-supervised clustering methods incorporate a limited amount of super...
research
09/06/2022

Semi-Supervised Clustering via Dynamic Graph Structure Learning

Most existing semi-supervised graph-based clustering methods exploit the...
research
05/17/2023

RelationMatch: Matching In-batch Relationships for Semi-supervised Learning

Semi-supervised learning has achieved notable success by leveraging very...
research
09/22/2022

One-Shot Federated Learning for Model Clustering and Learning in Heterogeneous Environments

We propose a communication efficient approach for federated learning in ...
research
04/20/2020

Local Clustering with Mean Teacher for Semi-supervised Learning

The Mean Teacher (MT) model of Tarvainen and Valpola has shown favorable...
research
09/22/2011

Exhaustive and Efficient Constraint Propagation: A Semi-Supervised Learning Perspective and Its Applications

This paper presents a novel pairwise constraint propagation approach by ...
research
12/29/2022

PCCC: The Pairwise-Confidence-Constraints-Clustering Algorithm

We consider a semi-supervised k-clustering problem where information is ...

Please sign up or login with your details

Forgot password? Click here to reset