Self-supervised Document Clustering Based on BERT with Data Augment

11/17/2020
by   Haoxiang Shi, et al.
0

Contrastive learning is a good way to pursue discriminative unsupervised learning, which can inherit advantages and experiences of well-studied deep models without complexly novel model designing. In this paper, we propose two learning method for document clustering, the one is a partial contrastive learning with unsupervised data augment, and the other is a self-supervised contrastive learning. Both methods achieve state-of-the-art results in clustering accuracy when compared to recently proposed unsupervised clustering approaches.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset