Representation Learning for Clustering via Building Consensus

4 May 2021  ·  Aniket Anand Deshmukh, Jayanth Reddy Regatti, Eren Manavoglu, Urun Dogan ·

In this paper, we focus on unsupervised representation learning for clustering of images. Recent advances in deep clustering and unsupervised representation learning are based on the idea that different views of an input image (generated through data augmentation techniques) must be close in the representation space (exemplar consistency), and/or similar images must have similar cluster assignments (population consistency). We define an additional notion of consistency, consensus consistency, which ensures that representations are learned to induce similar partitions for variations in the representation space, different clustering algorithms or different initializations of a single clustering algorithm. We define a clustering loss by executing variations in the representation space and seamlessly integrate all three consistencies (consensus, exemplar and population) into an end-to-end learning framework. The proposed algorithm, consensus clustering using unsupervised representation learning (ConCURL), improves upon the clustering performance of state-of-the-art methods on four out of five image datasets. Furthermore, we extend the evaluation procedure for clustering to reflect the challenges encountered in real-world clustering tasks, such as maintaining clustering performance in cases with distribution shifts. We also perform a detailed ablation study for a deeper understanding of the proposed algorithm. The code and the trained models are available at https://github.com/JayanthRR/ConCURL_NCE.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Result Benchmark
Image Clustering CIFAR-10 ConCURL Accuracy 0.846 # 13
NMI 0.762 # 12
Train set Train # 1
ARI 0.715 # 13
Image Clustering CIFAR-100 ConCURL Accuracy 0.479 # 9
NMI 0.468 # 8
Train Set Train # 1
ARI 0.303 # 9
Image Clustering ImageNet-10 ConCURL Accuracy 0.958 # 3
NMI 0.907 # 3
ARI 0.909 # 3
Image Clustering Imagenet-dog-15 ConCURL Accuracy 0.695 # 5
NMI 0.63 # 5
ARI 0.531 # 5
Image Clustering STL-10 ConCURL Accuracy 0.749 # 14
NMI 0.636 # 11
Train Split Train+Test # 1

Methods


No methods listed for this paper. Add relevant methods here