Joint image clustering and self-supervised representation learning through debiased contrastive loss

Zheng, Shunjie-Fabian; Nam, Jaeeun; Baur, Simon; Wang, Mengyu; Zebardast, Nazlee; Elze, Tobias; Azizi, Shekoofeh; Bischl, Bernd; Rezaei, Mina; Eslami, Mohammad

doi:10.1117/12.3047438

2025

Conference Paper

Abstract

Joint self-supervised representation learning and image clustering have emerged as some of the most effective techniques for visual representation learning. However, existing methods often rely on artificially balanced datasets, raising concerns about their performance on imbalanced and long-tail data distributions. To address this challenge, we propose a novel framework that combines debiased self-supervised representation learning with joint clustering. By adapting the debiased contrastive loss, our approach mitigates the under-clustering of minority classes in imbalanced datasets. Furthermore, integrating the debiased contrastive loss with a divergence clustering loss significantly improves the quality of learned representations. We conducted extensive experiments on diverse datasets, including CIFAR-10, CIFAR-100, iNaturalist-2018, ISIC-2018 (skin lesions), and two ophthalmic retina fundus glaucoma datasets. Our framework was compared against state-of-the-art methods such as SimCLR, SimSiam, Debiased, and BNN, as well as other self-supervised and clustering algorithms. The results demonstrate that our method outperforms existing deep clustering, self-supervised, and semi-supervised techniques across various classification and clustering tasks, particularly on imbalanced and clinical datasets. These findings establish the effectiveness of our framework for representation learning under challenging data distributions, offering new insights into addressing imbalances in real-world applications.

Author(s)

Zheng, Shunjie-Fabian

Ludwig-Maximilians-Universität München

Nam, Jaeeun

Ludwig-Maximilians-Universität München

Baur, Simon

Fraunhofer-Institut für Nachrichtentechnik, Heinrich-Hertz-Institut HHI

Wang, Mengyu

Harvard Medical School

Zebardast, Nazlee

Harvard Medical School

Elze, Tobias

Harvard Medical School

Azizi, Shekoofeh

Google LLC

Bischl, Bernd

Ludwig-Maximilians-Universität München

Rezaei, Mina

Ludwig-Maximilians-Universität München

Eslami, Mohammad

Harvard Medical School

Mainwork

Medical Imaging 2025. Image Processing

Conference

Conference "Medical Imaging - Image Processing" 2025
