Strongly Augmented Contrastive Clustering

Xiaozhi Deng; Dong Huang; Ding-Hua Chen; Chang-Dong Wang; Jian-Huang Lai

Deep clustering has attracted increasing attention in recent years due to its capability of joint representation learning and clustering via deep neural networks. In its latest developments, contrastive learning has emerged as an effective technique to substantially enhance deep clustering performance. However, existing contrastive-learning-based deep clustering algorithms mostly focus on carefully designed augmentations (often with limited transformations to preserve the structure), referred to as weak augmentations, and do not go beyond them to explore the opportunities in stronger augmentations (with more aggressive transformations or even severe distortions). In this paper, we present an end-to-end deep clustering approach termed Strongly Augmented Contrastive Clustering (SACC), which extends the conventional two-augmentation-view paradigm to multiple views and jointly leverages strong and weak augmentations for strengthened deep clustering. In particular, we utilize a backbone network with triply-shared weights, where a strongly augmented view and two weakly augmented views are incorporated. Based on the representations produced by the backbone, the weak-weak view pair and the strong-weak view pairs are simultaneously exploited for instance-level contrastive learning (via an instance projector) and cluster-level contrastive learning (via a cluster projector); the two projectors, together with the backbone, can be jointly optimized in a purely unsupervised manner. Experimental results on five challenging image datasets show the superiority of our SACC approach over the state of the art. The code is available at https://github.com/dengxiaozhi/SACC.
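
The view-pairing scheme described in the abstract can be illustrated with a small sketch. This is not the authors' implementation (see the repository linked above for that); it is a minimal illustration under several assumptions. The function names `nt_xent`, `three_view_loss`, and `columns` are hypothetical, the loss is a standard NT-Xent (normalized temperature-scaled cross-entropy) loss swapped in as a stand-in for whatever contrastive loss the paper uses, the three pair losses are assumed to be summed without weights, and the inputs are assumed to be projector outputs given as plain lists of floats.

```python
import math


def _normalize(vector):
    """Scale a vector to unit length (left unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in vector)) or 1.0
    return [x / norm for x in vector]


def nt_xent(view_a, view_b, temperature=0.5):
    """Symmetric NT-Xent loss: view_a[i] and view_b[i] are a positive pair,
    and every other row in the two views acts as a negative."""
    n = len(view_a)
    z = [_normalize(v) for v in view_a + view_b]  # 2n unit vectors
    total = 0.0
    for i in range(2 * n):
        positive = (i + n) % (2 * n)  # index of the matching row in the other view
        sims = [
            sum(p * q for p, q in zip(z[i], z[j])) / temperature
            for j in range(2 * n)
            if j != i
        ]
        pos_sim = sum(p * q for p, q in zip(z[i], z[positive])) / temperature
        # Numerically stable log-sum-exp over all candidates except i itself.
        peak = max(sims)
        log_denominator = peak + math.log(sum(math.exp(s - peak) for s in sims))
        total += log_denominator - pos_sim
    return total / (2 * n)


def three_view_loss(weak_1, weak_2, strong, temperature=0.5):
    """Sum the loss over the weak-weak pair and the two strong-weak pairs.
    Assumption: unweighted sum; the abstract does not state the weighting."""
    pairs = [(weak_1, weak_2), (strong, weak_1), (strong, weak_2)]
    return sum(nt_xent(a, b, temperature) for a, b in pairs)


def columns(matrix):
    """Transpose a samples-by-clusters matrix so each row describes one cluster."""
    return [list(column) for column in zip(*matrix)]


# Toy instance-projector outputs for two samples under three augmented views.
weak_1 = [[1.0, 0.1], [0.1, 1.0]]
weak_2 = [[0.9, 0.2], [0.2, 0.9]]
strong = [[0.7, 0.4], [0.3, 0.8]]
instance_loss = three_view_loss(weak_1, weak_2, strong)

# Toy cluster-projector outputs (soft assignments); contrast their columns.
soft_1 = [[0.9, 0.1], [0.2, 0.8]]
soft_2 = [[0.8, 0.2], [0.1, 0.9]]
soft_s = [[0.7, 0.3], [0.3, 0.7]]
cluster_loss = three_view_loss(columns(soft_1), columns(soft_2), columns(soft_s))

print(instance_loss + cluster_loss)
```

Treating the columns of the soft-assignment matrices as cluster representations is a common choice in contrastive clustering and is assumed here; the abstract only states that cluster-level contrastive learning goes through a cluster projector. In a real training loop, all three views would pass through the same backbone before reaching the two projectors.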

updated: Thu Jul 14 2022 11:28:50 GMT+0000 (UTC)

published: Wed Jun 01 2022 10:30:59 GMT+0000 (UTC)

arXiv
