Information based Deep Clustering: An experimental study

Jizong Peng; Christian Desrosiers; Marco Pedersoli

Recently, two methods have shown outstanding performance for clustering images and jointly learning the feature representation. The first, called Information Maximizing Self-Augmented Training (IMSAT), maximizes the mutual information between input and clusters while using a regularization term based on virtual adversarial examples. The second, named Invariant Information Clustering (IIC), maximizes the mutual information between the clustering of a sample and that of its geometrically transformed version. These methods use mutual information in distinct ways and leverage different kinds of transformations. This work proposes a comprehensive analysis of transformations and losses for deep clustering, in which we compare numerous combinations of these two components and evaluate how they interact with one another. Results suggest that mutual information between a sample and its transformed representation leads to state-of-the-art performance for deep clustering, especially when used jointly with geometrical and adversarial transformations.
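The two uses of mutual information described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names (`mi_input_cluster`, `mi_sample_transformed`), the `EPS` constant, and the assumption that the network outputs row-wise softmax probabilities of shape (N, K) are all hypothetical choices made for illustration, and the adversarial regularization term is omitted.

```python
import numpy as np

EPS = 1e-12  # assumed small constant to keep log() finite


def entropy(p, axis=-1):
    """Shannon entropy (in nats) along `axis`."""
    p = np.clip(p, EPS, 1.0)
    return -np.sum(p * np.log(p), axis=axis)


def mi_input_cluster(probs):
    """I(X; Y) = H(Y) - H(Y|X), estimated over a batch.

    probs: (N, K) cluster probabilities, one row per input sample.
    This is the input-to-cluster quantity (IMSAT-style).
    """
    marginal = probs.mean(axis=0)                 # batch estimate of p(y)
    return float(entropy(marginal) - entropy(probs, axis=1).mean())


def mi_sample_transformed(probs, probs_t):
    """Mutual information between the cluster assignments of samples
    and of their transformed versions (IIC-style).

    probs, probs_t: (N, K) probabilities for x and for its transform.
    """
    n = probs.shape[0]
    joint = probs.T @ probs_t / n                 # (K, K) joint estimate
    joint = (joint + joint.T) / 2.0               # symmetrize
    joint = np.clip(joint, EPS, None)
    joint = joint / joint.sum()                   # renormalize after clipping
    p_row = joint.sum(axis=1, keepdims=True)      # marginal of first view
    p_col = joint.sum(axis=0, keepdims=True)      # marginal of second view
    return float(np.sum(joint * (np.log(joint) - np.log(p_row) - np.log(p_col))))
```

In a training loop either quantity would be negated and minimized. With K clusters, both are bounded above by log K, which is reached when assignments are confident, balanced across clusters, and (for the second function) identical for a sample and its transform.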

updated: Wed Dec 11 2019 01:14:25 GMT+0000 (UTC)

published: Thu Oct 03 2019 18:07:57 GMT+0000 (UTC)

arXiv
