An Image Clustering Auto-Encoder Based on Predefined Evenly-Distributed Class Centroids and MMD Distance

Qiuyu Zhu; Zhengyong Wang

事前定義された均等分布クラス重心とMMD距離に基づく画像クラスタリング自動エンコーダ

この論文では、新しい、効果的でシンプルなエンドツーエンドの画像クラスタリング自動エンコーダアルゴリズムICAEを提案します。アルゴリズムは、クラスタリングセンターとしてPEDCC（Predefined Evenly-Distributed Class Centroids）を使用します。これにより、潜在フィーチャのクラス間距離が最大になり、データ分散制約、データ拡張制約、自動エンコーダ再構築制約、Sobelスムース制約が追加されます。クラスタリングのパフォーマンスを改善します。具体的には、1対1のデータ増強を実行して、より効果的な機能を学習します。データと拡張データはオートエンコーダに同時に入力され、潜在的な特徴と、類似性が拡張損失によって制約される拡張された潜在的な特徴を取得します。次に、最大平均不一致距離（MMD）を使用して、潜在的な特徴と拡張された潜在的な特徴を組み合わせて、それらの分布をPEDCC分布（クラス間の均一分布、クラス内のディラック分布）に近づけ、クラスタリング指向をさらに学習します特徴。同時に、元の入力画像と再構成画像のMSEが再構成制約として使用され、Sobel平滑化損失が一般化制約を構築して一般化能力を改善します。最後に、MNIST、Fashion-MNIST、COIL20の3つの一般的なデータセットに関する広範な実験が行われます。実験結果は、アルゴリズムがこれまでで最高のクラスタリング結果を達成したことを示しています。さらに、定義済みのPEDCCクラスセンターとデコーダーを使用して、各クラスのサンプルを明確に生成できます。コードはhttps://github.com/zyWang-Power/Clustering！からダウンロードできます！

In this paper, we propose a novel, effective and simpler end-to-end image clustering auto-encoder algorithm: ICAE. The algorithm uses PEDCC (Predefined Evenly-Distributed Class Centroids) as the clustering centers, which ensures the inter-class distance of latent features is maximal, and adds data distribution constraint, data augmentation constraint, auto-encoder reconstruction constraint and Sobel smooth constraint to improve the clustering performance. Specifically, we perform one-to-one data augmentation to learn the more effective features. The data and the augmented data are simultaneously input into the autoencoder to obtain latent features and the augmented latent features whose similarity are constrained by an augmentation loss. Then, making use of the maximum mean discrepancy distance (MMD), we combine the latent features and augmented latent features to make their distribution close to the PEDCC distribution (uniform distribution between classes, Dirac distribution within the class) to further learn clustering-oriented features. At the same time, the MSE of the original input image and reconstructed image is used as reconstruction constraint, and the Sobel smooth loss to build generalization constraint to improve the generalization ability. Finally, extensive experiments on three common datasets MNIST, Fashion-MNIST, COIL20 are conducted. The experimental results show that the algorithm has achieved the best clustering results so far. In addition, we can use the predefined PEDCC class centers, and the decoder to clearly generate the samples of each class. The code can be downloaded at https://github.com/zyWang-Power/Clustering!

updated: Sat Jan 04 2020 07:29:28 GMT+0000 (UTC)

published: Mon Jun 10 2019 11:28:15 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト