Unsupervised Part Mining for Fine-grained Image Classification

Runsheng Zhang; Jian Zhang; Yaping Huang; Qi Zou

Fine-grained image classification remains challenging because of large intra-class variance and small inter-class variance. Since the subtle visual differences among subcategories lie only in the local regions of discriminative parts, part localization is a key issue for fine-grained image classification. Most existing approaches localize objects or parts in an image using object or part annotations, which are expensive and labor-intensive to obtain. To tackle this issue, we propose a fully unsupervised part mining (UPM) approach that localizes the discriminative parts without even image-level annotations and substantially improves fine-grained classification performance. We first use pattern mining techniques to discover frequent patterns, i.e., co-occurring highlighted regions, in the feature maps extracted from a pre-trained convolutional neural network (CNN) model. Inspired by the fact that these relevant, meaningful patterns typically exhibit appearance and spatial consistency, we then cluster the mined regions to obtain cluster centers and generate the discriminative parts surrounding those centers. Importantly, our proposed part localization approach uses no annotations and no sophisticated training procedures. Finally, a multi-stream classification network is built to aggregate the original, object-level, and part-level features simultaneously. Compared with other state-of-the-art approaches, our UPM approach achieves competitive performance.
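The clustering step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not name the pattern mining or clustering algorithm, so the sketch assumes a simple activation threshold as a stand-in for pattern mining and plain k-means for clustering. The function names (`highlighted_positions`, `kmeans`, `part_box`) and all parameters are hypothetical.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def highlighted_positions(activation: List[List[float]], threshold: float) -> List[Point]:
    """Stand-in for pattern mining (assumption): keep positions above a threshold."""
    return [
        (float(r), float(c))
        for r, row in enumerate(activation)
        for c, value in enumerate(row)
        if value > threshold
    ]


def kmeans(points: List[Point], k: int, iters: int = 20) -> List[Point]:
    """Plain k-means (assumed clustering method) with a deterministic initialization."""
    ordered = sorted(points)
    step = len(ordered) / k
    # Initialize centers from evenly spaced points in sorted order.
    centers = [ordered[int(i * step + step / 2)] for i in range(k)]
    for _ in range(iters):
        groups: List[List[Point]] = [[] for _ in range(k)]
        for p in points:
            # Assign each point to its nearest center (squared Euclidean distance).
            nearest = min(
                range(k),
                key=lambda i: (p[0] - centers[i][0]) ** 2 + (p[1] - centers[i][1]) ** 2,
            )
            groups[nearest].append(p)
        # Recompute each center as the mean of its group; keep empty groups in place.
        updated = [
            (sum(p[0] for p in g) / len(g), sum(p[1] for p in g) / len(g)) if g else centers[i]
            for i, g in enumerate(groups)
        ]
        if updated == centers:
            break
        centers = updated
    return centers


def part_box(center: Point, half_size: int, height: int, width: int) -> Tuple[int, int, int, int]:
    """Return a (top, left, bottom, right) box around a center, clipped to the map."""
    r, c = center
    top = max(0, int(r - half_size))
    left = max(0, int(c - half_size))
    bottom = min(height, int(r + half_size) + 1)
    right = min(width, int(c + half_size) + 1)
    return top, left, bottom, right


# Toy 8x8 activation map with two highlighted 2x2 blobs.
activation = [[0.0] * 8 for _ in range(8)]
for r, c in [(1, 1), (1, 2), (2, 1), (2, 2), (5, 5), (5, 6), (6, 5), (6, 6)]:
    activation[r][c] = 1.0

positions = highlighted_positions(activation, threshold=0.5)
centers = kmeans(positions, k=2)
boxes = [part_box(center, half_size=1, height=8, width=8) for center in centers]
```

In this toy setting each cluster center falls in the middle of one blob, and the box around it plays the role of a candidate part region. In the approach the abstract describes, such regions would be cropped and fed to the part-level stream of the classification network.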

updated: Thu Mar 31 2022 13:20:38 GMT+0000 (UTC)

published: Tue Feb 26 2019 14:04:58 GMT+0000 (UTC)

arXiv
