The Origins and Prevalence of Texture Bias in Convolutional Neural Networks

Katherine L. Hermann; Ting Chen; Simon Kornblith

畳み込みニューラルネットワークにおけるテクスチャバイアスの起源と普及

最近の研究では、人間とは異なり、ImageNetでトレーニングされたCNNは、画像を形状ではなくテクスチャで分類する傾向があることが示されています。このバイアスはどの程度蔓延しており、どこから来ているのでしょうか。形状とテクスチャが競合する画像のデータセットでトレーニングすると、CNNは少なくともテクスチャと同じくらい簡単に形状で分類することを学習します。では、ImageNetでトレーニングされたCNNでテクスチャバイアスを生成する要因は何ですか？教師なしトレーニングの目的やアーキテクチャが異なると、テクスチャバイアスのレベルに小さいながらも重要で、ほとんど独立した影響があります。ただし、形状情報が非表示の表現からデコード可能であっても、すべての目的とアーキテクチャは、ほとんどの場合、テクスチャベースの分類決定を行うモデルにつながります。データ拡張の効果ははるかに大きくなります。トレーニング時に攻撃性の低いランダムな切り抜きを取り、単純で自然な増強（色の歪み、ノイズ、ぼかし）を適用することで、ほとんどの場合、形状によってあいまいな画像を分類し、分布外テストでベースラインを上回るモデルをトレーニングします。セット。私たちの結果は、人間とImageNetでトレーニングされたCNNが画像を処理する方法の明らかな違いは、主に内部動作の違いからではなく、表示されるデータの違いから生じる可能性があることを示しています。

Recent work has indicated that, unlike humans, ImageNet-trained CNNs tend to classify images by texture rather than by shape. How pervasive is this bias, and where does it come from? We find that, when trained on datasets of images with conflicting shape and texture, CNNs learn to classify by shape at least as easily as by texture. What factors, then, produce the texture bias in CNNs trained on ImageNet? Different unsupervised training objectives and different architectures have small but significant and largely independent effects on the level of texture bias. However, all objectives and architectures still lead to models that make texture-based classification decisions a majority of the time, even if shape information is decodable from their hidden representations. The effect of data augmentation is much larger. By taking less aggressive random crops at training time and applying simple, naturalistic augmentation (color distortion, noise, and blur), we train models that classify ambiguous images by shape a majority of the time, and outperform baselines on out-of-distribution test sets. Our results indicate that apparent differences in the way humans and ImageNet-trained CNNs process images may arise not primarily from differences in their internal workings, but from differences in the data that they see.

updated: Tue Nov 03 2020 22:51:23 GMT+0000 (UTC)

published: Wed Nov 20 2019 18:16:38 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト