Overinterpretation reveals image classification model pathologies

Brandon Carter; Siddhartha Jain; Jonas Mueller; David Gifford

Image classifiers are typically scored on their test set accuracy, but high accuracy can mask a subtle type of model failure. We find that high-scoring convolutional neural networks (CNNs) on popular benchmarks exhibit troubling pathologies that allow them to display high accuracy even in the absence of semantically salient features. When a model provides a high-confidence decision without salient supporting input features, we say the classifier has overinterpreted its input, finding too much class evidence in patterns that appear nonsensical to humans. Here, we demonstrate that neural networks trained on CIFAR-10 and ImageNet suffer from overinterpretation, and we find that models trained on CIFAR-10 make confident predictions even when 95% of each input image is masked and humans cannot discern salient features in the remaining pixel subsets. We introduce Batched Gradient SIS, a new method for discovering sufficient input subsets for complex datasets, and use this method to show the sufficiency of border pixels in ImageNet for training and testing. Although these patterns portend potential model fragility in real-world deployment, they are in fact valid statistical patterns of the benchmark that alone suffice to attain high test accuracy. Unlike adversarial examples, overinterpretation relies upon unmodified image pixels. We find ensembling and input dropout can each help mitigate overinterpretation.
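The operations named in the abstract (masking all but a small pixel subset, input dropout, and ensembling) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation and it is not Batched Gradient SIS: the per-pixel `scores` array is a hypothetical stand-in for whatever importance ranking is used, and the function names, the zero `fill_value`, and the idea of representing each model as a callable that returns class probabilities are all assumptions made for illustration.

```python
import numpy as np


def keep_top_pixels(image, scores, keep_fraction=0.05, fill_value=0.0):
    """Keep the highest-scoring pixels and replace the rest with fill_value.

    image:  array of shape (H, W, C)
    scores: array of shape (H, W); hypothetical per-pixel importance values
    Returns the masked image and the boolean (H, W) mask of kept pixels.
    """
    h, w = scores.shape
    n_keep = max(1, int(round(keep_fraction * h * w)))
    top_idx = np.argsort(scores.ravel())[::-1][:n_keep]  # top-scoring pixels
    mask = np.zeros(h * w, dtype=bool)
    mask[top_idx] = True
    mask = mask.reshape(h, w)
    # Broadcast the (H, W) mask across the channel axis.
    masked = np.where(mask[..., None], image, fill_value)
    return masked, mask


def input_dropout(image, drop_prob, rng, fill_value=0.0):
    """Independently drop each pixel (all channels) with probability drop_prob."""
    keep = rng.random(image.shape[:2]) >= drop_prob
    return np.where(keep[..., None], image, fill_value)


def ensemble_predict(prob_fns, image):
    """Average class probabilities over several models (assumed callables)."""
    return np.mean([fn(image) for fn in prob_fns], axis=0)


rng = np.random.default_rng(0)
image = rng.random((32, 32, 3))   # CIFAR-10-sized toy image
scores = rng.random((32, 32))     # placeholder importance scores
masked, mask = keep_top_pixels(image, scores, keep_fraction=0.05)
```

Under these assumptions, a classifier would be flagged as overinterpreting when it still returns a high-confidence prediction on `masked`, which retains only about 5% of the pixels, even though a human cannot recognize the class from that subset.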

updated: Tue Dec 07 2021 16:38:50 GMT+0000 (UTC)

published: Thu Mar 19 2020 17:12:23 GMT+0000 (UTC)

arXiv
