Saliency is a Possible Red Herring When Diagnosing Poor Generalization

Joseph D. Viviano; Becks Simpson; Francis Dutil; Yoshua Bengio; Joseph Paul Cohen

Poor generalization is one symptom of models that learn to predict target variables from spuriously correlated image features present only in the training distribution, rather than from the true image features that denote a class. It is often thought that this can be diagnosed visually using attribution (aka saliency) maps. We investigate whether this assumption is correct. In some prediction tasks, such as those involving medical images, some images may come with masks drawn by a human expert that indicate the region of the image containing the information relevant to the prediction. We study multiple methods that take advantage of such auxiliary labels by training networks to ignore distracting features that may be found outside the region of interest. This mask information is used only during training, and its impact on generalization accuracy depends on the severity of the shift between the training and test distributions. Surprisingly, while these methods improve generalization performance in the presence of a covariate shift, there is no strong correspondence between the correction of attribution towards the features a human expert has labelled as important and generalization performance. These results suggest that the root cause of poor generalization may not always be spatially defined, and raise questions about the utility of masks as "attribution priors" as well as of saliency maps for explainable predictions.
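
To make the two quantities discussed above concrete, here is a minimal sketch in Python with NumPy. It is an illustration under stated assumptions, not the authors' implementation: the function names `attribution_in_mask` and `outside_mask_penalty` are hypothetical, and the squared input-gradient penalty is only one common way to express an "attribution prior"; the abstract does not say which methods the paper studies. The first function measures how much of a saliency map falls inside an expert mask, and the second is a term that could be added to a training loss to discourage attribution outside that mask.

```python
import numpy as np


def attribution_in_mask(saliency, mask):
    """Fraction of total absolute attribution that lies inside the mask.

    saliency: array of per-pixel attributions (any sign).
    mask: array of the same shape, 1 inside the expert region, 0 outside.
    """
    saliency = np.abs(np.asarray(saliency, dtype=float))
    mask = np.asarray(mask, dtype=float)
    total = saliency.sum()
    if total == 0.0:
        # No attribution anywhere; report 0 rather than dividing by zero.
        return 0.0
    return float((saliency * mask).sum() / total)


def outside_mask_penalty(input_grad, mask):
    """Sum of squared input gradients outside the mask.

    Assumed form of an attribution prior: it is zero when the model's
    input gradient is confined to the expert region.
    """
    input_grad = np.asarray(input_grad, dtype=float)
    mask = np.asarray(mask, dtype=float)
    return float((((1.0 - mask) * input_grad) ** 2).sum())


# Toy example (made-up values): for a linear model, the gradient of the
# output with respect to the input is simply the weight image.
mask = np.zeros((4, 4))
mask[1:3, 1:3] = 1.0            # expert marks the central 2x2 region
weights = np.full((4, 4), 0.5)  # model attends uniformly to every pixel

in_mask_fraction = attribution_in_mask(weights, mask)  # 4 of 16 pixels: 0.25
penalty = outside_mask_penalty(weights, mask)          # 12 * 0.5**2 = 3.0
```

In a training loop, a term like `penalty` would typically be weighted and added to the classification loss. The finding reported in the abstract is that raising a quantity like `in_mask_fraction` did not correspond strongly with better generalization.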

updated: Wed Feb 10 2021 16:40:27 GMT+0000 (UTC)

published: Tue Oct 01 2019 04:29:18 GMT+0000 (UTC)

arXiv
