The effectiveness of feature attribution methods and its correlation with automatic evaluation scores

Giang Nguyen; Daeyoung Kim; Anh Nguyen

機能帰属方法の有効性と自動評価スコアとの相関

人工知能（AI）モデルの決定を説明することは、多くの現実世界のハイステークアプリケーションでますます重要になっています。何百もの論文が、新しい機能の帰属方法を提案し、それらのツールを議論または活用して作業を行っています。ただし、人間がターゲットエンドユーザーであるにもかかわらず、ほとんどの帰属方法はプロキシ自動評価メトリックでのみ評価されました（Zhang et al.2018; Zhou et al.2016; Petsiuk et al.2018）。この論文では、ImageNet分類とStanford Dogsのきめ細かい分類で、画像が自然または敵対的である（つまり、敵対的摂動を含む）場合に、アトリビューションマップの有効性を測定する最初のユーザー調査を実施します。全体として、特徴の帰属は、驚くべきことに、人間に最も近いトレーニングセットの例を示すよりも効果的ではありません。きめ細かい犬の分類という難しい作業では、アトリビューションマップを人間に提示することは役に立ちませんが、代わりにAIだけの場合と比較して人間のAIチームのパフォーマンスを損ないます。重要なのは、自動アトリビューションマップ評価手段が実際の人間AIチームのパフォーマンスとあまり相関していないことを発見したことです。私たちの調査結果は、コミュニティがダウンストリームのヒューマンインザループアプリケーションでメソッドを厳密にテストし、既存の評価指標を再考することを奨励しています。

Explaining the decisions of an Artificial Intelligence (AI) model is increasingly critical in many real-world, high-stake applications. Hundreds of papers have either proposed new feature attribution methods, discussed or harnessed these tools in their work. However, despite humans being the target end-users, most attribution methods were only evaluated on proxy automatic-evaluation metrics (Zhang et al. 2018; Zhou et al. 2016; Petsiuk et al. 2018). In this paper, we conduct the first user study to measure attribution map effectiveness in assisting humans in ImageNet classification and Stanford Dogs fine-grained classification, and when an image is natural or adversarial (i.e., contains adversarial perturbations). Overall, feature attribution is surprisingly not more effective than showing humans nearest training-set examples. On a harder task of fine-grained dog categorization, presenting attribution maps to humans does not help, but instead hurts the performance of human-AI teams compared to AI alone. Importantly, we found automatic attribution-map evaluation measures to correlate poorly with the actual human-AI team performance. Our findings encourage the community to rigorously test their methods on the downstream human-in-the-loop applications and to rethink the existing evaluation metrics.

updated: Fri Jul 22 2022 05:44:50 GMT+0000 (UTC)

published: Mon May 31 2021 13:23:50 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト