Fine-Grained Neural Network Explanation by Identifying Input Features with Predictive Information

Yang Zhang; Ashkan Khakzar; Yawei Li; Azade Farshad; Seong Tae Kim; Nassir Navab

One principal approach to illuminating a black-box neural network is feature attribution, i.e., identifying the importance of input features for the network's prediction. The predictive information of features has recently been proposed as a proxy for their importance. So far, predictive information has been identified only for latent features, by placing an information bottleneck within the network. We propose a method to identify features with predictive information in the input domain. The method identifies the information carried by input features at a fine-grained level and is agnostic to network architecture. The core idea of our method is to leverage a bottleneck on the input that lets only input features associated with predictive latent features pass through. We compare our method with several feature attribution methods using mainstream feature attribution evaluation experiments. The code is publicly available.
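To make the idea of a bottleneck on the input more concrete, the following is a minimal sketch, not the authors' implementation. It assumes a generic noise-injection bottleneck in which a per-feature mask in [0, 1] blends each input feature with Gaussian noise; the abstract does not specify the exact form, and the names `input_bottleneck`, `mask`, and `noise_std` are hypothetical. In an attribution setting the mask would be optimized, whereas here it is simply fixed by hand.

```python
import random


def input_bottleneck(x, mask, rng, noise_std=1.0):
    """Blend each input feature with Gaussian noise (assumed bottleneck form).

    A mask value of 1 lets the feature pass unchanged; a value of 0
    replaces it entirely with noise, so no information about it survives.
    """
    if len(x) != len(mask):
        raise ValueError("x and mask must have the same length")
    out = []
    for value, m in zip(x, mask):
        m = min(1.0, max(0.0, m))          # keep the mask inside [0, 1]
        noise = rng.gauss(0.0, noise_std)  # replacement noise for this feature
        out.append(m * value + (1.0 - m) * noise)
    return out


rng = random.Random(0)
x = [0.5, -1.2, 3.0, 0.7]
mask = [1.0, 0.0, 1.0, 0.0]  # hand-picked here; it would be learned in practice
z = input_bottleneck(x, mask, rng)
```

Features whose mask is 1 reach the network intact, while the others are replaced by noise, so the mask itself can be read as a per-feature attribution map under these assumptions.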

updated: Tue Dec 07 2021 22:26:38 GMT+0000 (UTC)

published: Mon Oct 04 2021 14:13:42 GMT+0000 (UTC)

arXiv
