Decoupling Deep Learning for Interpretable Image Recognition

Yitao Peng; Yihang Liu; Longzhen Yang; Lianghua He

解釈可能な画像認識のための深層学習の分離

ニューラルネットワークの解釈可能性は、最近大きな注目を集めています。以前のプロトタイプベースの説明可能なネットワークでは、推論と解釈の両方のプロセスでプロトタイプのアクティベーションが行われ、プロトタイプに特定の説明可能な構造が必要でした。したがって、この問題を回避するために、デカップリングプロトタイプネットワーク (DProtoNet) が提案されました。この新しいモデルには、エンコーダー、推論、および解釈モジュールが含まれています。エンコーダモジュールに関しては、表現力豊かな機能とプロトタイプを生成するために、無制限の機能マスクが提示されました。推論モジュールに関しては、ネットワークが一般化されたプロトタイプを学習できるように、プロトタイプを更新するためにマルチイメージプロトタイプ学習方法が導入されました。最後に、解釈モジュールに関して、ネットワークの検出ノードで元の画像とマスク画像の一貫した活性化を使用してヒートマップを生成するニューラルネットワークを説明するために、複数の動的マスク (MDM) デコーダーが提案されました。ニューラルネットワークの精度と解釈可能性を同時に改善するために、ネットワークの決定を説明するためにプロトタイプアクティベーションの使用を避けることにより、プロトタイプベースのネットワークの推論モジュールと解釈モジュールを分離します。複数の公開された一般データセットと医療データセットがテストされ、その結果、私たちの方法が以前の方法と比較して精度と最先端の解釈可能性で 5% の改善を達成できることが確認されました。

The interpretability of neural networks has recently received extensive attention. Previous prototype-based explainable networks involved prototype activation in both reasoning and interpretation processes, requiring specific explainable structures for the prototype, thus making the network less accurate as it gains interpretability. Therefore, the decoupling prototypical network (DProtoNet) was proposed to avoid this problem. This new model contains encoder, inference, and interpretation modules. As regards the encoder module, unrestricted feature masks were presented to generate expressive features and prototypes. Regarding the inference module, a multi-image prototype learning method was introduced to update prototypes so that the network can learn generalized prototypes. Finally, concerning the interpretation module, a multiple dynamic masks (MDM) decoder was suggested to explain the neural network, which generates heatmaps using the consistent activation of the original image and mask image at the detection nodes of the network. It decouples the inference and interpretation modules of a prototype-based network by avoiding the use of prototype activation to explain the network's decisions in order to simultaneously improve the accuracy and interpretability of the neural network. The multiple public general and medical datasets were tested, and the results confirmed that our method could achieve a 5% improvement in accuracy and state-of-the-art interpretability compared with previous methods.

updated: Sat Nov 19 2022 15:51:59 GMT+0000 (UTC)

published: Sat Oct 15 2022 17:05:55 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト