Recipro-CAM: Fast gradient-free visual explanations for convolutional neural networks

Seok-Yong Byun; Wonju Lee

The convolutional neural network (CNN) is a widely used deep learning architecture for computer vision. However, its black-box nature makes the model's behavior difficult to interpret. To mitigate this issue, AI practitioners have explored explainable AI (XAI) methods such as Class Activation Map (CAM) and Grad-CAM. Although these methods have shown promise, they are limited by architectural constraints or by the burden of gradient computation. Score-CAM and Ablation-CAM have been proposed as gradient-free alternatives; they resolve the gradient-related issues and enable XAI in inference mode, but their execution times are longer than those of CAM- or Grad-CAM-based methods, which makes them unsuitable for real-world solutions. To address this challenge, we propose a fast gradient-free Reciprocal CAM (Recipro-CAM) method. Our approach spatially masks the extracted feature maps to exploit the correlation between activation maps and network predictions for target classes. The proposed method yields promising results, outperforming current state-of-the-art methods on the Average Drop-Coherence-Complexity (ADCC) metric by 1.78% to 3.72%, except on the VGG-16 backbone. Moreover, Recipro-CAM generates saliency maps at a rate similar to Grad-CAM and is approximately 148 times faster than Score-CAM. The source code for Recipro-CAM is available in our data analysis framework.
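The spatial-masking idea in the abstract can be illustrated with a small, gradient-free sketch. It is not the authors' implementation. It assumes that each mask keeps a single spatial position of the feature map and zeroes the rest, and that the target-class score of the masked features becomes the saliency at that position. The names `toy_head` and `masked_saliency` are hypothetical, and the pooling-plus-linear head stands in for whatever layers follow the feature extractor in a real network.

```python
import math

def toy_head(features, weights):
    """Hypothetical classifier head: global average pooling, linear layer, softmax.
    features: nested lists shaped [C][H][W]; weights: [num_classes][C]."""
    pooled = [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in features]
    logits = [sum(w * p for w, p in zip(ws, pooled)) for ws in weights]
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def masked_saliency(features, head, target):
    """Score each spatial position by the target-class probability obtained
    when only that position of the feature map is kept (assumed mask shape)."""
    height, width = len(features[0]), len(features[0][0])
    scores = [[0.0] * width for _ in range(height)]
    for i in range(height):
        for j in range(width):
            # Keep position (i, j) in every channel, zero everything else.
            masked = [[[ch[y][x] if (y, x) == (i, j) else 0.0
                        for x in range(width)]
                       for y in range(height)]
                      for ch in features]
            scores[i][j] = head(masked)[target]
    # Min-max normalise to [0, 1]; no gradients are needed at any point.
    lo = min(min(row) for row in scores)
    hi = max(max(row) for row in scores)
    span = hi - lo
    return [[(s - lo) / span if span > 0 else 0.0 for s in row] for row in scores]

# Toy input: channel 0 fires only at position (0, 1); class 0 reads channel 0.
features = [[[0.0, 5.0], [0.0, 0.0]],
            [[1.0, 1.0], [1.0, 1.0]]]
weights = [[1.0, 0.0], [0.0, 1.0]]
saliency = masked_saliency(features, lambda f: toy_head(f, weights), target=0)
print(saliency)
```

In this toy case the highest saliency falls on the position where the class-0 channel is active. A practical version would batch the masked feature maps through the network head in one forward pass rather than looping over positions.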

updated: Mon Mar 13 2023 02:00:41 GMT+0000 (UTC)

published: Wed Sep 28 2022 13:15:03 GMT+0000 (UTC)

arXiv
