From Heatmaps to Structural Explanations of Image Classifiers

Li Fuxin; Zhongang Qi; Saeed Khorram; Vivswan Shitole; Prasad Tadepalli; Minsuk Kahng; Alan Fern

ヒートマップから画像分類器の構造的説明まで

この論文は、私たちが得た否定的な結果と洞察を含めることを目的として、画像分類器を説明するという観点から、過去数年間の私たちの努力を要約しています。このホワイトペーパーでは、説明可能なニューラルネットワーク（XNN）について説明します。このニューラルネットワークは、人間の言語概念に依存することなく、純粋にディープネットワークからいくつかの高レベルの概念を抽出して視覚化しようとします。これは、ユーザーが直感的ではないネットワーク分類を理解するのに役立ち、カモメのさまざまな種を区別するという難しいきめ細かい分類タスクでのユーザーパフォーマンスを大幅に向上させます。重要な欠落部分が信頼性の高いヒートマップ視覚化ツールであることを認識し、統合された勾配を利用してヒートマップ生成の局所最適化を回避するI-GOSおよびiGOS ++を開発しました。これにより、すべての解像度でパフォーマンスが向上しました。これらの視覚化の開発中に、かなりの数の画像について、分類器には信頼できる予測に到達するための複数の異なるパスがあることに気付きました。これにより、構造化アテンショングラフ（SAG）が最近開発されました。これは、ビーム検索を利用して1つの画像の複数の粗いヒートマップを特定し、画像領域のさまざまな組み合わせが分類子。調査プロセスを通じて、深いネットワークの説明を構築する際の洞察、複数の説明の存在と頻度、および説明を機能させるさまざまな取引のトリックについて多くのことを学びました。この論文では、それらの洞察と意見を読者と共有し、それらのいくつかが説明可能な深層学習について将来の研究者に役立つことを期待しています。

This paper summarizes our endeavors in the past few years in terms of explaining image classifiers, with the aim of including negative results and insights we have gained. The paper starts with describing the explainable neural network (XNN), which attempts to extract and visualize several high-level concepts purely from the deep network, without relying on human linguistic concepts. This helps users understand network classifications that are less intuitive and substantially improves user performance on a difficult fine-grained classification task of discriminating among different species of seagulls. Realizing that an important missing piece is a reliable heatmap visualization tool, we have developed I-GOS and iGOS++ utilizing integrated gradients to avoid local optima in heatmap generation, which improved the performance across all resolutions. During the development of those visualizations, we realized that for a significant number of images, the classifier has multiple different paths to reach a confident prediction. This has lead to our recent development of structured attention graphs (SAGs), an approach that utilizes beam search to locate multiple coarse heatmaps for a single image, and compactly visualizes a set of heatmaps by capturing how different combinations of image regions impact the confidence of a classifier. Through the research process, we have learned much about insights in building deep network explanations, the existence and frequency of multiple explanations, and various tricks of the trade that make explanations work. In this paper, we attempt to share those insights and opinions with the readers with the hope that some of them will be informative for future researchers on explainable deep learning.

updated: Mon Sep 13 2021 23:39:57 GMT+0000 (UTC)

published: Mon Sep 13 2021 23:39:57 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト