CHIP: Channel-wise Disentangled Interpretation of Deep Convolutional Neural Networks

Xinrui Cui; Dan Wang; Z. Jane Wang

チップ：深い畳み込みニューラルネットワークのチャネルごとの解きほぐし

ディープコンボリューショナルニューラルネットワーク（DCNN）の広範なアプリケーションでは、DCNNが正確な予測を行うだけでなく、決定方法を説明することもますます重要になっています。この作業では、DCNNの予測を視覚的に解釈するために、チャネルごとに解かれたInterPretation（CHIP）モデルを提案します。提案されたモデルは、スパース正則化を利用することにより、ネットワーク内のチャネルのクラス識別の重要性を抽出します。ここでは、最初にネットワーク摂動法を導入してモデルを学習します。提案されたモデルは、ネットワークからグローバルな視点の知識を抽出するだけでなく、ネットワークの特定の予測に対するクラス差別的な視覚的解釈も提示できます。提案されたモデルが再トレーニングなしでネットワークの異なる層を解釈できることは注目に値します。さまざまなレイヤーで蒸留された解釈の知識を組み合わせることで、高解像度でクラスを区別する洗練されたCHIP視覚的解釈をさらに提案します。標準データセットの実験結果は、提案されたモデルが、既存の視覚的解釈方法と比較して、画像分類タスクにおけるネットワークの予測に有望な視覚的解釈を提供することを示しています。その上、提案された方法は、ILSVRC 2015の弱監視ローカリゼーションタスクのアプリケーションで関連するアプローチよりも優れています。

With the widespread applications of deep convolutional neural networks (DCNNs), it becomes increasingly important for DCNNs not only to make accurate predictions but also to explain how they make their decisions. In this work, we propose a CHannel-wise disentangled InterPretation (CHIP) model to give the visual interpretation to the predictions of DCNNs. The proposed model distills the class-discriminative importance of channels in networks by utilizing the sparse regularization. Here, we first introduce the network perturbation technique to learn the model. The proposed model is capable to not only distill the global perspective knowledge from networks but also present the class-discriminative visual interpretation for specific predictions of networks. It is noteworthy that the proposed model is able to interpret different layers of networks without re-training. By combining the distilled interpretation knowledge in different layers, we further propose the Refined CHIP visual interpretation that is both high-resolution and class-discriminative. Experimental results on the standard dataset demonstrate that the proposed model provides promising visual interpretation for the predictions of networks in image classification task compared with existing visual interpretation methods. Besides, the proposed method outperforms related approaches in the application of ILSVRC 2015 weakly-supervised localization task.

updated: Fri Dec 13 2019 19:01:48 GMT+0000 (UTC)

published: Thu Feb 07 2019 07:17:03 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト