Training Interpretable Convolutional Neural Networks by Differentiating Class-specific Filters

Haoyu Liang; Zhihao Ouyang; Yuyuan Zeng; Hang Su; Zihao He; Shu-Tao Xia; Jun Zhu; Bo Zhang

Convolutional neural networks (CNNs) have been successfully used in a range of tasks. However, CNNs are often viewed as "black boxes" and lack interpretability. One main reason is filter-class entanglement, an intricate many-to-many correspondence between filters and classes. Most existing works attempt post-hoc interpretation of a pre-trained model while neglecting to reduce the entanglement underlying the model. In contrast, we focus on alleviating filter-class entanglement during training. Inspired by cellular differentiation, we propose a novel strategy to train interpretable CNNs by encouraging class-specific filters, each of which responds to only one (or a few) classes. Concretely, we design a learnable sparse Class-Specific Gate (CSG) structure that flexibly assigns each filter to one (or a few) classes. The gate allows a filter's activation to pass only when the input sample comes from the filter's assigned class. Extensive experiments demonstrate that our method generates a sparse and highly class-related representation of the input, which leads to stronger interpretability. Moreover, compared with the standard training strategy, our model displays benefits in applications such as object localization and adversarial sample detection. Code link: https://github.com/hyliang96/CSGCNN.
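
The gating idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the gate's exact form, so the sketch assumes a fixed binary gate matrix with one row per class and one entry per filter, applied to a single scalar activation per filter. In the paper the gate is learnable and sparse. The names `apply_class_specific_gate` and `GATES` are hypothetical.

```python
from typing import List


def apply_class_specific_gate(
    activations: List[float],
    gate_matrix: List[List[float]],
    label: int,
) -> List[float]:
    """Scale each filter's activation by the gate entry for the sample's class.

    Assumption: gate_matrix[c][k] is 1.0 if filter k is assigned to class c
    and 0.0 otherwise. A learnable gate would hold values in [0, 1] instead.
    """
    gate_row = gate_matrix[label]
    if len(gate_row) != len(activations):
        raise ValueError("gate row and activations need one entry per filter")
    return [g * a for g, a in zip(gate_row, activations)]


# Hypothetical toy setup: 2 classes, 4 filters.
# Filters 0-1 are assigned to class 0, filters 2-3 to class 1.
GATES = [
    [1.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 1.0],
]

acts = [0.5, 2.0, 1.5, 0.25]  # one pooled activation per filter

gated_class0 = apply_class_specific_gate(acts, GATES, 0)
gated_class1 = apply_class_specific_gate(acts, GATES, 1)

print(gated_class0)  # [0.5, 2.0, 0.0, 0.0]
print(gated_class1)  # [0.0, 0.0, 1.5, 0.25]
```

With a gate like this, only the filters assigned to the sample's class contribute downstream, which is one way to read the sparse, class-related representation the abstract describes.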

updated: Sat Mar 20 2021 10:09:12 GMT+0000 (UTC)

published: Thu Jul 16 2020 09:12:26 GMT+0000 (UTC)

arXiv

