Interpretable Graph Capsule Networks for Object Recognition

Jindong Gu; Volker Tresp

Capsule Networks (CapsNets) have been proposed as alternatives to Convolutional Neural Networks (CNNs) for recognizing objects in images. The current literature demonstrates many advantages of CapsNets over CNNs. However, how to create explanations for individual classifications of CapsNets has not been well explored. The widely used saliency methods were mainly proposed for explaining CNN-based classifications; they create saliency map explanations by combining activation values with the corresponding gradients, e.g., Grad-CAM. These saliency methods require a specific architecture of the underlying classifier and cannot be trivially applied to CapsNets because of their iterative routing mechanism. To overcome the lack of interpretability, we can either propose new post-hoc interpretation methods for CapsNets or modify the model to have built-in explanations. In this work, we explore the latter. Specifically, we propose interpretable Graph Capsule Networks (GraCapsNets), in which we replace the routing part with a multi-head attention-based Graph Pooling approach. In the proposed model, explanations for individual classifications can be created effectively and efficiently. Our model also demonstrates some unexpected benefits, even though it replaces a fundamental part of CapsNets. Compared to CapsNets, our GraCapsNets achieve better classification performance with fewer parameters and better adversarial robustness. In addition, GraCapsNets retain other advantages of CapsNets, namely disentangled representations and affine transformation robustness.
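The abstract does not spell out how the attention-based graph pooling works, so the following is only a minimal NumPy sketch of the general idea under stated assumptions, not the authors' implementation. It assumes that primary capsules are treated as nodes of a grid graph, that each attention head scores every node for every class from neighbour-aggregated features, and that the attention weights themselves serve as the explanation. The function names, the 4-neighbour adjacency, and the shape of the `weights` tensor are all hypothetical choices made for illustration.

```python
import numpy as np


def softmax(z, axis):
    """Numerically stable softmax along the given axis."""
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)


def grid_adjacency(side):
    """4-neighbour adjacency with self-loops for a side x side grid (assumed graph)."""
    n = side * side
    adj = np.eye(n)
    for r in range(side):
        for c in range(side):
            i = r * side + c
            if c + 1 < side:
                adj[i, i + 1] = adj[i + 1, i] = 1.0
            if r + 1 < side:
                adj[i, i + side] = adj[i + side, i] = 1.0
    return adj


def attention_graph_pool(nodes, adjacency, weights):
    """Pool N node capsules into M class capsules with H attention heads.

    nodes:     (N, D) primary capsule vectors, one per graph node
    adjacency: (N, N) graph over the primary capsules
    weights:   (H, D, M) hypothetical per-head scoring parameters
    Returns class capsules (M, D) and attention (H, N, M).
    """
    aggregated = adjacency @ nodes                       # (N, D) neighbour sum
    scores = np.einsum("nd,hdm->hnm", aggregated, weights)
    attention = softmax(scores, axis=1)                  # normalise over nodes
    pooled = np.einsum("hnm,nd->hmd", attention, nodes)  # (H, M, D)
    return pooled.mean(axis=0), attention                # average the heads


rng = np.random.default_rng(0)
side, dim, heads, classes = 6, 8, 4, 10
nodes = rng.normal(size=(side * side, dim))
weights = rng.normal(size=(heads, dim, classes))

class_caps, attention = attention_graph_pool(nodes, grid_adjacency(side), weights)
predicted = int(np.argmax(np.linalg.norm(class_caps, axis=1)))

# Explanation: head-averaged attention of the predicted class, laid out on the grid.
saliency = attention[:, :, predicted].mean(axis=0).reshape(side, side)
```

In this sketch a single forward pass yields both the prediction and the saliency map, with no iterative routing and no gradient computation, which is one way to read the abstract's claim that explanations can be created efficiently.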

updated: Sun Mar 07 2021 16:50:54 GMT+0000 (UTC)

published: Thu Dec 03 2020 03:18:00 GMT+0000 (UTC)

arXiv
