Improving Shape Awareness and Interpretability in Deep Networks Using Geometric Moments

Rajhans Singh; Ankita Shukla; Pavan Turaga

Deep networks for image classification often rely more on texture information than on object shape. Although efforts have been made to make deep models shape-aware, it is often difficult to make such models simple, interpretable, or rooted in known mathematical definitions of shape. This paper presents a deep-learning model inspired by geometric moments, a classical, well-understood approach to measuring shape-related properties. The proposed method consists of a trainable network that generates coordinate bases and affine parameters to make the features geometrically invariant, yet in a task-specific manner. The proposed model improves the interpretability of the final features. We demonstrate the effectiveness of our method on standard image classification datasets. The proposed model achieves higher classification performance than the baseline and standard ResNet models while substantially improving interpretability.
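
For readers unfamiliar with geometric moments, the following is a minimal sketch of the classical definition only, not of the proposed network. It assumes the textbook raw moment M_pq = sum over pixels of x^p * y^q * I(x, y) and the corresponding central moment taken about the image centroid. The function names and the toy image are hypothetical and chosen purely for illustration.

```python
import numpy as np

def geometric_moment(image, p, q):
    """Raw geometric moment M_pq = sum_x sum_y x^p * y^q * I(x, y)."""
    h, w = image.shape
    ys, xs = np.mgrid[0:h, 0:w]  # pixel coordinate grids (row = y, column = x)
    return float(np.sum((xs ** p) * (ys ** q) * image))

def centroid(image):
    """Centroid (x_bar, y_bar) from the zeroth- and first-order moments."""
    m00 = geometric_moment(image, 0, 0)
    return (geometric_moment(image, 1, 0) / m00,
            geometric_moment(image, 0, 1) / m00)

def central_moment(image, p, q):
    """Central moment mu_pq, computed about the centroid (translation invariant)."""
    cx, cy = centroid(image)
    h, w = image.shape
    ys, xs = np.mgrid[0:h, 0:w]
    return float(np.sum(((xs - cx) ** p) * ((ys - cy) ** q) * image))

# Toy binary image: a rectangle 4 pixels wide and 2 pixels tall.
img = np.zeros((8, 8))
img[2:4, 1:5] = 1.0

area = geometric_moment(img, 0, 0)   # zeroth moment = number of foreground pixels
mu20 = central_moment(img, 2, 0)     # spread along x
mu02 = central_moment(img, 0, 2)     # spread along y
```

In this toy case the spread along x exceeds the spread along y, reflecting the elongated shape, and shifting the rectangle within the image leaves the central moments unchanged. This is the sense in which moments measure shape-related properties. The coordinate bases and affine parameters in the proposed model are learned, which this fixed-grid sketch does not attempt to reproduce.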

updated: Mon May 22 2023 22:02:03 GMT+0000 (UTC)

published: Tue May 24 2022 02:08:05 GMT+0000 (UTC)

arXiv
