Deep Geometric Moment

Rajhans Singh; Ankita Shukla; Pavan Turaga

Deep networks for image classification often rely more on texture information than on object shape. While efforts have been made to make deep models shape-aware, it is often difficult to make such models simple, interpretable, or rooted in known mathematical definitions of shape. This paper presents a deep learning model inspired by geometric moments, a classical and well-understood approach to measuring shape-related properties. The proposed method consists of a trainable network for generating coordinate bases and affine parameters that make the features geometrically invariant, yet in a task-specific manner. The proposed model improves the interpretability of the final features. We demonstrate the effectiveness of our method on standard image classification datasets. The proposed model achieves higher classification performance than the baseline and standard ResNet models while substantially improving interpretability.
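For readers unfamiliar with the classical notion the abstract refers to, the following is a minimal sketch of raw and central geometric moments of a grayscale image, using the textbook definition m_pq = sum of x^p * y^q * I(x, y) over all pixels. It is not the paper's model: there is no trainable coordinate basis and there are no learned affine parameters here. The function names and the two toy images are assumptions made purely for illustration.

```python
# Classical geometric moments on a small grayscale image (list of rows).
# Illustrative only: this is the textbook definition, not the paper's network.

def geometric_moment(image, p, q):
    """Raw moment m_pq = sum over pixels of x**p * y**q * I(x, y)."""
    total = 0.0
    for y, row in enumerate(image):
        for x, value in enumerate(row):
            total += (x ** p) * (y ** q) * value
    return total


def centroid(image):
    """Intensity-weighted centre (m10 / m00, m01 / m00)."""
    m00 = geometric_moment(image, 0, 0)
    if m00 == 0:
        raise ValueError("image has zero total intensity")
    return geometric_moment(image, 1, 0) / m00, geometric_moment(image, 0, 1) / m00


def central_moment(image, p, q):
    """Moment taken about the centroid, which makes it translation invariant."""
    cx, cy = centroid(image)
    total = 0.0
    for y, row in enumerate(image):
        for x, value in enumerate(row):
            total += ((x - cx) ** p) * ((y - cy) ** q) * value
    return total


# Hypothetical toy data: the same 2x2 square at two different positions.
square = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
shifted = [
    [0, 0, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]

if __name__ == "__main__":
    print("area (m00):", geometric_moment(square, 0, 0))
    print("centroid:", centroid(square), "vs shifted:", centroid(shifted))
    print("mu20:", central_moment(square, 2, 0), "vs shifted:", central_moment(shifted, 2, 0))
```

The zeroth-order moment gives the shape's area and the first-order moments give its centroid. Measuring higher-order moments about the centroid removes the dependence on position, which is the simplest example of the geometric invariance the abstract mentions.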

updated: Tue May 24 2022 02:08:05 GMT+0000 (UTC)

published: Tue May 24 2022 02:08:05 GMT+0000 (UTC)

arXiv
