Attention-based Dynamic Subspace Learners for Medical Image Analysis

Sukesh Adiga V; Jose Dolz; Herve Lombaert

医療画像分析のための注意ベースの動的部分空間学習者

類似性の学習は、医療画像分析、特にレコメンデーションシステムや、画像内の解剖学的データの解釈を明らかにする上で重要な側面です。ほとんどの既存の方法は、単一のメトリック学習器を使用して、画像セット上の埋め込みスペースでそのような類似性を学習します。ただし、画像には、色、形状、アーティファクトなど、さまざまなオブジェクト属性があります。単一のメトリック学習器を使用してそのような属性をエンコードすることは不十分であり、一般化に失敗する可能性があります。代わりに、複数の学習者が、包括的な埋め込みの部分空間でこれらの属性の個別の側面に焦点を当てることができます。ただし、これは、新しいデータセットごとに経験的に検出される学習者の数を意味します。この作業、動的部分空間学習者は、学習者の数を事前に知る必要をなくし、トレーニング中に新しい部分空間学習者を集約することによって、複数の学習者を動的に活用することを提案します。さらに、そのような部分空間学習の視覚的解釈可能性は、注意モジュールを私たちの方法に統合することによって強化されます。この統合された注意メカニズムは、画像セットのクラスタリングと埋め込み機能の視覚的説明に寄与する識別可能な画像機能の視覚的洞察を提供します。注意ベースの動的部分空間学習者の利点は、画像クラスタリング、画像検索、および弱く監視されたセグメンテーションのアプリケーションで評価されます。私たちの方法は、複数の学習者のベースラインのパフォーマンスで競争力のある結果を達成し、3つの異なる公開ベンチマークデータセットでのクラスタリングと検索スコアの点で分類ネットワークを大幅に上回っています。さらに、アテンションマップはプロキシラベルを提供します。これにより、最先端の解釈手法と比較して、ダイススコアのセグメンテーション精度が最大15％向上します。

Learning similarity is a key aspect in medical image analysis, particularly in recommendation systems or in uncovering the interpretation of anatomical data in images. Most existing methods learn such similarities in the embedding space over image sets using a single metric learner. Images, however, have a variety of object attributes such as color, shape, or artifacts. Encoding such attributes using a single metric learner is inadequate and may fail to generalize. Instead, multiple learners could focus on separate aspects of these attributes in subspaces of an overarching embedding. This, however, implies the number of learners to be found empirically for each new dataset. This work, Dynamic Subspace Learners, proposes to dynamically exploit multiple learners by removing the need of knowing apriori the number of learners and aggregating new subspace learners during training. Furthermore, the visual interpretability of such subspace learning is enforced by integrating an attention module into our method. This integrated attention mechanism provides a visual insight of discriminative image features that contribute to the clustering of image sets and a visual explanation of the embedding features. The benefits of our attention-based dynamic subspace learners are evaluated in the application of image clustering, image retrieval, and weakly supervised segmentation. Our method achieves competitive results with the performances of multiple learners baselines and significantly outperforms the classification network in terms of clustering and retrieval scores on three different public benchmark datasets. Moreover, our attention maps offer a proxy-labels, which improves the segmentation accuracy up to 15% in Dice scores when compared to state-of-the-art interpretation techniques.

updated: Sat Jun 18 2022 00:44:40 GMT+0000 (UTC)

published: Sat Jun 18 2022 00:44:40 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト