PAENet: A Progressive Attention-Enhanced Network for 3D to 2D Retinal Vessel Segmentation

Zhuojie Wu; Muyi Sun

PAENet：3Dから2Dの網膜血管セグメンテーションのための進歩的な注意強化ネットワーク

3Dから2Dの網膜血管セグメンテーションは、光コヒーレンストモグラフィー血管造影（OCTA）画像における困難な問題です。正確な網膜血管のセグメンテーションは、眼科疾患の診断と予防にとって重要です。ただし、OCTAボリュームの3Dデータを最大限に活用することは、満足のいくセグメンテーション結果を得るための重要な要素です。本論文では、豊富な特徴表現を抽出するための注意メカニズムに基づくプログレッシブ注意強化ネットワーク（PAENet）を提案します。具体的には、フレームワークは、3次元の特徴学習パスと2次元のセグメンテーションパスの2つの主要部分で構成されています。 3次元の特徴学習パスでは、新しい適応プーリングモジュール（APM）を設計し、新しい4重注意モジュール（QAM）を提案します。 APMは、ボリュームの投影方向に沿った依存関係をキャプチャし、フィーチャフュージョンの一連のプーリング係数を学習します。これにより、フィーチャの次元が効率的に削減されます。さらに、QAMは、4Dフィーチャテンソルを最大限に活用する、4グループの次元間の依存関係をキャプチャすることによってフィーチャを再重み付けします。 2次元セグメンテーションパスでは、より詳細な情報を取得するために、3D情報を2Dパスに注入するFeature Fusion Module（FFM）を提案します。一方、Polarized Self-Attention（PSA）ブロックを採用して、それぞれ空間次元とチャネル次元のセマンティック相互依存性をモデル化します。実験的に、OCTA-500データセットでの広範な実験は、提案されたアルゴリズムが以前の方法と比較して最先端のパフォーマンスを達成することを示しています。

3D to 2D retinal vessel segmentation is a challenging problem in Optical Coherence Tomography Angiography (OCTA) images. Accurate retinal vessel segmentation is important for the diagnosis and prevention of ophthalmic diseases. However, making full use of the 3D data of OCTA volumes is a vital factor for obtaining satisfactory segmentation results. In this paper, we propose a Progressive Attention-Enhanced Network (PAENet) based on attention mechanisms to extract rich feature representation. Specifically, the framework consists of two main parts, the three-dimensional feature learning path and the two-dimensional segmentation path. In the three-dimensional feature learning path, we design a novel Adaptive Pooling Module (APM) and propose a new Quadruple Attention Module (QAM). The APM captures dependencies along the projection direction of volumes and learns a series of pooling coefficients for feature fusion, which efficiently reduces feature dimension. In addition, the QAM reweights the features by capturing four-group cross-dimension dependencies, which makes maximum use of 4D feature tensors. In the two-dimensional segmentation path, to acquire more detailed information, we propose a Feature Fusion Module (FFM) to inject 3D information into the 2D path. Meanwhile, we adopt the Polarized Self-Attention (PSA) block to model the semantic interdependencies in spatial and channel dimensions respectively. Experimentally, our extensive experiments on the OCTA-500 dataset show that our proposed algorithm achieves state-of-the-art performance compared with previous methods.

updated: Sun Oct 24 2021 14:47:52 GMT+0000 (UTC)

published: Thu Aug 26 2021 10:27:25 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト