CapsNet for Medical Image Segmentation

Minh Tran; Viet-Khoa Vo-Ho; Kyle Quinn; Hien Nguyen; Khoa Luu; Ngan Le

医療画像セグメンテーションのためのCapsNet

畳み込みニューラルネットワーク（CNN）は、非構造化データから特徴を自動的に抽出する機能により、医療画像のセグメンテーションを含むコンピュータービジョンのタスクの解決に成功しています。ただし、CNNは回転とアフィン変換に敏感であり、その成功は、さまざまな入力バリエーションをキャプチャする大規模なラベル付きデータセットに依存しています。このネットワークパラダイムは、医療セグメンテーションのために注釈付きデータを取得するのに費用がかかり、プライバシー規制が厳しいため、大規模な課題を提起しました。さらに、CNNを使用した視覚表現学習には独自の欠点があります。たとえば、従来のCNNのプーリング層は位置情報を破棄する傾向があり、CNNは方向とサイズが異なる入力画像で失敗する傾向があります。カプセルネットワーク（CapsNet）は、プーリングレイヤーを動的ルーティングと畳み込みストライドに置き換えることで表現学習の堅牢性を向上させた最近の新しいアーキテクチャであり、分類、認識、セグメンテーション、自然言語処理などの一般的なタスクで潜在的な結果を示しています。スカラー出力を生成するCNNとは異なり、CapsNetは、部分全体の関係を維持することを目的としたベクトル出力を返します。この作業では、最初にCNNの制限とCapsNetの基本を紹介します。次に、医療画像セグメンテーションのタスクのためのCapsNetの最近の開発を提供します。最後に、2D画像と3Dボリューム医療画像セグメンテーションの両方にCapsNetを実装するためのさまざまな効果的なネットワークアーキテクチャについて説明します。

Convolutional Neural Networks (CNNs) have been successful in solving tasks in computer vision including medical image segmentation due to their ability to automatically extract features from unstructured data. However, CNNs are sensitive to rotation and affine transformation and their success relies on huge-scale labeled datasets capturing various input variations. This network paradigm has posed challenges at scale because acquiring annotated data for medical segmentation is expensive, and strict privacy regulations. Furthermore, visual representation learning with CNNs has its own flaws, e.g., it is arguable that the pooling layer in traditional CNNs tends to discard positional information and CNNs tend to fail on input images that differ in orientations and sizes. Capsule network (CapsNet) is a recent new architecture that has achieved better robustness in representation learning by replacing pooling layers with dynamic routing and convolutional strides, which has shown potential results on popular tasks such as classification, recognition, segmentation, and natural language processing. Different from CNNs, which result in scalar outputs, CapsNet returns vector outputs, which aim to preserve the part-whole relationships. In this work, we first introduce the limitations of CNNs and fundamentals of CapsNet. We then provide recent developments of CapsNet for the task of medical image segmentation. We finally discuss various effective network architectures to implement a CapsNet for both 2D images and 3D volumetric medical image segmentation.

updated: Wed Mar 16 2022 21:15:07 GMT+0000 (UTC)

published: Wed Mar 16 2022 21:15:07 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト