Disentangle, align and fuse for multimodal and semi-supervised image segmentation

Agisilaos Chartsias; Giorgos Papanastasiou; Chengjia Wang; Scott Semple; David E. Newby; Rohan Dharmakumar; Sotirios A. Tsaftaris

マルチモーダルおよび半教師あり画像セグメンテーションのための解きほぐし、位置合わせ、融合

磁気共鳴（MR）プロトコルは、病理学と臓器の状態を適切に評価するためにいくつかのシーケンスに依存しています。画像解析の進歩にもかかわらず、ここではモダリティと呼ばれる各シーケンスを個別に扱う傾向があります。モダリティ（臓器の解剖学的構造）間で共有される共通情報を利用することは、マルチモダリティの処理と学習に役立ちます。ただし、この利点を得るには、モダリティ全体での固有の解剖学的位置ずれと信号強度の不一致を克服する必要があります。注釈がほとんど（半教師あり）またはまったくない（教師なし）場合でも、他のモダリティに存在する情報を活用することを学習することにより、（単一の入力モデルに対して）対象のモダリティのセグメンテーション精度を向上させる方法を提示します。特定のモダリティ。私たちの方法の中核は、解剖学的要因と画像化要因への解きほぐされた分解を学習することです。さまざまな入力からの共有解剖学的要因が共同で処理および融合されて、より正確なセグメンテーションマスクが抽出されます。画像の位置ずれは、解剖学的要因を非線形に整列させる空間トランスフォーマーネットワークで修正されます。イメージングファクターは、さまざまなモダリティデータ全体の信号強度特性をキャプチャし、画像の再構成に使用され、半教師あり学習を可能にします。入力間の時間的およびスライスペアリングは動的に学習されます。後期ガドリニウム増強（LGE）および血液酸素化レベル依存（BOLD）心臓セグメンテーション、およびT2腹部セグメンテーションでのアプリケーションを示します。コードはhttps://github.com/vios-s/multimodal_segmentationで入手できます。

Magnetic resonance (MR) protocols rely on several sequences to assess pathology and organ status properly. Despite advances in image analysis, we tend to treat each sequence, here termed modality, in isolation. Taking advantage of the common information shared between modalities (an organ's anatomy) is beneficial for multi-modality processing and learning. However, we must overcome inherent anatomical misregistrations and disparities in signal intensity across the modalities to obtain this benefit. We present a method that offers improved segmentation accuracy of the modality of interest (over a single input model), by learning to leverage information present in other modalities, even if few (semi-supervised) or no (unsupervised) annotations are available for this specific modality. Core to our method is learning a disentangled decomposition into anatomical and imaging factors. Shared anatomical factors from the different inputs are jointly processed and fused to extract more accurate segmentation masks. Image misregistrations are corrected with a Spatial Transformer Network, which non-linearly aligns the anatomical factors. The imaging factor captures signal intensity characteristics across different modality data and is used for image reconstruction, enabling semi-supervised learning. Temporal and slice pairing between inputs are learned dynamically. We demonstrate applications in Late Gadolinium Enhanced (LGE) and Blood Oxygenation Level Dependent (BOLD) cardiac segmentation, as well as in T2 abdominal segmentation. Code is available at https://github.com/vios-s/multimodal_segmentation.

updated: Mon Nov 09 2020 19:18:39 GMT+0000 (UTC)

published: Mon Nov 11 2019 17:44:00 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト