Towards to Robust and Generalized Medical Image Segmentation Framework

Yurong Chen

To reduce radiologists' workload, computer-aided diagnosis systems capable of reviewing and analyzing medical images are gradually being deployed. Deep learning-based region-of-interest segmentation is among the most exciting use cases. However, this paradigm is restricted in real-world clinical applications because of poor robustness and generalization, and the problem is more severe when training data are scarce. In this paper, we address the challenge from a representation learning point of view. We investigate how collapsed representations, one of the main causes of poor robustness and generalization, can be avoided through transfer learning. We therefore propose a novel two-stage framework for robust, generalized segmentation. In particular, an unsupervised Tile-wise AutoEncoder (T-AE) pretraining architecture is designed to learn meaningful representations that improve the generalization and robustness of downstream tasks. The learned knowledge is then transferred to the segmentation benchmark. Coupled with an image reconstruction network, the representation continues to be decoded, which encourages the model to capture more semantic features. We conduct lung segmentation experiments on multiple chest X-ray datasets. Empirically, the results demonstrate the superior generalization capability of the proposed framework on unseen domains, in terms of both high performance and robustness to corruption, especially when training data are limited.

updated: Mon Aug 09 2021 05:58:49 GMT+0000 (UTC)

published: Mon Aug 09 2021 05:58:49 GMT+0000 (UTC)

arXiv
