Cross-Modal Information Maximization for Medical Imaging: CMIM

Tristan Sylvain; Francis Dutil; Tess Berthier; Lisa Di Jorio; Margaux Luck; Devon Hjelm; Yoshua Bengio

医用画像のためのクロスモーダル情報の最大化：CMIM

病院では、データは特定の情報システムにサイロ化され、患者が受けるさまざまな医用画像検査（CTスキャン、MRI、PET、超音波など）や関連する放射線レポートなどのさまざまなモダリティで同じ情報を利用できるようになります。これにより、テスト時に常に利用できるとは限らない同じ情報の複数のビューをトレーニング時に取得して使用するユニークな機会が提供されます。この論文では、相互情報量最大化の最近の進歩を使用して、テスト時にモダリティの低下に強いマルチモーダル入力の優れた表現を学習することにより、利用可能なデータを最大限に活用する革新的なフレームワークを提案します。列車の時間にクロスモーダル情報を最大化することにより、医用画像の分類とセグメンテーションという2つの異なる設定で、いくつかの最先端のベースラインを上回ることができます。特に、私たちの方法は、弱いモダリティの推論時間のパフォーマンスに強い影響を与えることが示されています。

In hospitals, data are siloed to specific information systems that make the same information available under different modalities such as the different medical imaging exams the patient undergoes (CT scans, MRI, PET, Ultrasound, etc.) and their associated radiology reports. This offers unique opportunities to obtain and use at train-time those multiple views of the same information that might not always be available at test-time. In this paper, we propose an innovative framework that makes the most of available data by learning good representations of a multi-modal input that are resilient to modality dropping at test-time, using recent advances in mutual information maximization. By maximizing cross-modal information at train time, we are able to outperform several state-of-the-art baselines in two different settings, medical image classification, and segmentation. In particular, our method is shown to have a strong impact on the inference-time performance of weaker modalities.

updated: Mon Feb 01 2021 21:10:37 GMT+0000 (UTC)

published: Tue Oct 20 2020 20:05:35 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト