MedBLIP: Bootstrapping Language-Image Pre-training from 3D Medical Images and Texts

Qiuhui Chen; Xinyue Hu; Zirui Wang; Yi Hong

MedBLIP: 3D 医療画像とテキストからの言語画像事前トレーニングのブートストラッピング

ビジョン言語事前トレーニング (VLP) モデルは、多くのコンピュータービジョンアプリケーションで効果的であることが実証されています。この論文では、実際に行われている画像スキャンと電子医療記録のテキスト記述に基づいてコンピューター支援診断 (CAD) を行うための、医療分野における VLP モデルの開発について検討します。私たちの目標を達成するために、私たちは軽量 CAD システム MedBLIP を提案します。これは、既製の凍結済みの事前トレーニング済み画像エンコーダーと凍結された大規模言語モデルから VLP をブートストラップするための新しいパラダイムです。私たちは、3D 医療画像と 2D の事前トレーニング済み画像エンコーダーおよび言語モデルの間のギャップを埋めるために MedQFormer モジュールを設計します。 MedBLIP の有効性を評価するために、ADNI、NACC、OASIS、AIBL、MIRIAD という 5 つの公開アルツハイマー病 (AD) データセットから 30,000 を超える画像ボリュームを収集します。私たちが知っているこの最大の AD データセット上で、私たちのモデルは、健康、軽度認知障害 (MCI)、および AD 被験者のゼロショット分類で SOTA パフォーマンスを達成し、医療視覚的質問応答 (VQA) を作成する機能を示しています。コードと事前トレーニングされたモデルはオンラインで入手できます: https://github.com/Qybc/MedBLIP。

Vision-language pre-training (VLP) models have been demonstrated to be effective in many computer vision applications. In this paper, we consider developing a VLP model in the medical domain for making computer-aided diagnoses (CAD) based on image scans and text descriptions in electronic health records, as done in practice. To achieve our goal, we present a lightweight CAD system MedBLIP, a new paradigm for bootstrapping VLP from off-the-shelf frozen pre-trained image encoders and frozen large language models. We design a MedQFormer module to bridge the gap between 3D medical images and 2D pre-trained image encoders and language models as well. To evaluate the effectiveness of our MedBLIP, we collect more than 30,000 image volumes from five public Alzheimer's disease (AD) datasets, i.e., ADNI, NACC, OASIS, AIBL, and MIRIAD. On this largest AD dataset we know, our model achieves the SOTA performance on the zero-shot classification of healthy, mild cognitive impairment (MCI), and AD subjects, and shows its capability of making medical visual question answering (VQA). The code and pre-trained models is available online: https://github.com/Qybc/MedBLIP.

updated: Thu May 18 2023 08:19:33 GMT+0000 (UTC)

published: Thu May 18 2023 08:19:33 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト