Knowledge-Augmented Contrastive Learning for Abnormality Classification and Localization in Chest X-rays with Radiomics using a Feedback Loop

Yan Han; Chongyan Chen; Ahmed Tewfik; Benjamin Glicksberg; Ying Ding; Yifan Peng; Zhangyang Wang

フィードバックループを使用したラジオミクスによる胸部X線の異常分類と位置特定のための知識増強対照学習

胸部X線の異常の分類と位置特定のための高精度の予測モデルを構築するには、通常、手動で注釈を付けた多数のラベルと異常のピクセル領域（境界ボックス）が必要です。ただし、そのような注釈、特にバウンディングボックスを取得するにはコストがかかります。最近、対照学習は、ラベルのない自然画像を活用して、高度に一般化可能で識別可能な特徴を生成することで大きな期待を示しています。ただし、医用画像はデータ拡張の修正がはるかに少ないため、その力を医用画像ドメインに拡張することは十分に検討されておらず、非常に重要です。対照的に、彼らの事前の知識とラジオミック機能はしばしば重要です。このギャップを埋めるために、疾患の分類とローカリゼーションのタスクを同時に実行する、エンドツーエンドの半教師あり知識増強対照学習フレームワークを提案します。私たちのフレームワークの重要なノブは、知識の増強としてラジオミック機能をシームレスに統合することにより、医用画像に合わせた独自のポジティブサンプリングアプローチです。具体的には、最初に画像エンコーダーを適用して胸部X線を分類し、画像の特徴を生成します。次に、Grad-CAMを活用して、胸部X線の重要な（異常な）領域を強調表示し（注釈が付いていない場合でも）、そこから放射線の特徴を抽出します。次に、放射性特徴は別の専用エンコーダーを通過して、同じ胸部X線から生成された画像特徴のポジティブサンプルとして機能します。このように、私たちのフレームワークは、画像とラジオミックモダリティ機能が相互に強化するためのフィードバックループを構成します。それらの対照は、堅牢で解釈可能な知識増強表現を生み出します。 NIH胸部X線データセットでの広範な実験は、私たちのアプローチが分類とローカリゼーションの両方のタスクで既存のベースラインを上回っていることを示しています。

Building a highly accurate predictive model for classification and localization of abnormalities in chest X-rays usually requires a large number of manually annotated labels and pixel regions (bounding boxes) of abnormalities. However, it is expensive to acquire such annotations, especially the bounding boxes. Recently, contrastive learning has shown strong promise in leveraging unlabeled natural images to produce highly generalizable and discriminative features. However, extending its power to the medical image domain is under-explored and highly non-trivial, since medical images are much less amendable to data augmentations. In contrast, their prior knowledge, as well as radiomic features, is often crucial. To bridge this gap, we propose an end-to-end semi-supervised knowledge-augmented contrastive learning framework, that simultaneously performs disease classification and localization tasks. The key knob of our framework is a unique positive sampling approach tailored for the medical images, by seamlessly integrating radiomic features as a knowledge augmentation. Specifically, we first apply an image encoder to classify the chest X-rays and to generate the image features. We next leverage Grad-CAM to highlight the crucial (abnormal) regions for chest X-rays (even when unannotated), from which we extract radiomic features. The radiomic features are then passed through another dedicated encoder to act as the positive sample for the image features generated from the same chest X-ray. In this way, our framework constitutes a feedback loop for image and radiomic modality features to mutually reinforce each other. Their contrasting yields knowledge-augmented representations that are both robust and interpretable. Extensive experiments on the NIH Chest X-ray dataset demonstrate that our approach outperforms existing baselines in both classification and localization tasks.

updated: Mon Oct 18 2021 02:44:45 GMT+0000 (UTC)

published: Sun Apr 11 2021 09:16:29 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト