Extracting 2D weak labels from volume labels using multiple instance learning in CT hemorrhage detection

Samuel W. Remedios; Zihao Wu; Camilo Bermudez; Cailey I. Kerley; Snehashis Roy; Mayur B. Patel; John A. Butman; Bennett A. Landman; Dzung L. Pham

CT出血検出での複数インスタンス学習を使用したボリュームラベルからの2D弱いラベルの抽出

複数インスタンス学習（MIL）は、モデルがバッグクラスラベルからインスタンスクラスラベルを学習できるようにすることを目的とする、教師付き学習方法です。この場合、バッグは複数のインスタンスを含むように定義されます。 MILは、弱いラベルから学習するための牽引力を獲得していますが、3D医療画像に広く適用されていません。 MILは、（1）従来の3Dネットワークの応用を妨げる異方性の高いボクセルと（2）ボリュームラベル全体を学習する能力が限られているため、臨床CTの取得に適しています。この作業では、深い畳み込みニューラルネットワークを使用してMILを適用し、臨床CT頭部画像ボリュームに1つ以上の大きな出血（> 20cm ^ 3）があるかどうかを特定し、2Dスライス注釈を必要とせずに学習した2Dモデルを作成します。個々のイメージボリュームは個別のバッグと見なされ、各ボリュームのスライスはインスタンスです。このようなフレームワークは、2Dセグメンテーションアプローチのトレーニングを支援するために、臨床レポートで取得した情報を組み込むための段階を設定します。このコンテキスト内で、トレーニングデータの量を変えることでMILの一般化を可能にするデータ要件を評価します。我々の結果は、スライスごとの正確な出血検出を達成するために、少なくとも400の患者画像ボリュームのトレーニングサイズが必要であることを示しています。 5倍の交差検証では、トレーニングボリュームの最大数を使用した主要モデルの平均真陽性率は98.10％、平均真陰性率は99.36％、平均精度は0.9698でした。これらのモデルは、CTニューロイメージングにおけるMILの継続的な調査と適応を可能にするために、ソースコードとともに利用可能になりました。

Multiple instance learning (MIL) is a supervised learning methodology that aims to allow models to learn instance class labels from bag class labels, where a bag is defined to contain multiple instances. MIL is gaining traction for learning from weak labels but has not been widely applied to 3D medical imaging. MIL is well-suited to clinical CT acquisitions since (1) the highly anisotropic voxels hinder application of traditional 3D networks and (2) patch-based networks have limited ability to learn whole volume labels. In this work, we apply MIL with a deep convolutional neural network to identify whether clinical CT head image volumes possess one or more large hemorrhages (> 20cm^3), resulting in a learned 2D model without the need for 2D slice annotations. Individual image volumes are considered separate bags, and the slices in each volume are instances. Such a framework sets the stage for incorporating information obtained in clinical reports to help train a 2D segmentation approach. Within this context, we evaluate the data requirements to enable generalization of MIL by varying the amount of training data. Our results show that a training size of at least 400 patient image volumes was needed to achieve accurate per-slice hemorrhage detection. Over a five-fold cross-validation, the leading model, which made use of the maximum number of training volumes, had an average true positive rate of 98.10%, an average true negative rate of 99.36%, and an average precision of 0.9698. The models have been made available along with source code to enabled continued exploration and adaption of MIL in CT neuroimaging.

updated: Wed Nov 13 2019 17:24:21 GMT+0000 (UTC)

published: Wed Nov 13 2019 17:24:21 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト