Explainable multiple abnormality classification of chest CT volumes with AxialNet and HiResCAM

Rachel Lea Draelos; Lawrence Carin

AxialNetおよびHiResCAMによる胸部CTボリュームの説明可能な複数の異常分類

モデルの予測を理解することは、モデルの正確性の迅速な検証を容易にし、交絡変数を悪用するモデルの使用を防ぐために、ヘルスケアにおいて重要です。モデルが各異常を予測するために使用される領域を示さなければならない、体積医療画像における説明可能な複数の異常分類の挑戦的な新しいタスクを紹介します。このタスクを解決するために、各異常のトップスライスの識別を可能にする畳み込みニューラルネットワークAxialNetを学習するマルチインスタンスを提案します。次に、アテンションメカニズムであるHiResCAMを組み込んで、サブスライス領域を識別します。 AxialNetの場合、無関係な場所を強調表示することがあるGrad-CAMとは異なり、HiResCAMの説明はモデルが使用した場所を反映することが保証されていることを証明します。忠実な説明を生成するモデルを装備し、HiResCAMと3D許可領域を活用して、異常が発生する臓器のみに基づいてモデルが異常を予測するように促す新しいマスク損失を通じて、モデルの学習を改善することを目指します。 3D許可領域は、放射線レポートから抽出された位置情報と形態学的画像処理によって取得された臓器セグメンテーションマップを組み合わせた新しいアプローチPARTITIONによって自動的に取得されます。全体として、ボリューム医用画像における説明可能な複数異常予測の最初のモデルを提案し、次にマスク損失を使用して、状態を表す36,316スキャンのRAD-ChestCTデータセットの複数の異常の臓器局在化を33％改善します。芸術の。この作業は、胸部CTボリュームにおける複数の異常モデリングの臨床的適用性を向上させます。

Understanding model predictions is critical in healthcare, to facilitate rapid verification of model correctness and to guard against use of models that exploit confounding variables. We introduce the challenging new task of explainable multiple abnormality classification in volumetric medical images, in which a model must indicate the regions used to predict each abnormality. To solve this task, we propose a multiple instance learning convolutional neural network, AxialNet, that allows identification of top slices for each abnormality. Next we incorporate HiResCAM, an attention mechanism, to identify sub-slice regions. We prove that for AxialNet, HiResCAM explanations are guaranteed to reflect the locations the model used, unlike Grad-CAM which sometimes highlights irrelevant locations. Armed with a model that produces faithful explanations, we then aim to improve the model's learning through a novel mask loss that leverages HiResCAM and 3D allowed regions to encourage the model to predict abnormalities based only on the organs in which those abnormalities appear. The 3D allowed regions are obtained automatically through a new approach, PARTITION, that combines location information extracted from radiology reports with organ segmentation maps obtained through morphological image processing. Overall, we propose the first model for explainable multi-abnormality prediction in volumetric medical images, and then use the mask loss to achieve a 33% improvement in organ localization of multiple abnormalities in the RAD-ChestCT data set of 36,316 scans, representing the state of the art. This work advances the clinical applicability of multiple abnormality modeling in chest CT volumes.

updated: Wed Nov 24 2021 01:14:33 GMT+0000 (UTC)

published: Wed Nov 24 2021 01:14:33 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト