Deep Mining Generation of Lung Cancer Malignancy Models from Chest X-ray Images

Michael J. Horry; Subrata Chakraborty; Biswajeet Pradhan; Manoranjan Paul; Douglas P. S. Gomes; Anwaar Ul-Haq

胸部X線画像からの肺がん悪性腫瘍モデルのディープマイニング生成

肺がんは、世界中のがんによる死亡と罹患の主な原因です。多くの研究で、機械学習モデルが胸部X線画像から肺結節を検出するのに効果的であることが示されています。ただし、ディープラーニングモデルのブラックボックスの性質に起因するいくつかの実用的、倫理的、および規制上の制約のため、これらの手法はまだ医学界に受け入れられていません。さらに、胸部X線で見えるほとんどの肺結節は良性です。したがって、コンピュータビジョンベースの肺結節検出の狭いタスクは、自動化された肺がん検出と同一視することはできません。両方の懸念に対処するために、この研究では、肺がんの悪性腫瘍の予測を解釈可能な決定木として提示する、新しいハイブリッド深層学習と決定木ベースのコンピュータービジョンモデルを紹介します。このプロセスの深層学習コンポーネントは、肺がんに関連する病理学的バイオマーカーに関する公開されている大規模なデータセットを使用してトレーニングされます。次に、これらのモデルを使用して、悪性腫瘍のメタデータが利用可能な2つの独立したデータセットから胸部X線画像のバイオマーカースコアを推測します。浅い決定木を悪性腫瘍の層化データセットに適合させ、さまざまな指標を調べて最適なモデルを決定することにより、多変量予測モデルをマイニングします。私たちの最良の決定木モデルは、それぞれ86.7％と80.0％の感度と特異度を達成し、92.9％の正の予測値を実現します。この方法を使用してマイニングされた決定木は、人間の放射線技師の効率を改善するためのワークフロー拡張ツールとして実装するための、臨床的に有用な多変量肺がん悪性腫瘍モデルへの改良の出発点と見なすことができます。

Lung cancer is the leading cause of cancer death and morbidity worldwide. Many studies have shown machine learning models to be effective at detecting lung nodules from chest X-ray images. However, these techniques have yet to be embraced by the medical community due to several practical, ethical, and regulatory constraints stemming from the black-box nature of deep learning models. Additionally, most lung nodules visible on chest X-ray are benign; therefore, the narrow task of computer vision-based lung nodule detection cannot be equated to automated lung cancer detection. Addressing both concerns, this study introduces a novel hybrid deep learning and decision tree-based computer vision model which presents lung cancer malignancy predictions as interpretable decision trees. The deep learning component of this process is trained using a large publicly available dataset on pathological biomarkers associated with lung cancer. These models are then used to inference biomarker scores for chest X-ray images from two, independent data sets for which malignancy metadata is available. We mine multi-variate predictive models by fitting shallow decision trees to the malignancy stratified datasets and interrogate a range of metrics to determine the best model. Our best decision tree model achieves sensitivity and specificity of 86.7% and 80.0% respectively with a positive predictive value of 92.9%. Decision trees mined using this method may be considered as a starting point for refinement into clinically useful multi-variate lung cancer malignancy models for implementation as a workflow augmentation tool to improve the efficiency of human radiologists.

updated: Sun Aug 29 2021 10:20:27 GMT+0000 (UTC)

published: Thu Dec 10 2020 04:11:59 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト