Pan-Cancer Integrative Histology-Genomic Analysis via Interpretable Multimodal Deep Learning

Richard J. Chen; Ming Y. Lu; Drew F. K. Williamson; Tiffany Y. Chen; Jana Lipkova; Muhammad Shaban; Maha Shady; Mane Williams; Bumjin Joo; Zahra Noor; Faisal Mahmood

The rapidly emerging field of deep learning-based computational pathology has demonstrated promise in developing objective prognostic models from histology whole slide images (WSIs). However, most prognostic models are based on either histology or genomics alone and do not address how the two can be integrated to develop joint image-omic prognostic models. It is also of interest to identify explainable morphological and molecular descriptors from these models that govern such prognosis. We used multimodal deep learning to integrate gigapixel whole slide pathology images, RNA-seq abundance, copy number variation, and mutation data from 5,720 patients across 14 major cancer types. Our interpretable, weakly-supervised, multimodal deep learning algorithm fuses these heterogeneous modalities to predict outcomes and, via multimodal interpretability, discovers prognostic features in these modalities that corroborate poor and favorable outcomes. We compared our model with unimodal deep learning models trained on histology slides or molecular profiles alone and demonstrate a performance increase in risk stratification on 9 out of 14 cancers. In addition, we analyze the morphologic and molecular markers responsible for prognostic predictions across all cancer types. All analyzed data, including morphological and molecular correlates of patient prognosis across the 14 cancer types at the disease and patient levels, are presented in an interactive open-access database (http://pancancer.mahmoodlab.org) to allow for further exploration and prognostic biomarker discovery. To validate that these model explanations are prognostic, we further analyzed high-attention morphological regions in WSIs; this analysis indicates that the presence of tumor-infiltrating lymphocytes corroborates favorable cancer prognosis in 9 out of the 14 cancer types studied.

updated: Wed Aug 04 2021 20:40:05 GMT+0000 (UTC)

published: Wed Aug 04 2021 20:40:05 GMT+0000 (UTC)

arXiv
