AMIGO: Sparse Multi-Modal Graph Transformer with Shared-Context Processing for Representation Learning of Giga-pixel Images

Ramin Nakhli; Puria Azadi Moghadam; Haoyang Mi; Hossein Farahani; Alexander Baras; Blake Gilks; Ali Bashashati

AMIGO：ギガピクセル画像の表現学習のための共有コンテキスト処理を備えたスパースマルチモーダルグラフトランスフォーマー

ギガピクセル全体のスライド組織病理画像 (WSI) の処理は、計算コストの高い作業です。複数インスタンス学習 (MIL) は、WSI を処理するための従来のアプローチになりました。MIL では、これらの画像を小さなパッチに分割してさらに処理します。ただし、MIL ベースの手法では、パッチ内の個々のセルに関する明示的な情報が無視されます。この論文では、共有コンテキスト処理の新しい概念を定義することにより、組織内の細胞グラフを使用して、患者の階層構造を利用しながら単一の表現を提供するマルチモーダルグラフトランスフォーマー (AMIGO) を設計しました。これにより、細胞レベルの情報と組織レベルの情報の間で動的に焦点を合わせることができます。生存予測における複数の最先端の方法に対してモデルのパフォーマンスをベンチマークし、階層的なビジョントランスフォーマー (ViT) を含むすべての方法を大幅に上回ることができることを示しました。さらに重要なことは、私たちのモデルは欠落している情報に対して非常に堅牢であり、データの 20% という低いデータでも同じパフォーマンスを達成できることを示しています。最後に、2 つの異なるがんデータセットで、モデルが患者を低リスク群と高リスク群に層別化できることを実証しましたが、他の最先端の方法ではこの目標を達成できませんでした。また、188 人の患者からの 1,600 の組織マイクロアレイ (TMA) コアとその生存情報を含む免疫組織化学画像 (InUIT) の大規模なデータセットを公開し、このコンテキストで公開されている最大のデータセットの 1 つにしています。

Processing giga-pixel whole slide histopathology images (WSI) is a computationally expensive task. Multiple instance learning (MIL) has become the conventional approach to process WSIs, in which these images are split into smaller patches for further processing. However, MIL-based techniques ignore explicit information about the individual cells within a patch. In this paper, by defining the novel concept of shared-context processing, we designed a multi-modal Graph Transformer (AMIGO) that uses the celluar graph within the tissue to provide a single representation for a patient while taking advantage of the hierarchical structure of the tissue, enabling a dynamic focus between cell-level and tissue-level information. We benchmarked the performance of our model against multiple state-of-the-art methods in survival prediction and showed that ours can significantly outperform all of them including hierarchical Vision Transformer (ViT). More importantly, we show that our model is strongly robust to missing information to an extent that it can achieve the same performance with as low as 20% of the data. Finally, in two different cancer datasets, we demonstrated that our model was able to stratify the patients into low-risk and high-risk groups while other state-of-the-art methods failed to achieve this goal. We also publish a large dataset of immunohistochemistry images (InUIT) containing 1,600 tissue microarray (TMA) cores from 188 patients along with their survival information, making it one of the largest publicly available datasets in this context.

updated: Wed Mar 01 2023 23:37:45 GMT+0000 (UTC)

published: Wed Mar 01 2023 23:37:45 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト