Diagnose Like a Pathologist: Transformer-Enabled Hierarchical Attention-Guided Multiple Instance Learning for Whole Slide Image Classification

Conghao Xiong; Hao Chen; Joseph J. Y. Sung; Irwin King

Multiple Instance Learning (MIL) and transformers are increasingly popular in histopathology Whole Slide Image (WSI) classification. However, unlike human pathologists, who selectively observe specific regions of histopathology tissue under different magnifications, most methods do not incorporate multiple resolutions of the WSIs hierarchically and attentively, thereby losing focus on the WSIs and information from other resolutions. To resolve this issue, we propose a Hierarchical Attention-Guided Multiple Instance Learning framework to fully exploit the WSIs. This framework can dynamically and attentively discover the discriminative regions across multiple resolutions of the WSIs. Within this framework, an Integrated Attention Transformer is proposed to further enhance the performance of the transformer and obtain a more holistic WSI (bag) representation. This transformer consists of multiple Integrated Attention Modules, each of which combines a transformer layer with an aggregation module that produces a bag representation based on every instance representation in that bag. The experimental results show that our method achieves state-of-the-art performance on multiple datasets, including Camelyon16, TCGA-RCC, TCGA-NSCLC, and an in-house IMGC dataset. The code is available at https://github.com/BearCleverProud/HAG-MIL.
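To make the two ideas in the abstract concrete, here is a minimal sketch of attention-based aggregation (instances to a bag representation) and of attention-guided selection across resolutions. It is not the authors' implementation, which is in the repository linked above. Everything in it is an assumption made for illustration: the function names, the fixed `query` vector standing in for learned attention parameters, the `children` mapping from a coarse patch to the finer patches it covers, and the omission of the transformer layer that the paper pairs with the aggregation module.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(instances, query):
    """Aggregate instance vectors into one bag vector.

    Scores are dot products with `query` (a hypothetical stand-in for
    learned attention parameters); the bag is the weighted sum of instances.
    """
    scores = [sum(x * q for x, q in zip(inst, query)) for inst in instances]
    weights = softmax(scores)
    dim = len(instances[0])
    bag = [sum(w * inst[d] for w, inst in zip(weights, instances))
           for d in range(dim)]
    return bag, weights


def select_top_k(weights, k):
    """Indices of the k most-attended instances, highest weight first."""
    order = sorted(range(len(weights)), key=lambda i: weights[i], reverse=True)
    return order[:k]


def hierarchical_pass(levels, children, query, k):
    """Walk from the coarsest to the finest resolution.

    levels[l] holds the instance vectors at resolution l (coarse to fine).
    children[l][i] lists the indices at level l + 1 covered by patch i at
    level l (an assumed layout). At each level only the children of the
    k most-attended patches are examined at the next level.
    """
    candidates = list(range(len(levels[0])))
    bag = None
    for level, instances in enumerate(levels):
        kept = [instances[i] for i in candidates]
        bag, weights = attention_pool(kept, query)
        if level == len(levels) - 1:
            break
        top = select_top_k(weights, k)
        candidates = [c for t in top for c in children[level][candidates[t]]]
    return bag


# Toy example: three coarse patches, each covering two fine patches.
low_res = [[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]]
high_res = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0],
            [0.0, 1.0], [0.0, 0.0], [0.0, 0.0]]
children = [[[0, 1], [2, 3], [4, 5]]]
bag = hierarchical_pass([low_res, high_res], children, query=[2.0, 0.0], k=1)
```

In this toy run only the children of the single most-attended coarse patch are pooled at the fine level, which mirrors a pathologist zooming in on a suspicious region rather than scanning every patch at full magnification.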

updated: Mon Jul 17 2023 03:00:47 GMT+0000 (UTC)

published: Thu Jan 19 2023 15:38:43 GMT+0000 (UTC)

arXiv
