Task-specific Fine-tuning via Variational Information Bottleneck for Weakly-supervised Pathology Whole Slide Image Classification

Honglin Li; Chenglu Zhu; Yunlong Zhang; Yuxuan Sun; Zhongyi Shui; Wenwei Kuang; Sunyi Zheng; Lin Yang

Multiple Instance Learning (MIL) has shown promising results in digital pathology Whole Slide Image (WSI) classification, but the paradigm still suffers from performance and generalization problems caused by the high computational cost of gigapixel WSIs and the limited sample size available for model training. To deal with the computation problem, most MIL methods first extract representations with a frozen model pretrained on ImageNet. This process may lose essential information owing to the large domain gap, and may hinder the generalization of the model because image-level training-time augmentations are not applied. Although Self-supervised Learning (SSL) offers viable representation learning schemes, how to convert the task-agnostic features of SSL into task-specific features under partial-label supervised learning, and thereby improve the downstream task, still needs further exploration. To alleviate the dilemma between computation cost and performance, we propose an efficient WSI fine-tuning framework motivated by Information Bottleneck theory. The theory enables the framework to find the minimal sufficient statistics of a WSI, which allows us to fine-tune the backbone into a task-specific representation using only WSI-level weak labels. We further analyze the WSI-MIL problem to theoretically derive our fine-tuning method. Our framework is evaluated on five pathology WSI datasets with various WSI heads. The experimental results show that our fine-tuned representations significantly improve both accuracy and generalization compared with previous works. Source code will be available at https://github.com/invoker-LL/WSI-finetuning.
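
For readers unfamiliar with the variational information bottleneck named in the title, the following is a minimal sketch of the generic objective as it commonly appears in the literature: a task loss (here cross-entropy on the slide label) plus a weighted KL term that compresses the latent code. It assumes a diagonal Gaussian latent and a standard normal prior; the function names, the `beta` value, and the Gaussian parameterization are illustrative assumptions and are not taken from the authors' released code or their exact formulation.

```python
import math


def gaussian_kl_to_standard_normal(mu, log_var):
    """KL( N(mu, diag(exp(log_var))) || N(0, I) ), summed over latent dimensions."""
    return 0.5 * sum(m * m + math.exp(lv) - lv - 1.0 for m, lv in zip(mu, log_var))


def cross_entropy(logits, label):
    """Negative log-likelihood of `label` under softmax(logits), computed stably."""
    peak = max(logits)
    log_norm = peak + math.log(sum(math.exp(x - peak) for x in logits))
    return log_norm - logits[label]


def vib_loss(logits, label, mu, log_var, beta=1e-3):
    """Generic VIB objective: task loss + beta * compression term.

    `beta` (hypothetical default) trades predictive accuracy against how much
    information about the input the latent code is allowed to keep.
    """
    return cross_entropy(logits, label) + beta * gaussian_kl_to_standard_normal(mu, log_var)


if __name__ == "__main__":
    # Toy slide-level prediction over two classes with a 2-d latent code.
    print(vib_loss(logits=[2.0, -1.0], label=0, mu=[0.3, -0.2], log_var=[0.0, -0.5]))
```

Larger values of `beta` push the latent toward the prior and discard more input detail, which is the sense in which a bottleneck representation keeps only task-relevant ("minimal sufficient") information.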

updated: Wed Mar 15 2023 08:41:57 GMT+0000 (UTC)

published: Wed Mar 15 2023 08:41:57 GMT+0000 (UTC)

arXiv
