Overcoming the limitations of patch-based learning to detect cancer in whole slide images

Ozan Ciga; Tony Xu; Sharon Nofech-Mozes; Shawna Noy; Fang-I Lu; Anne L. Martel

スライド画像全体で癌を検出するためのパッチベースの学習の制限を克服する

全スライド画像（WSI）は、深層学習モデルをトレーニングするときに固有の課題をもたらします。それらは非常に大きいため、分析のために各画像を小さなパッチに分割する必要があり、詳細とコンテキストの両方をキャプチャするために画像の特徴を複数のスケールで抽出する必要があり、極端なクラスの不均衡が存在する可能性があります。これらの画像の分析は、主に公開された注釈付きデータセットが利用できるようになったおかげで、大幅に進歩しました。ただし、メソッドがチャレンジタスクで高いスコアを獲得したとしても、この成功は、より臨床的に関連性のあるワークフローでの良好なパフォーマンスに変換されない可能性があると仮定します。多くのデータセットは、データキュレーションバイアスに悩まされる可能性のある画像パッチで構成されています。他のデータセットはスライドレベル全体でのみラベル付けされ、画像全体に注釈がないため、最終的な決定が正しい限り、誤ったローカル予測がマスクされる可能性があります。このホワイトペーパーでは、パッチまたはスライドレベルの分類と、スライド全体で癌を正確に特定またはセグメント化する必要がある方法との違いについて概説し、両方のケースでベストプラクティスが異なることを実験的に検証します。術前補助療法後の乳がんWSIにバイナリがん検出ネットワークを適用して、スライド全体で感度と精度を必要とするがんの範囲の概要を示す腫瘍床を見つけます。私たちは、アーキテクチャや拡張など、複数の設計の選択とその結果への影響を広範囲に研究しています。さらに、偽陽性率（スライドレベルで7％）を大幅に削減し、問題に関連する各メトリックを改善し、腫瘍範囲のエラーを15％削減する、ネガティブデータサンプリング戦略を提案します。

Whole slide images (WSIs) pose unique challenges when training deep learning models. They are very large which makes it necessary to break each image down into smaller patches for analysis, image features have to be extracted at multiple scales in order to capture both detail and context, and extreme class imbalances may exist. Significant progress has been made in the analysis of these images, thanks largely due to the availability of public annotated datasets. We postulate, however, that even if a method scores well on a challenge task, this success may not translate to good performance in a more clinically relevant workflow. Many datasets consist of image patches which may suffer from data curation bias; other datasets are only labelled at the whole slide level and the lack of annotations across an image may mask erroneous local predictions so long as the final decision is correct. In this paper, we outline the differences between patch or slide-level classification versus methods that need to localize or segment cancer accurately across the whole slide, and we experimentally verify that best practices differ in both cases. We apply a binary cancer detection network on post neoadjuvant therapy breast cancer WSIs to find the tumor bed outlining the extent of cancer, a task which requires sensitivity and precision across the whole slide. We extensively study multiple design choices and their effects on the outcome, including architectures and augmentations. Furthermore, we propose a negative data sampling strategy, which drastically reduces the false positive rate (7% on slide level) and improves each metric pertinent to our problem, with a 15% reduction in the error of tumor extent.

updated: Tue Dec 01 2020 16:37:18 GMT+0000 (UTC)

published: Tue Dec 01 2020 16:37:18 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト