Negative Evidence Matters in Interpretable Histology Image Classification

Soufiane Belharbi; Marco Pedersoli; Ismail Ben Ayed; Luke McCaffrey; Eric Granger

Using only global annotations such as image class labels, weakly-supervised learning methods allow CNN classifiers to jointly classify an image and yield the regions of interest associated with the predicted class. However, without any guidance at the pixel level, such methods may yield inaccurate regions. This problem is known to be more challenging with histology images than with natural ones, since objects are less salient, structures have more variations, and foreground and background regions have stronger similarities. Therefore, methods from the computer vision literature for visual interpretation of CNNs may not directly apply. In this work, we propose a simple yet efficient method based on a composite loss function that leverages information from the fully negative samples. Our new loss function contains two complementary terms: the first exploits positive evidence collected from the CNN classifier, while the second leverages the fully negative samples from the training dataset. In particular, we equip a pre-trained classifier with a decoder that allows refining the regions of interest. The same classifier is exploited to collect both the positive and negative evidence at the pixel level to train the decoder. This makes it possible to take advantage of the fully negative samples that occur naturally in the data, without any additional supervision signals and using only the image class as supervision. On the public GlaS benchmark for colon cancer and a Camelyon16 patch-based benchmark for breast cancer, using three different backbones, we show that our method yields substantial improvements over several recent related methods. Our results show the benefits of using both negative and positive evidence, i.e., the one obtained from a classifier and the one naturally available in datasets. We provide an ablation study of both terms. Our code is publicly available.
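
To make the idea of a two-term loss concrete, the following is a minimal NumPy sketch and not the authors' implementation: the abstract does not give the exact formulation. It assumes the decoder outputs a per-pixel foreground probability map, that positive evidence comes from thresholding a classifier activation map into pseudo-labels, and that a fully negative image contains no region of interest, so every pixel can be treated as background. The function names, the thresholds `hi` and `lo`, and the weight `lam` are all hypothetical.

```python
import numpy as np

EPS = 1e-7  # numerical floor so log() stays finite


def positive_evidence_loss(fg_prob, cam, hi=0.7, lo=0.3):
    """Partial cross-entropy on pixels where the classifier map is confident.

    fg_prob: decoder foreground probabilities, shape (H, W).
    cam: classifier activation map scaled to [0, 1], shape (H, W).
    hi, lo: hypothetical thresholds for pseudo-foreground / pseudo-background.
    """
    fg_prob = np.clip(fg_prob, EPS, 1.0 - EPS)
    fg_mask = cam >= hi  # pixels taken as pseudo-foreground
    bg_mask = cam <= lo  # pixels taken as pseudo-background
    n = fg_mask.sum() + bg_mask.sum()
    if n == 0:
        return 0.0  # no confident pixels, so no positive evidence to learn from
    total = np.log(fg_prob[fg_mask]).sum() + np.log(1.0 - fg_prob[bg_mask]).sum()
    return float(-total / n)


def negative_evidence_loss(fg_prob):
    """Cross-entropy toward 'background' on every pixel of a fully negative image."""
    fg_prob = np.clip(fg_prob, EPS, 1.0 - EPS)
    return float(-np.log(1.0 - fg_prob).mean())


def composite_loss(fg_prob, cam, is_fully_negative, lam=1.0):
    """Pick the term that matches the image-level label (lam is an assumed weight)."""
    if is_fully_negative:
        return lam * negative_evidence_loss(fg_prob)
    return positive_evidence_loss(fg_prob, cam)
```

In this sketch only the image-level label decides which term applies, which mirrors the claim that no supervision beyond the image class is needed; how the two terms are actually balanced and sampled is described in the paper, not here.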

updated: Fri Jan 07 2022 13:26:18 GMT+0000 (UTC)

published: Fri Jan 07 2022 13:26:18 GMT+0000 (UTC)

arXiv
