Generalized Categorisation of Digital Pathology Whole Image Slides using Unsupervised Learning

Mostafa Ibrahim; Kevin Bryson

教師なし学習を使用したデジタルパソロジー全画像スライドの一般化された分類

このプロジェクトは、大きな病理画像を小さなタイルに分割し、真のラベルの知識がなくてもそれらのタイルを個別のグループにクラスター化することを目的としています。分析では、腫瘍細胞と非腫瘍細胞のクラスター化の特定の側面がいかに難しいかを示し、監視されていないさまざまなアプローチの結果は、簡単な作業ではありません。このプロジェクトは、デジタルパソロジーコミュニティで使用されるソフトウェアパッケージも提供します。このソフトウェアパッケージは、教師なし、教師なしタイル分類を実行するために開発されたアプローチのいくつかを使用し、手動で簡単にラベル付けできます。このプロジェクトでは、K-Meansやガウス混合モデルなどの従来のクラスタリングアルゴリズムから、ディープオートエンコーダーやマルチロス学習などのより複雑な特徴抽出手法まで、さまざまな手法を組み合わせて使用しています。プロジェクト全体を通して、完全性スコアやクラスタープロットなどのいくつかの指標を使用して、評価のベンチマークを設定しようとします。結果全体を通して、畳み込みオートエンコーダーは、その強力な内部表現学習能力により、他のアプローチをわずかに上回っていることを示しています。さらに、ガウス混合モデルは、さまざまなクラスターをキャプチャする柔軟性があるため、平均してK-Meansよりも優れた結果を生成することを示します。また、さまざまなタイプの病理テクスチャを分類することの難しさの大きな違いも示しています。

This project aims to break down large pathology images into small tiles and then cluster those tiles into distinct groups without the knowledge of true labels, our analysis shows how difficult certain aspects of clustering tumorous and non-tumorous cells can be and also shows that comparing the results of different unsupervised approaches is not a trivial task. The project also provides a software package to be used by the digital pathology community, that uses some of the approaches developed to perform unsupervised unsupervised tile classification, which could then be easily manually labelled. The project uses a mixture of techniques ranging from classical clustering algorithms such as K-Means and Gaussian Mixture Models to more complicated feature extraction techniques such as deep Autoencoders and Multi-loss learning. Throughout the project, we attempt to set a benchmark for evaluation using a few measures such as completeness scores and cluster plots. Throughout our results we show that Convolutional Autoencoders manages to slightly outperform the rest of the approaches due to its powerful internal representation learning abilities. Moreover, we show that Gaussian Mixture models produce better results than K-Means on average due to its flexibility in capturing different clusters. We also show the huge difference in the difficulties of classifying different types of pathology textures.

updated: Sun Dec 27 2020 14:38:22 GMT+0000 (UTC)

published: Sun Dec 27 2020 14:38:22 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト