Application of Transfer Learning and Ensemble Learning in Image-level Classification for Breast Histopathology

Yuchao Zheng; Chen Li; Xiaomin Zhou; Haoyuan Chen; Hao Xu; Yixin Li; Haiqing Zhang; Xiaoyan Li; Hongzan Sun; Xinyu Huang; Marcin Grzegorzek

Background: Breast cancer is the most prevalent cancer among women worldwide. The classification and diagnosis of breast cancer from histopathological images has long been a focus of clinical attention. In Computer-Aided Diagnosis (CAD), traditional classification models mostly use a single network to extract features, which significantly limits them. In addition, many networks are trained and optimized on patient-level datasets and ignore lower-level data labels. Method: This paper proposes a deep ensemble model based on image-level labels for the binary classification of benign and malignant lesions in breast histopathological images. First, the BreaKHis dataset is randomly divided into training, validation, and test sets. Second, data augmentation techniques are used to balance the number of benign and malignant samples. Third, considering transfer learning performance and the complementarity between networks, VGG16, Xception, ResNet50, and DenseNet201 are selected as the base classifiers. Result: The ensemble network model that uses accuracy as the weight achieves an accuracy of 98.90% on image-level binary classification. To verify the capabilities of our method, the latest Transformer and Multilayer Perceptron (MLP) models were experimentally compared on the same dataset. Our model outperforms them by a margin of 5% to 20%, which underlines the value of ensemble models in classification tasks. Conclusion: This research focuses on improving classification performance with an ensemble algorithm. Transfer learning plays an essential role on small datasets, improving training speed and accuracy. Our model outperforms many existing approaches in accuracy and provides a method for auxiliary medical diagnosis.
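The accuracy-weighted ensemble described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact fusion rule, so this assumes weighted soft voting: each base classifier's class probabilities are weighted by its validation accuracy, normalized so the weights sum to one. The function name, the probability values, the accuracy values, and the label convention (0 = benign, 1 = malignant) are all hypothetical and are not taken from the paper.

```python
import numpy as np


def accuracy_weighted_ensemble(probs, val_accuracies):
    """Fuse base-classifier outputs by accuracy-weighted soft voting.

    probs: array-like of shape (n_models, n_samples, n_classes)
    val_accuracies: array-like of shape (n_models,)
    Returns (predicted_labels, fused_probabilities).
    """
    probs = np.asarray(probs, dtype=float)
    weights = np.asarray(val_accuracies, dtype=float)
    weights = weights / weights.sum()  # normalize weights to sum to 1
    # Contract the model axis: result has shape (n_samples, n_classes)
    fused = np.tensordot(weights, probs, axes=1)
    return fused.argmax(axis=1), fused


# Hypothetical softmax outputs from four base classifiers
# (standing in for VGG16, Xception, ResNet50, DenseNet201)
# on three images; columns are assumed to be [benign, malignant].
example_probs = [
    [[0.90, 0.10], [0.40, 0.60], [0.20, 0.80]],
    [[0.80, 0.20], [0.60, 0.40], [0.30, 0.70]],
    [[0.70, 0.30], [0.30, 0.70], [0.60, 0.40]],
    [[0.95, 0.05], [0.45, 0.55], [0.10, 0.90]],
]
# Hypothetical validation accuracies, one per base classifier
example_accuracies = [0.95, 0.96, 0.94, 0.97]

labels, fused = accuracy_weighted_ensemble(example_probs, example_accuracies)
```

Because the weights are normalized, the fused rows remain valid probability distributions, and a base classifier with higher validation accuracy has proportionally more influence on the final label.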

updated: Tue May 10 2022 01:08:44 GMT+0000 (UTC)

published: Mon Apr 18 2022 13:31:53 GMT+0000 (UTC)

arXiv
