Red Blood Cell Segmentation with Overlapping Cell Separation and Classification on Imbalanced Dataset

Korranat Naruenatthanaset; Thanarat H. Chalidabhongse; Duangdao Palasuwan; Nantheera Anantrasirichai; Attakorn Palasuwan

Automated red blood cell (RBC) classification on blood smear images helps hematologists analyze RBC lab results with reduced time and cost. However, overlapping cells can cause incorrect predictions, so they have to be separated into single RBCs before classification. In multi-class classification with deep learning, imbalance problems are common in medical imaging because normal samples always outnumber rare disease samples. This paper presents a new method to segment and classify RBCs from blood smear images, specifically to tackle the cell overlapping and data imbalance problems. Focusing on overlapping cell separation, our segmentation process first estimates ellipses to represent RBCs: the method detects the concave points and then finds the ellipses using directed ellipse fitting. The segmentation accuracy on 20 blood smear images was 0.889. Classification requires balanced training datasets, but some RBC types are rare. The dataset, with 12 RBC classes and 20,875 individual RBC samples, had an imbalance ratio of 34.538. Using machine learning for RBC classification with such an imbalanced dataset is hence more challenging than in many other applications. We analyzed techniques to deal with this problem. The best accuracy and F1-score, obtained using EfficientNet-B1 with augmentation, were 0.921 and 0.8679, respectively. Experimental results showed that the weight balancing technique with augmentation had the potential to deal with imbalance problems by improving the F1-score on minority classes, while data augmentation significantly improved the overall classification performance.
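As a rough illustration of the imbalance ratio and weight balancing mentioned in the abstract, the sketch below computes both for a toy label list. It assumes the imbalance ratio is the largest class count divided by the smallest, and it uses the common inverse-frequency weighting heuristic; the abstract does not state the exact formulas the authors used. The function names and the toy labels are hypothetical and are not taken from the paper or its dataset.

```python
from collections import Counter


def imbalance_ratio(labels):
    """Largest class count divided by smallest (assumed definition)."""
    counts = Counter(labels)
    return max(counts.values()) / min(counts.values())


def balanced_class_weights(labels):
    """Inverse-frequency weights: n_samples / (n_classes * class_count).

    This is a common heuristic, not necessarily the paper's scheme.
    """
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * n) for cls, n in counts.items()}


# Hypothetical toy labels, not the paper's data.
labels = ["normal"] * 90 + ["rare_a"] * 8 + ["rare_b"] * 2

ratio = imbalance_ratio(labels)
weights = balanced_class_weights(labels)

print(f"imbalance ratio: {ratio:.3f}")
for cls, w in sorted(weights.items()):
    print(f"{cls}: weight {w:.3f}")
```

Weights of this kind are typically passed to a weighted loss function so that errors on rare classes count more during training, which is one plausible way minority-class F1-scores can improve.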

updated: Tue Aug 03 2021 06:55:46 GMT+0000 (UTC)

published: Wed Dec 02 2020 16:49:51 GMT+0000 (UTC)

arXiv
