Red Blood Cell Segmentation with Overlapping Cell Separation and Classification on Imbalanced Dataset

Korranat Naruenatthanaset; Thanarat H. Chalidabhongse; Duangdao Palasuwan; Nantheera Anantrasirichai; Attakorn Palasuwan

Automated red blood cell (RBC) classification on blood smear images helps hematologists analyze RBC lab results in less time and at lower cost. Overlapping cells can cause incorrect predictions, so they have to be separated into individual RBCs before classification. When classifying multiple classes with deep learning, imbalance problems are common in medical imaging because normal samples always outnumber rare disease samples. This paper presents a new method to segment and classify red blood cells from blood smear images, specifically to tackle the cell overlapping and data imbalance problems. Focusing on overlapping cell separation, our segmentation process first estimates ellipses to represent red blood cells. The method detects the concave points and then finds the ellipses using directed ellipse fitting. The segmentation accuracy is 0.889 on 20 blood smear images. Classification requires balanced training datasets; however, some RBC types are rare. The imbalance ratio is 34.538 across 12 classes with 20,875 individual red blood cell samples. Using machine learning for RBC classification with an imbalanced dataset is hence more challenging than in many other applications. We analyze techniques to deal with this problem. The best accuracy and F1 score are 0.921 and 0.8679, obtained with EfficientNet-b1 with augmentation. Experimental results show that the weight balancing technique with augmentation has the potential to deal with imbalance problems by improving the F1 score on minority classes, while data augmentation significantly improves the overall classification performance.
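As a rough illustration of the imbalance ratio and the weight balancing idea mentioned in the abstract, the sketch below computes both on a toy label list. It assumes the imbalance ratio is the largest class count divided by the smallest, and it uses the common inverse-frequency heuristic for class weights; the abstract does not state the exact definitions used in the paper, and the function names and labels here are hypothetical.

```python
from collections import Counter


def imbalance_ratio(labels):
    """Largest class count divided by smallest class count (assumed definition)."""
    counts = Counter(labels)
    return max(counts.values()) / min(counts.values())


def balanced_class_weights(labels):
    """Inverse-frequency weights: n_samples / (n_classes * class_count).

    This is a common heuristic, not necessarily the scheme used in the paper.
    """
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * n) for cls, n in counts.items()}


# Hypothetical toy data: one frequent class and two rare ones.
labels = ["normal"] * 90 + ["rare_a"] * 6 + ["rare_b"] * 4

ratio = imbalance_ratio(labels)
weights = balanced_class_weights(labels)

print(f"imbalance ratio: {ratio}")
for cls, w in sorted(weights.items()):
    print(f"{cls}: {w:.3f}")
```

Weights of this kind are typically passed to a weighted loss function so that errors on rare classes cost more than errors on the majority class.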

updated: Wed Dec 09 2020 06:29:54 GMT+0000 (UTC)

published: Wed Dec 02 2020 16:49:51 GMT+0000 (UTC)

arXiv
