Comparative study of deep learning methods for the automatic segmentation of lung, lesion and lesion type in CT scans of COVID-19 patients

Sofie Tilborghs; Ine Dirks; Lucas Fidon; Siri Willems; Tom Eelbode; Jeroen Bertels; Bart Ilsen; Arne Brys; Adriana Dubbeldam; Nico Buls; Panagiotis Gonidakis; Sebastián Amador Sánchez; Annemiek Snoeckx; Paul M. Parizel; Johan de Mey; Dirk Vandermeulen; Tom Vercauteren; David Robben; Dirk Smeets; Frederik Maes; Jef Vandemeulebroucke; Paul Suetens

COVID-19患者のCTスキャンにおける肺、病変および病変タイプの自動セグメンテーションのための深層学習法の比較研究

COVID-19に関する最近の研究は、CTイメージングが疾患の理解を助けることに加えて、疾患の進行を評価し、診断を支援するための有用な情報を提供することを示唆しています。胸部CTスキャンを使用してCOVID-19を迅速かつ正確に定量化するためにディープラーニングを使用することを提案する研究が増えています。関心のある主なタスクは、COVID-19の確認済みまたは疑いのある患者の胸部CTスキャンにおける肺および肺病変の自動セグメンテーションです。この研究では、オープンソースアルゴリズムと自社開発アルゴリズムの両方を含む、マルチセンターデータセットを使用した12の深層学習アルゴリズムを比較します。結果は、さまざまな方法を組み合わせることで、肺セグメンテーション、バイナリ病変セグメンテーション、およびマルチクラス病変セグメンテーションのテストセット全体のパフォーマンスを向上させ、それぞれ平均ダイススコアが0.982、0.724、0.469になることを示しています。得られたバイナリ病変は、91.3 mlの平均絶対体積誤差でセグメント化されました。一般に、異なる病変タイプを区別する作業はより難しく、平均絶対容積の差は152 mlであり、統合およびすりガラスの不透明度の平均ダイススコアはそれぞれ0.369および0.523でした。すべての方法は、人間の評価者による視覚的評価よりも優れた平均ボリュームエラーでバイナリ病変セグメンテーションを実行します。これらの方法は、臨床実習で使用する大規模評価に十分成熟していることを示唆しています。

Recent research on COVID-19 suggests that CT imaging provides useful information to assess disease progression and assist diagnosis, in addition to help understanding the disease. There is an increasing number of studies that propose to use deep learning to provide fast and accurate quantification of COVID-19 using chest CT scans. The main tasks of interest are the automatic segmentation of lung and lung lesions in chest CT scans of confirmed or suspected COVID-19 patients. In this study, we compare twelve deep learning algorithms using a multi-center dataset, including both open-source and in-house developed algorithms. Results show that ensembling different methods can boost the overall test set performance for lung segmentation, binary lesion segmentation and multiclass lesion segmentation, resulting in mean Dice scores of 0.982, 0.724 and 0.469, respectively. The resulting binary lesions were segmented with a mean absolute volume error of 91.3 ml. In general, the task of distinguishing different lesion types was more difficult, with a mean absolute volume difference of 152 ml and mean Dice scores of 0.369 and 0.523 for consolidation and ground glass opacity, respectively. All methods perform binary lesion segmentation with an average volume error that is better than visual assessment by human raters, suggesting these methods are mature enough for a large-scale evaluation for use in clinical practice.

updated: Mon Jan 10 2022 08:26:14 GMT+0000 (UTC)

published: Wed Jul 29 2020 10:40:39 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト