Detection of masses and architectural distortions in digital breast tomosynthesis: a publicly available dataset of 5,060 patients and a deep learning model

Mateusz Buda; Ashirbani Saha; Ruth Walsh; Sujata Ghate; Nianyi Li; Albert Święcicki; Joseph Y. Lo; Maciej A. Mazurowski

デジタル乳房トモシンセシスにおける腫瘤と構造的歪みの検出：5,060人の患者の公開されているデータセットと深層学習モデル

乳がん検診は最も一般的な放射線検査の1つであり、毎年3,900万件を超える検査が実施されています。乳がんのスクリーニングは、人工知能の最も研究されている医用画像アプリケーションの1つですが、十分に注釈が付けられた大規模な公開データセットがないため、アルゴリズムの開発と評価が妨げられています。これは、比較的新しい乳がんスクリーニングモダリティであるデジタル乳房トモシンセシス（DBT）にとって特に問題です。私たちは、デジタル乳房トモシンセシス画像の大規模なデータセットをキュレートして公開しました。これには、5,060人の患者からの5,610件の研究に属する22,032個の再構築されたDBTボリュームが含まれています。これには4つのグループが含まれていました：（1）5,129の正常な研究、（2）追加の画像診断が必要であるが生検が行われなかった280の研究、（3）112の良性生検研究、および（4）89の癌研究。私たちのデータセットには、2人の経験豊富な放射線科医によって注釈が付けられた質量と建築の歪みが含まれていました。さらに、単相深層学習検出モデルを開発し、データセットを使用してテストし、将来の研究のベースラインとして使用しました。私たちのモデルは、乳房ごとに2つの偽陽性で65％の感度に達しました。私たちの大規模で多様で高度にキュレーションされたデータセットは、トレーニング用のデータとモデル検証用の一般的なケースのセットを提供することで、乳がんスクリーニング用のAIアルゴリズムの開発と評価を容易にします。私たちの研究で開発されたモデルのパフォーマンスは、タスクが依然として挑戦的であり、将来のモデル開発のベースラインとして役立つことを示しています。

Breast cancer screening is one of the most common radiological tasks with over 39 million exams performed each year. While breast cancer screening has been one of the most studied medical imaging applications of artificial intelligence, the development and evaluation of the algorithms are hindered due to the lack of well-annotated large-scale publicly available datasets. This is particularly an issue for digital breast tomosynthesis (DBT) which is a relatively new breast cancer screening modality. We have curated and made publicly available a large-scale dataset of digital breast tomosynthesis images. It contains 22,032 reconstructed DBT volumes belonging to 5,610 studies from 5,060 patients. This included four groups: (1) 5,129 normal studies, (2) 280 studies where additional imaging was needed but no biopsy was performed, (3) 112 benign biopsied studies, and (4) 89 studies with cancer. Our dataset included masses and architectural distortions which were annotated by two experienced radiologists. Additionally, we developed a single-phase deep learning detection model and tested it using our dataset to serve as a baseline for future research. Our model reached a sensitivity of 65% at 2 false positives per breast. Our large, diverse, and highly-curated dataset will facilitate development and evaluation of AI algorithms for breast cancer screening through providing data for training as well as common set of cases for model validation. The performance of the model developed in our study shows that the task remains challenging and will serve as a baseline for future model development.

updated: Sun Nov 20 2022 17:34:21 GMT+0000 (UTC)

published: Fri Nov 13 2020 18:33:31 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト