Deep Learning-based Biological Anatomical Landmark Detection in Colonoscopy Videos

Kaiwei Che; Chengwei Ye; Yibing Yao; Nachuan Ma; Ruo Zhang; Jiankun Wang; Max Q. -H. Meng

大腸内視鏡検査ビデオにおける深層学習ベースの生物学的解剖学的ランドマーク検出

結腸内視鏡検査は、患者の胃腸（GI）管全体を視覚化して、病変領域をキャプチャするための標準的なイメージングツールです。ただし、大腸内視鏡検査のビデオから抽出された多数の画像を確認するには、臨床医が過度の時間を要します。したがって、結腸内の生物学的解剖学的ランドマークの自動検出が強く求められており、病変領域の位置に関するガイダンス情報を提供することにより、臨床医の負担を軽減するのに役立ちます。この記事では、結腸内視鏡検査ビデオで生物学的解剖学的ランドマークを検出するための新しい深層学習ベースのアプローチを提案します。まず、生の結腸内視鏡検査ビデオシーケンスが前処理されて干渉フレームが拒否されます。次に、ResNet-101ベースのネットワークを使用して、3つの生物学的解剖学的ランドマークを個別に検出し、中間の検出結果を取得します。第3に、ビデオ期間全体でランドマーク期間のより信頼性の高いローカリゼーションを実現するために、時間分布に基づいて誤って予測されたフレームを識別し、それらを正しいクラスに再割り当てすることにより、中間検出結果を後処理することを提案します。最後に、平均検出精度は99.75％に達します。一方、平均IoU 0.91は、予測されたランドマーク期間とグラウンドトゥルースの間に高度な類似性があることを示しています。実験結果は、提案されたモデルが結腸内視鏡検査ビデオから生物学的解剖学的ランドマークを正確に検出および位置特定できることを示しています。

Colonoscopy is a standard imaging tool for visualizing the entire gastrointestinal (GI) tract of patients to capture lesion areas. However, it takes the clinicians excessive time to review a large number of images extracted from colonoscopy videos. Thus, automatic detection of biological anatomical landmarks within the colon is highly demanded, which can help reduce the burden of clinicians by providing guidance information for the locations of lesion areas. In this article, we propose a novel deep learning-based approach to detect biological anatomical landmarks in colonoscopy videos. First, raw colonoscopy video sequences are pre-processed to reject interference frames. Second, a ResNet-101 based network is used to detect three biological anatomical landmarks separately to obtain the intermediate detection results. Third, to achieve more reliable localization of the landmark periods within the whole video period, we propose to post-process the intermediate detection results by identifying the incorrectly predicted frames based on their temporal distribution and reassigning them back to the correct class. Finally, the average detection accuracy reaches 99.75%. Meanwhile, the average IoU of 0.91 shows a high degree of similarity between our predicted landmark periods and ground truth. The experimental results demonstrate that our proposed model is capable of accurately detecting and localizing biological anatomical landmarks from colonoscopy videos.

updated: Fri Aug 06 2021 05:52:32 GMT+0000 (UTC)

published: Fri Aug 06 2021 05:52:32 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト