NanoNet: Real-Time Polyp Segmentation in Video Capsule Endoscopy and Colonoscopy

Debesh Jha; Nikhil Kumar Tomar; Sharib Ali; Michael A. Riegler; Håvard D. Johansen; Dag Johansen; Thomas de Lange; Pål Halvorsen

Deep learning in gastrointestinal endoscopy can help improve clinical performance and assess lesions more accurately. To this end, semantic segmentation methods that perform automated real-time delineation of a region of interest, e.g., boundary identification of cancerous or precancerous lesions, can benefit both diagnosis and interventions. However, accurate real-time segmentation of endoscopic images is extremely challenging because endoscopy is highly operator dependent and the images are high definition. To use automated methods in clinical settings, it is crucial to design lightweight, low-latency models that can be integrated with low-end endoscope hardware. In this work, we propose NanoNet, a novel architecture for the segmentation of video capsule endoscopy and colonoscopy images. Our proposed architecture runs in real time and achieves higher segmentation accuracy than other, more complex architectures. To evaluate the effectiveness of our approach, we use video capsule endoscopy and standard colonoscopy datasets with polyps, as well as a dataset consisting of endoscopy biopsies and surgical instruments. Our experiments show that our architecture achieves a better trade-off among model complexity, speed, number of parameters, and metric performance. Moreover, the resulting model is relatively tiny, with only about 36,000 parameters, compared to traditional deep learning approaches that have millions of parameters.

updated: Thu Apr 22 2021 15:40:28 GMT+0000 (UTC)

published: Thu Apr 22 2021 15:40:28 GMT+0000 (UTC)

arXiv
