Self-Supervised Learning from Unlabeled Fundus Photographs Improves Segmentation of the Retina

Jan Kukačka; Anja Zenz; Marcel Kollovieh; Dominik Jüstel; Vasilis Ntziachristos

ラベルのない眼底写真からの自己教師あり学習は、網膜のセグメンテーションを改善します

眼底写真は、網膜イメージングの主要な方法であり、糖尿病性網膜症の予防に不可欠です。眼底写真の自動セグメンテーションは、アイケアスクリーニングプログラムの品質、容量、および費用対効果を向上させます。ただし、現在のセグメンテーション方法は、実際の臨床アプリケーションに典型的なイメージング条件と病状の多様性に対して堅牢ではありません。これらの制限を克服するために、対照的な自己教師あり学習を利用して、公開されているEyePACSデータセット内のさまざまなラベルのない眼底画像を活用しました。 U-Netのエンコーダーを事前にトレーニングし、後でいくつかの網膜血管と病変のセグメンテーションデータセットで微調整しました。対照的な自己教師あり学習を使用することにより、事前にトレーニングされたネットワークが、ラベルを提供せずに血管、視神経乳頭、中心窩、およびさまざまな病変を認識できることを初めて示します。さらに、下流の血管セグメンテーションタスクで微調整すると、このような事前トレーニングされたネットワークは、さまざまなデータセットからの画像で最先端のパフォーマンスを実現します。さらに、事前トレーニングにより、トレーニング時間が短縮され、血管と病変のセグメンテーションタスクの両方で数ショットのパフォーマンスが向上します。全体として、私たちの結果は、わずかな注釈付きサンプルで新しいデバイスに適応できる堅牢なモデルを必要とする実際の臨床アプリケーションで重要な役割を果たすことができる、対照的な自己教師あり事前トレーニングの利点を示しています。

Fundus photography is the primary method for retinal imaging and essential for diabetic retinopathy prevention. Automated segmentation of fundus photographs would improve the quality, capacity, and cost-effectiveness of eye care screening programs. However, current segmentation methods are not robust towards the diversity in imaging conditions and pathologies typical for real-world clinical applications. To overcome these limitations, we utilized contrastive self-supervised learning to exploit the large variety of unlabeled fundus images in the publicly available EyePACS dataset. We pre-trained an encoder of a U-Net, which we later fine-tuned on several retinal vessel and lesion segmentation datasets. We demonstrate for the first time that by using contrastive self-supervised learning, the pre-trained network can recognize blood vessels, optic disc, fovea, and various lesions without being provided any labels. Furthermore, when fine-tuned on a downstream blood vessel segmentation task, such pre-trained networks achieve state-of-the-art performance on images from different datasets. Additionally, the pre-training also leads to shorter training times and an improved few-shot performance on both blood vessel and lesion segmentation tasks. Altogether, our results showcase the benefits of contrastive self-supervised pre-training which can play a crucial role in real-world clinical applications requiring robust models able to adapt to new devices with only a few annotated samples.

updated: Thu Aug 05 2021 18:02:56 GMT+0000 (UTC)

published: Thu Aug 05 2021 18:02:56 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト