Quantifying the Effect of Image Similarity on Diabetic Foot Ulcer Classification

Imran Chowdhury Dipto; Bill Cassidy; Connah Kendrick; Neil D. Reeves; Joseph M. Pappachan; Vishnu Chandrabalan; Moi Hoon Yap

This study investigates how visually similar images in a publicly available diabetic foot ulcer dataset affect the training of deep learning classification networks. Binary-identical duplicate images in training datasets are a well-known problem: they can introduce unwanted bias that degrades network performance. The effect of visually similar but non-identical images, however, is under-researched and has not previously been investigated in diabetic foot ulcer studies. We use an open-source fuzzy algorithm to identify groups of increasingly similar images in the Diabetic Foot Ulcers Challenge 2021 (DFUC2021) training dataset. For each similarity threshold, we create a new training set and use it to train a range of deep learning multi-class classifiers. We then evaluate the best-performing model on the DFUC2021 test set. The best performance was achieved by an InceptionResNetV2 network trained on the training set from which images at the 80% similarity threshold had been removed. This model showed improvements in F1-score, precision, and recall of 0.023, 0.029, and 0.013, respectively. These results indicate that highly similar images can contribute to performance-degrading bias in the DFUC2021 dataset, and that removing images that are 80% similar from the training set can help improve classification performance.
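The threshold-based filtering described in the abstract can be illustrated with a small sketch. The abstract does not name the fuzzy algorithm used, so the code below swaps in a simple average hash as a stand-in, and it is not the authors' method. The function names (`average_hash`, `similarity`, `filter_similar`), the choice to keep the first image of each similar group, and the assumption that all images are grayscale grids already resized to the same dimensions are all hypothetical choices made for illustration.

```python
from typing import List, Sequence

# Hypothetical stand-in for a decoded image: a grayscale pixel grid.
# All images are assumed to have been resized to the same dimensions.
Image = Sequence[Sequence[float]]


def average_hash(image: Image) -> List[int]:
    """Return one bit per pixel: 1 if the pixel is brighter than the image mean."""
    pixels = [p for row in image for p in row]
    mean = sum(pixels) / len(pixels)
    return [1 if p > mean else 0 for p in pixels]


def similarity(hash_a: Sequence[int], hash_b: Sequence[int]) -> float:
    """Return the percentage of bit positions on which two hashes agree."""
    matches = sum(a == b for a, b in zip(hash_a, hash_b))
    return 100.0 * matches / len(hash_a)


def filter_similar(images: Sequence[Image], threshold: float = 80.0) -> List[int]:
    """Return indices of images to keep.

    An image is dropped if it is at least `threshold` percent similar to an
    image that has already been kept. Keeping the first image of each similar
    group is an assumption of this sketch, not something the abstract states.
    """
    kept: List[int] = []
    kept_hashes: List[List[int]] = []
    for idx, image in enumerate(images):
        h = average_hash(image)
        if all(similarity(h, k) < threshold for k in kept_hashes):
            kept.append(idx)
            kept_hashes.append(h)
    return kept
```

Calling `filter_similar` once per threshold would yield one candidate training set per threshold, mirroring the per-threshold training sets the abstract describes. A production pipeline would more likely use a robust perceptual hash or learned features rather than this per-pixel comparison.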

updated: Tue Apr 25 2023 16:54:27 GMT+0000 (UTC)

published: Tue Apr 25 2023 16:54:27 GMT+0000 (UTC)

arXiv
