SwinCheX: Multi-label classification on chest X-ray images with transformers

Sina Taslimi; Soroush Taslimi; Nima Fathi; Mohammadreza Salehi; Mohammad Hossein Rohban

Given the considerable growth in the use of chest X-ray images for diagnosing various diseases, and the collection of extensive datasets, automated diagnosis with deep neural networks has attracted considerable attention from experts. Most available computer vision methods use a CNN backbone to achieve high accuracy on classification problems. Nevertheless, recent research shows that transformers, established as the de facto method in NLP, can also outperform many CNN-based models in vision. This paper proposes a multi-label classification deep model that uses the Swin Transformer as its backbone to achieve state-of-the-art diagnosis classification. It uses a Multi-Layer Perceptron (MLP) for the head architecture. We evaluate our model on one of the largest and most widely used X-ray datasets, "Chest X-ray14," which comprises more than 100,000 frontal/back-view images from over 30,000 patients with 14 well-known chest diseases. We tested our model with several numbers of MLP layers in the head, each of which achieves a competitive AUC score on all classes. Comprehensive experiments on Chest X-ray14 show that a 3-layer head attains state-of-the-art performance with an average AUC score of 0.810, compared to the former SOTA average AUC of 0.799. We propose an experimental setup for the fair benchmarking of existing methods, which could serve as a basis for future studies. Finally, we followed up on our results by confirming that the proposed method attends to the pathologically relevant areas of the chest.
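The abstract reports an AUC averaged over the 14 disease classes. As a rough illustration of how such a multi-label metric can be computed, the sketch below scores each class independently and then takes the unweighted mean. It is a minimal pure-Python example, not the authors' evaluation code: the function names `binary_auc` and `mean_auc` are hypothetical, and AUC is computed from pairwise comparisons (the Mann-Whitney formulation), which may differ from the tooling used in the paper.

```python
from typing import List, Sequence


def binary_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC for a single class: the fraction of (positive, negative) pairs
    in which the positive sample gets the higher score. Ties count as 0.5."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC is undefined without both positive and negative samples")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


def mean_auc(label_matrix: List[List[int]], score_matrix: List[List[float]]) -> float:
    """Unweighted mean of per-class AUCs.

    Each row is one image; each column is one class (14 columns for a
    dataset with 14 disease labels). In multi-label classification an
    image may have several positive labels, so classes are scored
    independently of one another."""
    num_classes = len(label_matrix[0])
    per_class = [
        binary_auc([row[c] for row in label_matrix],
                   [row[c] for row in score_matrix])
        for c in range(num_classes)
    ]
    return sum(per_class) / num_classes


if __name__ == "__main__":
    # Toy data (made up for illustration): 4 images, 2 classes.
    toy_labels = [[1, 0], [0, 1], [1, 1], [0, 0]]
    toy_scores = [[0.9, 0.2], [0.1, 0.8], [0.7, 0.3], [0.4, 0.6]]
    print(mean_auc(toy_labels, toy_scores))
```

The pairwise loop is quadratic in the number of samples, which is fine for a toy example; for a dataset of 100,000 images a rank-based implementation would be the more practical choice.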

updated: Thu Jun 09 2022 03:17:57 GMT+0000 (UTC)

published: Thu Jun 09 2022 03:17:57 GMT+0000 (UTC)

arXiv
