Pulmonary Disease Classification Using Globally Correlated Maximum Likelihood: an Auxiliary Attention mechanism for Convolutional Neural Networks

Edward Verenich; Tobias Martin; Alvaro Velasquez; Nazar Khan; Faraz Hussain

Convolutional neural networks (CNNs) are now widely used to classify and detect pulmonary abnormalities in chest radiographs. Two complementary generalization properties of CNNs, translation invariance and equivariance, are particularly useful for detecting manifested abnormalities associated with pulmonary disease, regardless of their spatial location within the image. However, these properties come at the cost of losing exact spatial information and the global relative positions of abnormalities detected in local regions. The global relative positions of such abnormalities may help distinguish similar conditions, such as COVID-19 and viral pneumonia. Such cases call for a global attention mechanism, which traditional CNN architectures do not support because they aim for the generalization afforded by translation invariance and equivariance. Vision Transformers provide a global attention mechanism but lack translation invariance and equivariance, and therefore require significantly more training samples to match the generalization of CNNs. To address the loss of spatial information and of global relations between features while preserving the inductive biases of CNNs, we present a novel technique that serves as an auxiliary attention mechanism for existing CNN architectures and extracts global correlations between salient features.
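The loss of global relative position described in the abstract can be illustrated with a toy example. The sketch below is not the paper's technique. It assumes a hypothetical 4x4 feature map holding two unit activations, and uses plain global max and average pooling as stand-ins for the position-discarding stages of a CNN. All function names are illustrative.

```python
import math

def make_map(size, positions):
    """Build a size x size grid of zeros with activation 1.0 at each (row, col)."""
    grid = [[0.0] * size for _ in range(size)]
    for r, c in positions:
        grid[r][c] = 1.0
    return grid

def global_max_pool(feature_map):
    """Collapse the whole map to its single largest activation."""
    return max(max(row) for row in feature_map)

def global_avg_pool(feature_map):
    """Collapse the whole map to its mean activation."""
    total = sum(sum(row) for row in feature_map)
    return total / (len(feature_map) * len(feature_map[0]))

def peak_distance(feature_map):
    """Euclidean distance between the two activated cells (assumes exactly two)."""
    peaks = [(r, c) for r, row in enumerate(feature_map)
             for c, v in enumerate(row) if v > 0]
    (r0, c0), (r1, c1) = peaks
    return math.hypot(r1 - r0, c1 - c0)

# Two hypothetical findings: adjacent in one map, in opposite corners in the other.
clustered = make_map(4, [(0, 0), (0, 1)])
spread = make_map(4, [(0, 0), (3, 3)])

pooled_clustered = (global_max_pool(clustered), global_avg_pool(clustered))
pooled_spread = (global_max_pool(spread), global_avg_pool(spread))

# The pooled descriptors are identical even though the layouts differ.
print(pooled_clustered == pooled_spread)
print(peak_distance(clustered), peak_distance(spread))
```

Both maps pool to the same descriptor, so a classifier that sees only the pooled values cannot tell the two layouts apart. The distance between the activations, which differs between the maps, is the kind of global relational information that an auxiliary mechanism would need to recover.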

updated: Wed Sep 01 2021 19:03:46 GMT+0000 (UTC)

published: Wed Sep 01 2021 19:03:46 GMT+0000 (UTC)

arXiv
