EMC2A-Net: An Efficient Multibranch Cross-channel Attention Network for SAR Target Classification

Xiang Yu; Zhe Geng; Xiaohua Huang; Qinglu Wang; Daiyin Zhu

EMC2A-Net: SAR ターゲット分類のための効率的なマルチブランチクロスチャネルアテンションネットワーク

近年、畳み込みニューラルネットワーク (CNN) は、合成開口レーダー (SAR) ターゲット認識において大きな可能性を示しています。 SAR 画像には強い粒度感があり、従来の CNN モデルではめったに考慮されない、スペックルノイズ、ターゲットの主な散乱体、ターゲットの輪郭など、さまざまなスケールのテクスチャ機能があります。この論文では、マルチブランチ構造に基づく 2 つの残差ブロック、つまりマルチスケール受容野 (RF) を備えた EMC2A ブロックを提案し、効率的な同位体アーキテクチャの深層 CNN (DCNN)、EMC2A-Net を設計しました。 EMC2A ブロックは、異なる拡張率で並列拡張畳み込みを利用します。これにより、計算負荷を大幅に増加させることなく、マルチスケールコンテキストの特徴を効果的にキャプチャできます。マルチスケール機能融合の効率をさらに改善するために、このペーパーでは、次元削減なしでローカルマルチスケール機能相互作用戦略を採用するマルチスケール機能クロスチャネルアテンションモジュール、つまり EMC2A モジュールを提案しました。この戦略は、効率的な 1 次元 (1D) 循環畳み込みとシグモイド関数を介して各チャネルの重みを適応的に調整し、グローバルチャネルごとのレベルで注意を導きます。 MSTAR データセットの比較結果は、EMC2A-Net が同じタイプの既存の利用可能なモデルよりも優れており、比較的軽量なネットワーク構造を持っていることを示しています。アブレーション実験の結果は、EMC2A モジュールが、いくつかのパラメーターと適切なクロスチャネル相互作用のみを使用することで、モデルのパフォーマンスを大幅に改善することを示しています。

In recent years, convolutional neural networks (CNNs) have shown great potential in synthetic aperture radar (SAR) target recognition. SAR images have a strong sense of granularity and have different scales of texture features, such as speckle noise, target dominant scatterers and target contours, which are rarely considered in the traditional CNN model. This paper proposed two residual blocks, namely EMC2A blocks with multiscale receptive fields(RFs), based on a multibranch structure and then designed an efficient isotopic architecture deep CNN (DCNN), EMC2A-Net. EMC2A blocks utilize parallel dilated convolution with different dilation rates, which can effectively capture multiscale context features without significantly increasing the computational burden. To further improve the efficiency of multiscale feature fusion, this paper proposed a multiscale feature cross-channel attention module, namely the EMC2A module, adopting a local multiscale feature interaction strategy without dimensionality reduction. This strategy adaptively adjusts the weights of each channel through efficient one-dimensional (1D)-circular convolution and sigmoid function to guide attention at the global channel wise level. The comparative results on the MSTAR dataset show that EMC2A-Net outperforms the existing available models of the same type and has relatively lightweight network structure. The ablation experiment results show that the EMC2A module significantly improves the performance of the model by using only a few parameters and appropriate cross-channel interactions.

updated: Wed Aug 03 2022 04:31:52 GMT+0000 (UTC)

published: Wed Aug 03 2022 04:31:52 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト