Learning Invariant Representation via Contrastive Feature Alignment for Clutter Robust SAR Target Recognition

Bowen Peng; Jianyue Xie; Bo Peng; Li Liu

クラッターロバスト SAR ターゲット認識のための対照的な機能アライメントによる不変表現の学習

ディープニューラルネットワーク (DNN) は、合成開口レーダー自動ターゲット認識 (SAR ATR) を専門知識に基づく機能設計から解放し、従来のソリューションに対する優位性を実証しました。強力なバックグラウンド相関の形での地上車両ベンチマークの独特の欠陥により、DNN がクラッターに過適合し、なじみのない環境に対してロバストでなくなることが示されています。ただし、固定バックグラウンドモデルトレーニングとさまざまなバックグラウンドアプリケーションとの間のギャップは未調査のままです。対照的な学習に着想を得たこのレターでは、堅牢な認識のために不変表現を学習することを目的とした Contrastive Feature Alignment (CFA) と呼ばれるソリューションを提案しています。提案された方法は、混合クラッターバリアント生成戦略と、不変表現学習のためのチャネル加重平均二乗誤差（CWMSE）損失を備えた新しい推論ブランチに貢献します。具体的には、生成戦略は、特徴空間でクラッターに敏感な逸脱をより適切に引き付けるように微妙に設計されています。 CWMSE 損失は、この偏差をより適切に対比し、元の画像と対応するクラッターバリアントによってアクティブ化された深い特徴を整列させるためにさらに考案されています。提案された CFA は、分類と CWMSE 損失の両方を組み合わせてモデルを共同でトレーニングします。これにより、不変のターゲット表現の漸進的な学習が可能になります。 MSTAR データセットと 6 つの DNN モデルに関する広範な評価により、提案の有効性が証明されています。その結果、CFA でトレーニングされたモデルは、データセットに含まれていないなじみのない環境の中でターゲットを認識でき、さまざまな信号対クラッター比に対して堅牢であることが実証されました。

The deep neural networks (DNNs) have freed the synthetic aperture radar automatic target recognition (SAR ATR) from expertise-based feature designing and demonstrated superiority over conventional solutions. There has been shown the unique deficiency of ground vehicle benchmarks in shapes of strong background correlation results in DNNs overfitting the clutter and being non-robust to unfamiliar surroundings. However, the gap between fixed background model training and varying background application remains underexplored. Inspired by contrastive learning, this letter proposes a solution called Contrastive Feature Alignment (CFA) aiming to learn invariant representation for robust recognition. The proposed method contributes a mixed clutter variants generation strategy and a new inference branch equipped with channel-weighted mean square error (CWMSE) loss for invariant representation learning. In specific, the generation strategy is delicately designed to better attract clutter-sensitive deviation in feature space. The CWMSE loss is further devised to better contrast this deviation and align the deep features activated by the original images and corresponding clutter variants. The proposed CFA combines both classification and CWMSE losses to train the model jointly, which allows for the progressive learning of invariant target representation. Extensive evaluations on the MSTAR dataset and six DNN models prove the effectiveness of our proposal. The results demonstrated that the CFA-trained models are capable of recognizing targets among unfamiliar surroundings that are not included in the dataset, and are robust to varying signal-to-clutter ratios.

updated: Tue Apr 04 2023 12:35:33 GMT+0000 (UTC)

published: Tue Apr 04 2023 12:35:33 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト