Single-stream CNN with Learnable Architecture for Multi-source Remote Sensing Data

Yi Yang; Daoye Zhu; Tengteng Qu; Qiangyu Wang; Fuhu Ren; Chengqi Cheng

In this paper, we propose an efficient and generalizable framework based on deep convolutional neural networks (CNNs) for the joint classification of multi-source remote sensing data. While recent methods are mostly based on multi-stream architectures, we use group convolution to construct equivalent network architectures efficiently within a single-stream network. We further adopt and improve dynamic grouping convolution (DGConv) so that the group convolution hyperparameters, and thus the overall network architecture, are learnable during network training. The proposed method can therefore, in theory, adapt any modern CNN model to any multi-source remote sensing data set, and can potentially avoid sub-optimal solutions caused by manually chosen architecture hyperparameters. In the experiments, the proposed method is applied to ResNet and UNet, and the adapted networks are evaluated on three very diverse benchmark data sets (i.e., Houston2018 data, Berlin data, and MUUFL data). Experimental results demonstrate the effectiveness of the proposed single-stream CNNs; in particular, ResNet18-DGConv improves the state-of-the-art overall accuracy (OA) of classification on the HS-SAR Berlin data set from 62.23% to 68.21%. The experiments yield two interesting findings. First, using DGConv generally reduces the variance of test OA. Second, a multi-stream structure harms model performance if imposed on the first few layers, but becomes beneficial if applied to deeper layers. Altogether, the findings imply that a multi-stream architecture, rather than being a strictly necessary component of deep learning models for multi-source remote sensing data, essentially plays the role of a model regularizer. Our code is publicly available at https://github.com/yyyyangyi/Multi-source-RS-DGConv. We hope our work can inspire novel research in the future.
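The equivalence between a multi-stream layer and a single grouped layer can be illustrated with a small sketch. It is not the authors' implementation: it uses only 1x1 channel mixing at a single pixel, all weights and inputs are made-up values, and the helper names (`matvec`, `kron`, `group_mask`, `masked`) are hypothetical. The mask construction follows the Kronecker-product form of the original DGConv formulation as we understand it, with gates fixed by hand rather than learned; the improved DGConv described in the abstract may differ.

```python
# Sketch under stated assumptions: pure Python, 1x1 "convolutions" only
# (channel mixing at one pixel). All numbers are illustrative.

def matvec(weight, x):
    """Apply a channel-mixing matrix (out_ch x in_ch) to one pixel's channels."""
    return [sum(w * v for w, v in zip(row, x)) for row in weight]

def kron(a, b):
    """Kronecker product of two matrices given as lists of lists."""
    return [[a_ij * b_kl for a_ij in a_row for b_kl in b_row]
            for a_row in a for b_row in b]

def group_mask(gates):
    """Build a 0/1 connectivity mask from binary gates (hypothetical helper).

    Each gate contributes a 2x2 factor: all-ones (gate = 1, channels mixed)
    or identity (gate = 0, channels kept apart).
    """
    ones = [[1, 1], [1, 1]]
    eye = [[1, 0], [0, 1]]
    mask = [[1]]
    for g in gates:
        mask = kron(mask, ones if g else eye)
    return mask

def masked(weight, mask):
    """Element-wise product of a dense weight and a connectivity mask."""
    return [[w * m for w, m in zip(w_row, m_row)]
            for w_row, m_row in zip(weight, mask)]

# Two hypothetical sources with two channels each (e.g. HS and SAR features).
x_hs, x_sar = [1.0, 2.0], [3.0, 4.0]
w_hs = [[1.0, 0.5], [0.0, 2.0]]
w_sar = [[-1.0, 1.0], [0.5, 0.5]]

# Multi-stream: each source has its own layer; outputs are concatenated.
two_stream = matvec(w_hs, x_hs) + matvec(w_sar, x_sar)

# Single-stream: one dense 4x4 weight whose cross-source entries (9.0) are
# switched off by the mask for gates [0, 1], leaving a block-diagonal layer.
dense = [[1.0, 0.5, 9.0, 9.0],
         [0.0, 2.0, 9.0, 9.0],
         [9.0, 9.0, -1.0, 1.0],
         [9.0, 9.0, 0.5, 0.5]]
single_stream = matvec(masked(dense, group_mask([0, 1])), x_hs + x_sar)

print(two_stream == single_stream)
```

With gates `[1, 1]` the mask is all ones and the layer is an ordinary dense layer that mixes both sources; with gates `[0, 0]` it is the identity mask, so every channel is processed on its own. In this sketch, changing the gates is what changes the architecture, which is the sense in which grouping hyperparameters can be treated as trainable.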

updated: Mon Sep 13 2021 16:10:41 GMT+0000 (UTC)

published: Mon Sep 13 2021 16:10:41 GMT+0000 (UTC)

arXiv
