PCNet: A Structure Similarity Enhancement Method for Multispectral and Multimodal Image Registration

Si-Yuan Cao; Beinan Yu; Lun Luo; Shu-Jie Chen; Chunguang Li; Hui-Liang Shen

PCNet: マルチスペクトルおよびマルチモーダル画像レジストレーションのための構造類似性強化法

マルチスペクトルおよびマルチモーダル画像は、マルチソース視覚情報融合の分野で重要な用途です。画像デバイスの変更や移動により、取得したマルチスペクトル画像とマルチモーダル画像は通常ずれているため、画像のレジストレーションが前提条件となります。一般的な画像のレジストレーションとは異なり、マルチスペクトルまたはマルチモーダル画像のレジストレーションは、強度と勾配の非線形変動による困難な問題です。この課題に対処するために、位相一致ネットワーク (PCNet) を提案して、マルチスペクトルまたはマルチモーダル画像の構造類似性を高めます。次に、ネットワークによって生成された類似性が強化された特徴マップを使用して、画像を整列させることができます。 PCNet は、よく知られている位相一致に着想を得て構築されています。このネットワークは、2 つの単純なトレーニング可能なレイヤーと一連の変更された学習可能なガボールカーネルに事前に位相一致を組み込みます。事前の知識のおかげで、トレーニングが完了すると、PCNet は、追加のチューニングなしで、フラッシュ/フラッシュなしおよび RGB/NIR 画像などのさまざまなマルチスペクトルおよびマルチモーダルデータに適用できます。また、事前分布はネットワークを軽量にします。 PCNet のトレーニング可能なパラメーターは、深層学習の登録方法である DHN の 2400 分の 1 ですが、その登録パフォーマンスは DHN を上回っています。実験結果は、PCNet が現在の最先端の従来のマルチモーダル登録アルゴリズムよりも優れていることを検証しています。さらに、PCNet は深層学習登録方法の補完的な部分として機能し、登録精度を大幅に向上させます。 UDHN の 1 ピクセル平均コーナーエラー (ACE) 未満の画像数の割合は、PCNet の処理後に 0.2% から 89.9% に上昇します。

Multispectral and multimodal images are of important usage in the field of multi-source visual information fusion. Due to the alternation or movement of image devices, the acquired multispectral and multimodal images are usually misaligned, and hence image registration is pre-requisite. Different from the registration of common images, the registration of multispectral or multimodal images is a challenging problem due to the nonlinear variation of intensity and gradient. To cope with this challenge, we propose the phase congruency network (PCNet) to enhance the structure similarity of multispectral or multimodal images. The images can then be aligned using the similarity-enhanced feature maps produced by the network. PCNet is constructed under the inspiration of the well-known phase congruency. The network embeds the phase congruency prior into two simple trainable layers and series of modified learnable Gabor kernels. Thanks to the prior knowledge, once trained, PCNet is applicable on a variety of multispectral and multimodal data such as flash/no-flash and RGB/NIR images without additional further tuning. The prior also makes the network lightweight. The trainable parameters of PCNet are 2400 times less than the deep-learning registration method DHN, while its registration performance surpasses DHN. Experimental results validate that PCNet outperforms current state-of-the-art conventional multimodal registration algorithms. Besides, PCNet can act as a complementary part of the deep-learning registration methods, which significantly boosts their registration accuracy. The percentage of the number of images under 1 pixel average corner error (ACE) of UDHN is raised from 0.2% to 89.9% after the processing of PCNet.

updated: Mon Oct 10 2022 03:15:09 GMT+0000 (UTC)

published: Wed Jun 09 2021 15:00:51 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト