Towards an Effective and Efficient Transformer for Rain-by-snow Weather Removal

Tao Gao; Yuanbo Wen; Kaihao Zhang; Peng Cheng; Ting Chen

Rain-by-snow weather removal is a specialized task in weather-degraded image restoration that aims to eliminate coexisting rain streaks and snow particles. In this paper, we propose RSFormer, an efficient and effective Transformer that addresses this challenge. First, we explore the proximity of convolutional networks (ConvNets) and vision Transformers (ViTs) in hierarchical architectures and find experimentally that they perform comparably at intra-stage feature learning. On this basis, we use a Transformer-like convolution block (TCB) that replaces the computationally expensive self-attention while preserving the attention characteristic of adapting to input content. We also demonstrate that cross-stage progression is critical for performance improvement, and propose a global-local self-attention sampling mechanism (GLASM) that down-/up-samples features while capturing both global and local dependencies. Finally, we synthesize two novel rain-by-snow datasets, RSCityScape and RS100K, to evaluate the proposed RSFormer. Extensive experiments verify that RSFormer achieves the best trade-off between performance and time consumption among the compared restoration methods. For instance, it outperforms Restormer with 1.53% fewer parameters and 15.6% less inference time. Datasets, source code, and pre-trained models are available at https://github.com/chdwyb/RSFormer.

updated: Fri Oct 27 2023 09:45:18 GMT+0000 (UTC)

published: Thu Apr 06 2023 04:39:23 GMT+0000 (UTC)

arXiv
