Global Adaptive Filtering Layer for Computer Vision

Viktor Shipitsin; Iaroslav Bespalov; Dmitry V. Dylov

コンピュータビジョンのためのグローバル適応フィルタリング層

ユニバーサルアダプティブニューラルレイヤーを考案して、コンピュータービジョンタスクを実行するベースニューラルネットワークの重みとともに、各画像の最適な周波数フィルターを「学習」します。提案されたアプローチは、空間領域でソース画像を取得し、周波数領域から最適な周波数を自動的に選択し、逆変換画像をメインニューラルネットワークに送信します。驚くべきことに、このような単純なアドオンレイヤーは、設計に関係なく、メインネットワークのパフォーマンスを劇的に向上させます。ライトネットワークでは、パフォーマンスメトリックが著しく向上することがわかります。一方、重いもののトレーニングは、アダプティブレイヤーがメインアーキテクチャと一緒に「学習」できるようになると、より速く収束します。人気のある自然および医療データのベンチマークを考慮して、分類、セグメンテーション、ノイズ除去、および消去という4つの古典的なコンピュータービジョンタスクでアイデアを検証します。

We devise a universal adaptive neural layer to "learn" optimal frequency filter for each image together with the weights of the base neural network that performs some computer vision task. The proposed approach takes the source image in the spatial domain, automatically selects the best frequencies from the frequency domain, and transmits the inverse-transform image to the main neural network. Remarkably, such a simple add-on layer dramatically improves the performance of the main network regardless of its design. We observe that the light networks gain a noticeable boost in the performance metrics; whereas, the training of the heavy ones converges faster when our adaptive layer is allowed to "learn" alongside the main architecture. We validate the idea in four classical computer vision tasks: classification, segmentation, denoising, and erasing, considering popular natural and medical data benchmarks.

updated: Wed Aug 04 2021 15:52:46 GMT+0000 (UTC)

published: Fri Oct 02 2020 19:43:49 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト