SIR: Self-supervised Image Rectification via Seeing the Same Scene from Multiple Different Lenses

Jinlong Fan; Jing Zhang; Dacheng Tao

SIR：複数の異なるレンズから同じシーンを見ることによる自己監視画像の平行化

ディープラーニングは、大規模な合成データセットに基づく教師ありトレーニングを介してディープニューラルネットワークの表現能力を活用することにより、画像修正におけるその力を実証しています。ただし、特定の歪みモデルの普遍性が限られており、歪みと補正のプロセスを明示的にモデル化していないため、モデルは合成画像に適合しすぎて、実際の魚眼画像では一般化されない場合があります。本論文では、異なるレンズからの同じシーンの歪んだ画像の補正結果は同じでなければならないという重要な洞察に基づいて、新しい自己監視画像補正（SIR）法を提案します。具体的には、共有エンコーダーといくつかの予測ヘッドを備えた新しいネットワークアーキテクチャを考案します。各予測ヘッドは、特定の歪みモデルの歪みパラメーターを予測します。さらに、微分可能なワーピングモジュールを活用して、歪みパラメータから修正された画像と再歪みされた画像を生成し、トレーニング中にそれらの間のモデル内およびモデル間の一貫性を活用します。これにより、地面を必要とせずに自己教師あり学習スキームが実現します。 -真の歪みパラメータまたは通常の画像。合成データセットと実際の魚眼画像での実験は、私たちの方法が、監視されたベースライン方法と代表的な最先端の方法と同等またはそれ以上のパフォーマンスを達成することを示しています。自己教師あり学習は、自己一貫性を維持しながら、歪みモデルの普遍性も向上させます。

Deep learning has demonstrated its power in image rectification by leveraging the representation capacity of deep neural networks via supervised training based on a large-scale synthetic dataset. However, the model may overfit the synthetic images and generalize not well on real-world fisheye images due to the limited universality of a specific distortion model and the lack of explicitly modeling the distortion and rectification process. In this paper, we propose a novel self-supervised image rectification (SIR) method based on an important insight that the rectified results of distorted images of a same scene from different lens should be the same. Specifically, we devise a new network architecture with a shared encoder and several prediction heads, each of which predicts the distortion parameter of a specific distortion model. We further leverage a differentiable warping module to generate the rectified images and re-distorted images from the distortion parameters and exploit the intra- and inter-model consistency between them during training, thereby leading to a self-supervised learning scheme without the need for ground-truth distortion parameters or normal images. Experiments on synthetic dataset and real-world fisheye images demonstrate that our method achieves comparable or even better performance than the supervised baseline method and representative state-of-the-art methods. Self-supervised learning also improves the universality of distortion models while keeping their self-consistency.

updated: Fri Jun 18 2021 07:26:29 GMT+0000 (UTC)

published: Mon Nov 30 2020 08:23:25 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト