A Deep Perceptual Measure for Lens and Camera Calibration

Yannick Hold-Geoffroy; Dominique Piché-Meunier; Kalyan Sunkavalli; Jean-Charles Bazin; François Rameau; Jean-François Lalonde

レンズとカメラのキャリブレーションのための深い知覚測定

画像の編集と合成は、デジタルアートから AR や VR 体験に至るまで、エンターテインメントのあらゆる分野で行われています。美しい合成画像を生成するには、カメラを幾何学的にキャリブレーションする必要があります。これは面倒な作業であり、物理的なキャリブレーションターゲットが必要です。従来のマルチイメージキャリブレーションプロセスの代わりに、深い畳み込みニューラルネットワークを使用して、ピッチ、ロール、視野、レンズ歪みなどのカメラキャリブレーションパラメーターを単一の画像から直接推測することを提案します。大規模なパノラマデータセットから自動的に生成されたサンプルを使用してこのネットワークをトレーニングし、標準 l2 エラーに関して競争力のある精度を実現します。ただし、このような標準エラーメトリックを最小化することは、多くのアプリケーションにとって最適ではない可能性があると主張します。この作業では、幾何学的なカメラのキャリブレーションの不正確さに対する人間の感度を調査します。この目的のために、大規模な人間の知覚研究を実施し、参加者に、正しいカメラキャリブレーションパラメーターとバイアスされたカメラキャリブレーションパラメーターを組み合わせた 3D オブジェクトのリアリズムを判断するよう依頼します。この研究に基づいて、カメラキャリブレーションの新しい知覚尺度を開発し、標準メトリックとこの新しい知覚尺度の両方で、ディープキャリブレーションネットワークが以前の単一画像ベースのキャリブレーション方法よりも優れていることを示します。最後に、仮想オブジェクトの挿入、画像の検索、合成など、いくつかのアプリケーションでキャリブレーションネットワークを使用する方法を示します。私たちのアプローチのデモンストレーションは、https://lvsn.github.io/deepcalib で入手できます。

Image editing and compositing have become ubiquitous in entertainment, from digital art to AR and VR experiences. To produce beautiful composites, the camera needs to be geometrically calibrated, which can be tedious and requires a physical calibration target. In place of the traditional multi-images calibration process, we propose to infer the camera calibration parameters such as pitch, roll, field of view, and lens distortion directly from a single image using a deep convolutional neural network. We train this network using automatically generated samples from a large-scale panorama dataset, yielding competitive accuracy in terms of standard l2 error. However, we argue that minimizing such standard error metrics might not be optimal for many applications. In this work, we investigate human sensitivity to inaccuracies in geometric camera calibration. To this end, we conduct a large-scale human perception study where we ask participants to judge the realism of 3D objects composited with correct and biased camera calibration parameters. Based on this study, we develop a new perceptual measure for camera calibration and demonstrate that our deep calibration network outperforms previous single-image based calibration methods both on standard metrics as well as on this novel perceptual measure. Finally, we demonstrate the use of our calibration network for several applications, including virtual object insertion, image retrieval, and compositing. A demonstration of our approach is available at https://lvsn.github.io/deepcalib .

updated: Thu Aug 25 2022 18:40:45 GMT+0000 (UTC)

published: Thu Aug 25 2022 18:40:45 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト