A Deep Perceptual Measure for Lens and Camera Calibration

Yannick Hold-Geoffroy; Dominique Piché-Meunier; Kalyan Sunkavalli; Jean-Charles Bazin; François Rameau; Jean-François Lalonde

Image editing and compositing have become ubiquitous in entertainment, from digital art to AR and VR experiences. To produce beautiful composites, the camera needs to be geometrically calibrated, which can be tedious and requires a physical calibration target. In place of the traditional multi-image calibration process, we propose to infer camera calibration parameters such as pitch, roll, field of view, and lens distortion directly from a single image using a deep convolutional neural network. We train this network using automatically generated samples from a large-scale panorama dataset, yielding competitive accuracy in terms of standard $\ell_2$ error. However, we argue that minimizing such standard error metrics might not be optimal for many applications. In this work, we investigate human sensitivity to inaccuracies in geometric camera calibration. To this end, we conduct a large-scale human perception study in which we ask participants to judge the realism of 3D objects composited with correct and biased camera calibration parameters. Based on this study, we develop a new perceptual measure for camera calibration and demonstrate that our deep calibration network outperforms previous single-image-based calibration methods both on standard metrics and on this novel perceptual measure. Finally, we demonstrate the use of our calibration network for several applications, including virtual object insertion, image retrieval, and compositing. A demonstration of our approach is available at https://lvsn.github.io/deepcalib.

updated: Wed Jul 26 2023 22:04:46 GMT+0000 (UTC)

published: Thu Aug 25 2022 18:40:45 GMT+0000 (UTC)

arXiv
