To The Point: Correspondence-driven monocular 3D category reconstruction

Filippos Kokkinos; Iasonas Kokkinos

要点：通信駆動の単眼3Dカテゴリの再構築

弱い監視から学習した2Dから3Dへの対応を使用して、単一の画像から3Dオブジェクトを再構築する方法であるTo The Point（TTP）を紹介します。最初に 3D テンプレートの頂点に対応する 2D 位置を回帰し、3D 形状投影によって 2D 位置を最適に説明する剛体カメラ変換と非剛体テンプレート変形を共同で推定することにより、2D 画像から 3D 形状を回復します。 3D-2Dの対応に依存することにより、単純なサンプルごとの最適化問題を使用して、カメラのポーズと非剛体変形のCNNベースの回帰を置き換え、それによって実質的により正確な3D再構成を取得します。この最適化を微分可能なレイヤーとして扱い、システム全体をエンドツーエンドでトレーニングします。複数のカテゴリで体系的な定量的改善を報告し、さまざまな形状、ポーズ、テクスチャの予測例を含む定性的な結果を提供します。プロジェクトのウェブサイト：https：//fkokkinos.github.io/to_the_point/。

We present To The Point (TTP), a method for reconstructing 3D objects from a single image using 2D to 3D correspondences learned from weak supervision. We recover a 3D shape from a 2D image by first regressing the 2D positions corresponding to the 3D template vertices and then jointly estimating a rigid camera transform and non-rigid template deformation that optimally explain the 2D positions through the 3D shape projection. By relying on 3D-2D correspondences we use a simple per-sample optimization problem to replace CNN-based regression of camera pose and non-rigid deformation and thereby obtain substantially more accurate 3D reconstructions. We treat this optimization as a differentiable layer and train the whole system in an end-to-end manner. We report systematic quantitative improvements on multiple categories and provide qualitative results comprising diverse shape, pose and texture prediction examples. Project website: https://fkokkinos.github.io/to_the_point/.

updated: Thu Jun 10 2021 11:21:14 GMT+0000 (UTC)

published: Thu Jun 10 2021 11:21:14 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト