Multi-Category Mesh Reconstruction From Image Collections

Alessandro Simoni; Stefano Pini; Roberto Vezzani; Rita Cucchiara

画像コレクションからのマルチカテゴリメッシュ再構成

最近、学習フレームワークは、単一のRGB画像からオブジェクトの正確な形状、ポーズ、およびテクスチャを推測する機能を示しています。ただし、現在の方法は、特定の事前確率を活用するために単一のカテゴリの画像コレクションでトレーニングされており、多くの場合、カテゴリ固有の3Dテンプレートを利用します。このホワイトペーパーでは、一連の変形可能な3Dモデルと、インスタンス固有の変形、ポーズ、およびテクスチャのセットを組み合わせて、オブジェクトのテクスチャメッシュを推測する代替アプローチを紹介します。以前の作品とは異なり、私たちの方法は、前景マスクとラフなカメラポーズのみを監視として使用して、複数のオブジェクトカテゴリの画像でトレーニングされています。特定の3Dテンプレートがない場合、フレームワークは、描写されたオブジェクトの3D形状を復元するために変形されるカテゴリレベルのモデルを学習します。インスタンス固有の変形は、学習した3Dメッシュの頂点ごとに個別に予測されるため、トレーニングプロセス中にメッシュを動的に細分化できます。実験は、提案されたフレームワークが異なるオブジェクトカテゴリを区別し、教師なしの方法でカテゴリ固有の形状事前分布を学習できることを示しています。予測された形状は滑らかで、トレーニングプロセス中の細分化の複数のステップから活用でき、2つの公開データセットで同等または最先端の結果を取得します。モデルとコードは公開されています。

Recently, learning frameworks have shown the capability of inferring the accurate shape, pose, and texture of an object from a single RGB image. However, current methods are trained on image collections of a single category in order to exploit specific priors, and they often make use of category-specific 3D templates. In this paper, we present an alternative approach that infers the textured mesh of objects combining a series of deformable 3D models and a set of instance-specific deformation, pose, and texture. Differently from previous works, our method is trained with images of multiple object categories using only foreground masks and rough camera poses as supervision. Without specific 3D templates, the framework learns category-level models which are deformed to recover the 3D shape of the depicted object. The instance-specific deformations are predicted independently for each vertex of the learned 3D mesh, enabling the dynamic subdivision of the mesh during the training process. Experiments show that the proposed framework can distinguish between different object categories and learn category-specific shape priors in an unsupervised manner. Predicted shapes are smooth and can leverage from multiple steps of subdivision during the training process, obtaining comparable or state-of-the-art results on two public datasets. Models and code are publicly released.

updated: Thu Oct 21 2021 16:32:31 GMT+0000 (UTC)

published: Thu Oct 21 2021 16:32:31 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト