Self-supervised Learning of 3D Objects from Natural Images

Hiroharu Kato; Tatsuya Harada

自然画像からの3Dオブジェクトの自己学習

分類された自然画像からオブジェクトの3D形状、ポーズ、およびテクスチャの単一ビュー再構成を自己監視方式で学習する方法を提示します。これは非常に不適切な問題であるため、トレーニング方法を慎重に設計し、制約を導入することが不可欠です。すべての要素を同時にトレーニングする難しさを回避するために、固定ポーズ分布と単純なテクスチャを使用したカテゴリ固有の基本形状のトレーニングを最初に提案し、その後、取得した形状を使用してポーズとテクスチャをトレーニングします。別の問題は、オブジェクトの表面のテクスチャを誤って再構築するために、形状と背景が過度に複雑になる場合があることです。それを抑制するために、オブジェクト表面と背景画像に強い正則化と制約を使用することを提案します。これら2つの手法を使用して、CIFAR-10やPASCALオブジェクトなどの自然画像コレクションをトレーニングに使用できることを示します。これは、合成データセット以外のさまざまなオブジェクトカテゴリで3Dオブジェクト再構築を実現する可能性を示します。

We present a method to learn single-view reconstruction of the 3D shape, pose, and texture of objects from categorized natural images in a self-supervised manner. Since this is a severely ill-posed problem, carefully designing a training method and introducing constraints are essential. To avoid the difficulty of training all elements at the same time, we propose training category-specific base shapes with fixed pose distribution and simple textures first, and subsequently training poses and textures using the obtained shapes. Another difficulty is that shapes and backgrounds sometimes become excessively complicated to mistakenly reconstruct textures on object surfaces. To suppress it, we propose using strong regularization and constraints on object surfaces and background images. With these two techniques, we demonstrate that we can use natural image collections such as CIFAR-10 and PASCAL objects for training, which indicates the possibility to realize 3D object reconstruction on diverse object categories beyond synthetic datasets.

updated: Wed Nov 20 2019 12:07:12 GMT+0000 (UTC)

published: Wed Nov 20 2019 12:07:12 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト