Learning 3D Object Categories by Looking Around Them

David Novotny; Diane Larlus; Andrea Vedaldi

周りを見回して3Dオブジェクトカテゴリを学習する

3Dオブジェクトカテゴリを学習するための従来のアプローチでは、合成データまたは手動監視のいずれかを使用します。本論文では、手動の注釈を必要とせず、代わりに移動する視点からオブジェクトを観察することによって手がかりとなる方法を提案します。私たちのシステムは、2つの革新に基づいています。3D形状を明示的に比較することなく、さまざまなビデオを堅牢に整列させるシャムの視点因数分解ネットワーク。部分的な観察からオブジェクトの完全な形状を抽出できる3D形状完成ネットワーク。また、確率的予測を実行するようにネットワークを構成することの利点と、ジオメトリを意識したデータ拡張スキームの利点についても説明します。公開されているベンチマークで最新の結果を取得します。

Traditional approaches for learning 3D object categories use either synthetic data or manual supervision. In this paper, we propose a method which does not require manual annotations and is instead cued by observing objects from a moving vantage point. Our system builds on two innovations: a Siamese viewpoint factorization network that robustly aligns different videos together without explicitly comparing 3D shapes; and a 3D shape completion network that can extract the full shape of an object from partial observations. We also demonstrate the benefits of configuring networks to perform probabilistic predictions as well as of geometry-aware data augmentation schemes. We obtain state-of-the-art results on publicly-available benchmarks.

updated: Thu Dec 02 2021 14:49:48 GMT+0000 (UTC)

published: Wed May 10 2017 21:01:39 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト