OnePose: One-Shot Object Pose Estimation without CAD Models

Jiaming Sun; Zihao Wang; Siyu Zhang; Xingyi He; Hongcheng Zhao; Guofeng Zhang; Xiaowei Zhou

OnePose：CADモデルを使用しないワンショットオブジェクトポーズ推定

オブジェクトポーズ推定のためのOnePoseという名前の新しい方法を提案します。既存のインスタンスレベルまたはカテゴリレベルのメソッドとは異なり、OnePoseはCADモデルに依存せず、インスタンスまたはカテゴリ固有のネットワークトレーニングなしで任意のカテゴリのオブジェクトを処理できます。 OnePoseは、視覚的なローカリゼーションからアイデアを引き出し、オブジェクトのスパースSfMモデルを構築するために、オブジェクトの単純なRGBビデオスキャンのみを必要とします。次に、このモデルは、一般的な特徴マッチングネットワークを使用して新しいクエリ画像に登録されます。既存の視覚的ローカリゼーション手法の実行時間が遅いことを軽減するために、クエリ画像の2D関心点をSfMモデルの3D点と直接一致させる新しいグラフ注意ネットワークを提案し、効率的で堅牢なポーズ推定を実現します。 OnePoseは、機能ベースのポーズトラッカーと組み合わせることで、日常の家庭用品の6Dポーズをリアルタイムで安定して検出および追跡できます。また、150個のオブジェクトの450個のシーケンスで構成される大規模なデータセットを収集しました。

We propose a new method named OnePose for object pose estimation. Unlike existing instance-level or category-level methods, OnePose does not rely on CAD models and can handle objects in arbitrary categories without instance- or category-specific network training. OnePose draws the idea from visual localization and only requires a simple RGB video scan of the object to build a sparse SfM model of the object. Then, this model is registered to new query images with a generic feature matching network. To mitigate the slow runtime of existing visual localization methods, we propose a new graph attention network that directly matches 2D interest points in the query image with the 3D points in the SfM model, resulting in efficient and robust pose estimation. Combined with a feature-based pose tracker, OnePose is able to stably detect and track 6D poses of everyday household objects in real-time. We also collected a large-scale dataset that consists of 450 sequences of 150 objects.

updated: Tue May 24 2022 17:59:21 GMT+0000 (UTC)

published: Tue May 24 2022 17:59:21 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト