DualPoseNet: Category-level 6D Object Pose and Size Estimation Using Dual Pose Network with Refined Learning of Pose Consistency

Jiehong Lin; Zewei Wei; Zhihao Li; Songcen Xu; Kui Jia; Yuanqing Li

Category-level 6D object pose and size estimation aims to predict the full pose configuration of rotation, translation, and size for object instances observed in single, arbitrary views of cluttered scenes. In this paper, we propose a new method for this task, a Dual Pose Network with refined learning of pose consistency, shortened as DualPoseNet. DualPoseNet stacks two parallel pose decoders on top of a shared pose encoder, where the implicit decoder predicts object poses with a working mechanism different from that of the explicit one; the two decoders thus impose complementary supervision on the training of the pose encoder. We construct the encoder based on spherical convolutions, and design within it a Spherical Fusion module for a better embedding of pose-sensitive features from the appearance and shape observations. Given no testing CAD models, it is the novel introduction of the implicit decoder that enables refined pose prediction during testing, by enforcing the predicted pose consistency between the two decoders using a self-adaptive loss term. Thorough experiments on benchmarks of both category- and instance-level object pose datasets confirm the efficacy of our designs. DualPoseNet outperforms existing methods by a large margin in the regime of high precision. Our code is released publicly at https://github.com/Gorilla-Lab-SCUT/DualPoseNet.
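The pose-consistency idea can be illustrated with a small sketch. The abstract does not give the form of the loss, so the following rests on assumptions: that the explicit decoder outputs a rotation matrix, a translation, and a size vector, that the implicit decoder outputs a canonical-frame coordinate for each observed point, and that consistency is measured as the mean distance between the two. The function names and the normalization by the size norm are hypothetical and are not taken from the released code.

```python
import math


def transform_to_canonical(point, rotation, translation, size):
    """Map an observed 3D point into the canonical frame: R^T (p - t) / ||s||.

    Assumption: the explicit decoder yields rotation (3x3 nested list),
    translation (length 3), and size (length 3).
    """
    scale = math.sqrt(sum(v * v for v in size))
    centered = [p - t for p, t in zip(point, translation)]
    # R^T @ centered: column j of R dotted with the centered point.
    return [
        sum(rotation[i][j] * centered[i] for i in range(3)) / scale
        for j in range(3)
    ]


def pose_consistency_loss(observed, canonical_pred, rotation, translation, size):
    """Mean Euclidean distance between the implicit decoder's canonical points
    (assumed output) and the canonical points implied by the explicit pose."""
    total = 0.0
    for p, q in zip(observed, canonical_pred):
        q_explicit = transform_to_canonical(p, rotation, translation, size)
        total += math.sqrt(sum((a - b) ** 2 for a, b in zip(q, q_explicit)))
    return total / len(observed)


# Hypothetical example: a 90-degree rotation about the z-axis.
rotation = [[0.0, -1.0, 0.0], [1.0, 0.0, 0.0], [0.0, 0.0, 1.0]]
translation = [1.0, 2.0, 3.0]
size = [3.0, 4.0, 0.0]  # norm is 5
observed = [[0.0, 2.5, 3.0]]
consistent = pose_consistency_loss(
    observed, [[0.1, 0.2, 0.0]], rotation, translation, size
)
inconsistent = pose_consistency_loss(
    observed, [[0.1, 0.2, 1.0]], rotation, translation, size
)
```

When the two predictions agree, the loss is zero; it grows as they diverge. Minimizing a term of this kind at test time needs no CAD model, because it compares only the network's own two outputs.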

updated: Mon Aug 16 2021 13:02:58 GMT+0000 (UTC)

published: Thu Mar 11 2021 08:33:47 GMT+0000 (UTC)

arXiv
