Deep Surface Normal Estimation with Hierarchical RGB-D Fusion

Jin Zeng; Yanfeng Tong; Yunmu Huang; Qiong Yan; Wenxiu Sun; Jing Chen; Yongtian Wang

The growing availability of commodity RGB-D cameras has broadened applications in scene understanding. However, surface normal estimation from RGB-D data, a fundamental scene understanding task, has not been thoroughly investigated. In this paper, a hierarchical fusion network with adaptive feature re-weighting is proposed for surface normal estimation from a single RGB-D image. Specifically, features from the color image and the depth map are successively integrated at multiple scales to ensure global surface smoothness while preserving visually salient details. Meanwhile, the depth features are re-weighted with a confidence map estimated from the depth input before they are merged into the color branch, which avoids artifacts caused by corrupted input depth. Additionally, a hybrid multi-scale loss function is designed to learn accurate normal estimation from a noisy ground-truth dataset. Extensive experimental results validate the effectiveness of the fusion strategy and the loss design, and the proposed method outperforms state-of-the-art normal estimation schemes.
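The re-weighting and multi-scale fusion described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' network: the function names (`reweight_and_fuse`, `downsample2x`, `hierarchical_fuse`), the use of channel concatenation as the merge step, and the use of average pooling in place of learned encoder layers are all assumptions made for illustration only.

```python
import numpy as np

def reweight_and_fuse(color_feat, depth_feat, confidence):
    """Scale depth features by a per-pixel confidence, then merge with color.

    color_feat, depth_feat: arrays of shape (C, H, W).
    confidence: array of shape (H, W) with values in [0, 1].
    Concatenation is an assumed merge operator, not taken from the paper.
    """
    confidence = np.clip(confidence, 0.0, 1.0)
    # Low-confidence (e.g. corrupted) depth pixels contribute little or nothing.
    weighted_depth = depth_feat * confidence[None, :, :]
    return np.concatenate([color_feat, weighted_depth], axis=0)

def downsample2x(x):
    """Average-pool the last two axes by a factor of 2 (assumes even H and W)."""
    *lead, h, w = x.shape
    return x.reshape(*lead, h // 2, 2, w // 2, 2).mean(axis=(-3, -1))

def hierarchical_fuse(color_feat, depth_feat, confidence, num_scales=3):
    """Fuse color and depth at several scales, coarse scales via pooling.

    In a real network each scale would come from learned layers; pooling is
    a hypothetical stand-in so the sketch stays self-contained.
    """
    fused = []
    for _ in range(num_scales):
        fused.append(reweight_and_fuse(color_feat, depth_feat, confidence))
        color_feat, depth_feat, confidence = map(
            downsample2x, (color_feat, depth_feat, confidence)
        )
    return fused
```

With this arrangement, a region where the confidence map is zero passes only color information into the fused features at every scale, which is the behavior the abstract attributes to the re-weighting step.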

updated: Wed Nov 20 2019 07:12:19 GMT+0000 (UTC)

published: Sat Apr 06 2019 09:54:38 GMT+0000 (UTC)

arXiv
