HoW-3D: Holistic 3D Wireframe Perception from a Single Image

Wenchao Ma; Bin Tan; Nan Xue; Tianfu Wu; Xianwei Zheng; Gui-Song Xia

How-3D: 単一の画像からのホリスティック 3D ワイヤーフレーム認識

この論文では、ホリスティック 3D ワイヤーフレーム認識 (HoW-3D) の問題を研究します。これは、単一ビューの 2D 画像から、目に見える 3D ワイヤーフレームと目に見えないワイヤーフレームの両方を認識する新しいタスクです。オブジェクトの前面以外の表面は 1 つのビューで直接観察できないため、HoW-3D で見通し外 (NLOS) ジオメトリを推定することは根本的に困難な問題であり、コンピュータービジョンでは未解決のままです。 ABC-HoW ベンチマークを提案することで、HoW-3D の問題を研究します。このベンチマークは、12k のシングルビュー画像を含む ABC データセットと対応する全体的な 3D ワイヤーフレームモデルをソースとする CAD モデルの上に作成されます。大規模な ABC-HoW ベンチマークを利用できるようにすることで、新しい深層空間ゲシュタルト (DSG) モデルを提示して、目に見えるジャンクションと線分を基礎として学習し、ゲシュタルトの原則に従って、目に見える手がかりから NLOS 3D 構造を推測します。人間の視覚システム。私たちの実験では、単一ビューの画像から全体的な 3D ワイヤーフレームを推測する際に、DSG モデルが非常にうまく機能することを示しています。強力なベースライン手法と比較して、当社の DSG モデルは、単一ビュー画像で目に見えない線のジオメトリを検出する点で以前のワイヤーフレーム検出器よりも優れており、3D ワイヤーフレームを再構築する際の入力として忠実度の高い PointCloud を使用する従来技術と非常に競争力があります。

This paper studies the problem of holistic 3D wireframe perception (HoW-3D), a new task of perceiving both the visible 3D wireframes and the invisible ones from single-view 2D images. As the non-front surfaces of an object cannot be directly observed in a single view, estimating the non-line-of-sight (NLOS) geometries in HoW-3D is a fundamentally challenging problem and remains open in computer vision. We study the problem of HoW-3D by proposing an ABC-HoW benchmark, which is created on top of CAD models sourced from the ABC-dataset with 12k single-view images and the corresponding holistic 3D wireframe models. With our large-scale ABC-HoW benchmark available, we present a novel Deep Spatial Gestalt (DSG) model to learn the visible junctions and line segments as the basis and then infer the NLOS 3D structures from the visible cues by following the Gestalt principles of human vision systems. In our experiments, we demonstrate that our DSG model performs very well in inferring the holistic 3D wireframes from single-view images. Compared with the strong baseline methods, our DSG model outperforms the previous wireframe detectors in detecting the invisible line geometry in single-view images and is even very competitive with prior arts that take high-fidelity PointCloud as inputs on reconstructing 3D wireframes.

updated: Fri Aug 19 2022 05:06:58 GMT+0000 (UTC)

published: Mon Aug 15 2022 04:05:41 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト