DepthGAN: GAN-based Depth Generation of Indoor Scenes from Semantic Layouts

Yidi Li; Yiqun Wang; Zhengda Lu; Jun Xiao

Generating complex 3D scenes remains a challenging problem for existing generative networks, which are limited in both computational efficiency and accuracy. In this work, we propose DepthGAN, a novel method for generating depth maps using only semantic layouts as input. First, we introduce a well-designed cascade of transformer blocks as our generator to capture the structural correlations in depth maps, striking a balance between global feature aggregation and local attention. In addition, we propose a cross-attention fusion module that exploits additional appearance supervision to guide edge preservation efficiently during depth generation. Finally, we conduct extensive experiments on the perspective views of the Structured3D panorama dataset and demonstrate that DepthGAN achieves superior performance in both quantitative results and visual quality on the depth generation task. Furthermore, 3D indoor scenes with reasonable structure and spatial coherency can be reconstructed from our generated depth maps.
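
The last point, reconstructing a 3D scene from a depth map, can be illustrated with standard pinhole back-projection. The following is a minimal sketch and not the authors' reconstruction pipeline: the function name `backproject_depth`, the intrinsics `fx`, `fy`, `cx`, `cy`, and the toy 2x2 depth map are all assumptions made for illustration, and non-positive depth is assumed to mean "missing".

```python
from typing import List, Tuple

Point = Tuple[float, float, float]


def backproject_depth(
    depth: List[List[float]],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
) -> List[Point]:
    """Lift each valid depth pixel (u, v, z) to a camera-space point (x, y, z).

    Assumes a pinhole camera with focal lengths (fx, fy) and principal
    point (cx, cy), all in pixels. These intrinsics are hypothetical; the
    abstract does not specify a camera model.
    """
    points: List[Point] = []
    for v, row in enumerate(depth):      # v indexes image rows
        for u, z in enumerate(row):      # u indexes image columns
            if z <= 0:                   # assumption: non-positive depth is missing
                continue
            x = (u - cx) * z / fx
            y = (v - cy) * z / fy
            points.append((x, y, z))
    return points


# Toy 2x2 depth map (arbitrary units), with one missing pixel.
depth = [[2.0, 2.0],
         [0.0, 4.0]]
points = backproject_depth(depth, fx=1.0, fy=1.0, cx=0.5, cy=0.5)
```

Each valid pixel becomes one 3D point, so a generated depth map yields a point cloud of the visible surfaces for a single perspective view. Merging several views into one scene would additionally require camera poses, which this sketch does not cover.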

updated: Tue Mar 22 2022 04:18:45 GMT+0000 (UTC)

published: Tue Mar 22 2022 04:18:45 GMT+0000 (UTC)

arXiv
