RGB-D-Fusion: Image Conditioned Depth Diffusion of Humanoid Subjects

Sascha Kirch; Valeria Olyunina; Jan Ondřej; Rafael Pagés; Sergio Martin; Clara Pérez-Molina

RGB-D-Fusion: 人型被写体の画像条件付き深度拡散

人型被写体の低解像度の単眼 RGB 画像から高解像度の深度マップを生成するマルチモーダル条件付きノイズ除去拡散確率モデル RGB-D-Fusion を紹介します。 RGB-D-Fusion は、最初に画像条件付きノイズ除去拡散確率モデルを使用して低解像度深度マップを生成し、次に低解像度 RGB-D 画像で条件付けされた 2 番目のノイズ除去拡散確率モデルを使用して深度マップをアップサンプリングします。さらに、超解像モデルの堅牢性を高めるために、新しい拡張技術である深度ノイズ拡張を導入します。

We present RGB-D-Fusion, a multi-modal conditional denoising diffusion probabilistic model to generate high resolution depth maps from low-resolution monocular RGB images of humanoid subjects. RGB-D-Fusion first generates a low-resolution depth map using an image conditioned denoising diffusion probabilistic model and then upsamples the depth map using a second denoising diffusion probabilistic model conditioned on a low-resolution RGB-D image. We further introduce a novel augmentation technique, depth noise augmentation, to increase the robustness of our super-resolution model.

updated: Sat Jul 29 2023 13:47:40 GMT+0000 (UTC)

published: Sat Jul 29 2023 13:47:40 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト