Consistent Depth Prediction under Various Illuminations using Dilated Cross Attention

Zitian Zhang; Chuhua Xian

拡張クロスアテンションを使用したさまざまな照明下での一貫した深さ予測

本論文では、様々な照明条件下での複雑なシーンにおける一貫した深度予測の問題を解決することを目指しています。 RGB-Dセンサーまたは仮想レンダリングに基づく既存の屋内データセットには、2つの重大な制限があります。スパース深度マップ（NYU Depth V2）と非現実的な照明（SUN CG、SceneNet RGB-D）です。インターネット3D屋内シーンを使用し、それらの照明を手動で調整して、フォトリアリスティックなRGB写真とそれに対応する深度およびBRDFマップをレンダリングし、Variデータセットと呼ばれる新しい屋内深度データセットを取得することを提案します。グローバル情報を処理し、パラメータを削減するために、エンコードされた特徴に深さ方向に分離可能な拡張畳み込みを適用することにより、DCAという名前の単純な畳み込みブロックを提案します。さまざまな照明下で深度予測の一貫性を維持するために、これらの拡張された機能に対してクロスアテンションを実行します。私たちの方法は、Variデータセットの現在の最先端の方法と比較することによって評価され、実験で大幅な改善が見られます。また、アブレーションスタディを実施し、NYU Depth V2でモデルを微調整し、実際のデータで評価して、DCAブロックの有効性をさらに検証します。コード、事前トレーニング済みの重み、およびVariデータセットはオープンソースです。

In this paper, we aim to solve the problem of consistent depth prediction in complex scenes under various illumination conditions. The existing indoor datasets based on RGB-D sensors or virtual rendering have two critical limitations - sparse depth maps (NYU Depth V2) and non-realistic illumination (SUN CG, SceneNet RGB-D). We propose to use internet 3D indoor scenes and manually tune their illuminations to render photo-realistic RGB photos and their corresponding depth and BRDF maps, obtaining a new indoor depth dataset called Vari dataset. We propose a simple convolutional block named DCA by applying depthwise separable dilated convolution on encoded features to process global information and reduce parameters. We perform cross attention on these dilated features to retain the consistency of depth prediction under different illuminations. Our method is evaluated by comparing it with current state-of-the-art methods on Vari dataset and a significant improvement is observed in our experiments. We also conduct the ablation study, finetune our model on NYU Depth V2 and also evaluate on real-world data to further validate the effectiveness of our DCA block. The code, pre-trained weights and Vari dataset are open-sourced.

updated: Wed Dec 15 2021 10:02:46 GMT+0000 (UTC)

published: Wed Dec 15 2021 10:02:46 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト