Single Image Depth Estimation using Wavelet Decomposition

Michaël Ramamonjisoa; Michael Firman; Jamie Watson; Vincent Lepetit; Daniyar Turmukhambetov

ウェーブレット分解を使用した単一画像の深度推定

単眼画像から高効率で正確な深度を予測するための新しい方法を提案します。この最適な効率は、完全に微分可能なエンコーダーデコーダーアーキテクチャに統合されたウェーブレット分解を利用することで実現されます。スパースウェーブレット係数を予測することにより、忠実度の高い深度マップを再構築できることを示します。以前の研究とは対照的に、ウェーブレット係数は係数を直接監督することなく学習できることを示しています。代わりに、逆ウェーブレット変換によって再構成された最終的な深度画像のみを監視します。さらに、ウェーブレット係数は、グラウンドトゥルース深度にアクセスせずに、完全に自己監視されたシナリオで学習できることを示します。最後に、この方法をさまざまな最先端の単眼深度推定モデルに適用し、それぞれの場合で元のモデルと同様またはより良い結果をもたらし、デコーダーネットワークで必要な乗算加算は半分未満です。 https://github.com/nianticlabs/wavelet-monodepth のコード

We present a novel method for predicting accurate depths from monocular images with high efficiency. This optimal efficiency is achieved by exploiting wavelet decomposition, which is integrated in a fully differentiable encoder-decoder architecture. We demonstrate that we can reconstruct high-fidelity depth maps by predicting sparse wavelet coefficients. In contrast with previous works, we show that wavelet coefficients can be learned without direct supervision on coefficients. Instead we supervise only the final depth image that is reconstructed through the inverse wavelet transform. We additionally show that wavelet coefficients can be learned in fully self-supervised scenarios, without access to ground-truth depth. Finally, we apply our method to different state-of-the-art monocular depth estimation models, in each case giving similar or better results compared to the original model, while requiring less than half the multiply-adds in the decoder network. Code at https://github.com/nianticlabs/wavelet-monodepth

updated: Thu Jun 03 2021 17:42:25 GMT+0000 (UTC)

published: Thu Jun 03 2021 17:42:25 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト