PlaneDepth: Self-supervised Depth Estimation via Orthogonal Planes

Ruoyu Wang; Zehao Yu; Shenghua Gao

PlaneDepth: 直交平面による自己教師付き深度推定

複数の近前頭平行面ベースの深度表現は、自己教師あり単眼深度推定 (MDE) で印象的な結果を示しました。一方、そのような表現は、地面が正面平行面に垂直であるため、地面の不連続を引き起こし、自動運転における運転可能なスペースの識別に有害です。この論文では、垂直面と接地面を含む新しい直交面ベースのプレゼンテーションである PlaneDepth を提案します。 PlaneDepth は、入力画像の直交平面に基づくラプラシアン混合モデルを使用して深度分布を推定します。これらの平面は、参照ビューを合成して自己監視信号を提供するために使用されます。さらに、広く使用されているサイズ変更とトリミングのデータ拡張が直交性の仮定を破り、劣った平面予測につながることがわかりました。事前定義された平面と予測されたカメラポーズを修正するサイズ変更トリミング変換を明示的に構築することで、この問題に対処します。さらに、オクルージョンの直交平面表現の堅牢性を高めるために、バイラテラルオクルージョンマスクで監視された拡張自己蒸留損失を提案します。直交平面表現のおかげで、監視されていない方法で地面を抽出できます。これは自動運転にとって重要です。 KITTI データセットでの広範な実験により、この方法の有効性と効率が実証されています。コードは https://github.com/svip-lab/PlaneDepth で入手できます。

Multiple near frontal-parallel planes based depth representation demonstrated impressive results in self-supervised monocular depth estimation (MDE). Whereas, such a representation would cause the discontinuity of the ground as it is perpendicular to the frontal-parallel planes, which is detrimental to the identification of drivable space in autonomous driving. In this paper, we propose the PlaneDepth, a novel orthogonal planes based presentation, including vertical planes and ground planes. PlaneDepth estimates the depth distribution using a Laplacian Mixture Model based on orthogonal planes for an input image. These planes are used to synthesize a reference view to provide the self-supervision signal. Further, we find that the widely used resizing and cropping data augmentation breaks the orthogonality assumptions, leading to inferior plane predictions. We address this problem by explicitly constructing the resizing cropping transformation to rectify the predefined planes and predicted camera pose. Moreover, we propose an augmented self-distillation loss supervised with a bilateral occlusion mask to boost the robustness of orthogonal planes representation for occlusions. Thanks to our orthogonal planes representation, we can extract the ground plane in an unsupervised manner, which is important for autonomous driving. Extensive experiments on the KITTI dataset demonstrate the effectiveness and efficiency of our method. The code is available at https://github.com/svip-lab/PlaneDepth.

updated: Thu Mar 23 2023 10:01:33 GMT+0000 (UTC)

published: Tue Oct 04 2022 13:51:59 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト