Semi-supervised Deep Multi-view Stereo

Hongbin Xu; Zhipeng Zhou; Weitao Chen; Baigui Sun; Hao Li; Wenxiong Kang

半教師ありディープマルチビューステレオ

教師ありおよび教師なし設定における学習ベースのマルチビューステレオ (MVS) では、大きな進歩が見られました。精度と完全性におけるそれぞれの利点を組み合わせ、同時に高価なラベル付きデータの需要を減らすために、この論文では、MVS データのほんの一部だけが密な深度の地盤に付加されているという半教師あり設定における学習ベースの MVS の問題を調査します。真実。ただし、シナリオの多様性とビューの柔軟な設定により、ラベルなしデータとラベル付きデータが同じラベル空間とデータ分布を共有するという、古典的な半教師あり学習の基本的な前提が崩れる可能性があります。これは、半教師あり分布ギャップと呼ばれます。 MVS 問題のあいまいさ。これらの問題に対処するために、私たちは、新しい半教師あり分散拡張 MVS フレームワーク、つまり SDA-MVS を提案します。基本的な仮定が MVS データで機能する単純なケースでは、一貫性の正則化により、元のサンプルとランダムに拡張されたサンプルの間でモデル予測の一貫性が促進されます。 MVS データで基本的な仮定が矛盾するというさらに厄介なケースに対して、分布ギャップによって引き起こされる悪影響を軽減するために、新しいスタイルの一貫性損失を提案します。ラベルなしサンプルの視覚スタイルはラベル付きサンプルに転送されてギャップが縮小され、生成されたサンプルのモデル予測は元のラベル付きサンプルのラベルでさらに監視されます。複数の MVS データセットの半教師あり設定での実験結果は、提案された方法の優れたパフォーマンスを示しています。バックボーンネットワークで同じ設定を使用すると、私たちが提案した SDA-MVS は、完全に監視されたベースラインと監視されていないベースラインよりも優れたパフォーマンスを発揮します。

Significant progress has been witnessed in learning-based Multi-view Stereo (MVS) under supervised and unsupervised settings. To combine their respective merits in accuracy and completeness, meantime reducing the demand for expensive labeled data, this paper explores the problem of learning-based MVS in a semi-supervised setting that only a tiny part of the MVS data is attached with dense depth ground truth. However, due to huge variation of scenarios and flexible settings in views, it may break the basic assumption in classic semi-supervised learning, that unlabeled data and labeled data share the same label space and data distribution, named as semi-supervised distribution-gap ambiguity in the MVS problem. To handle these issues, we propose a novel semi-supervised distribution-augmented MVS framework, namely SDA-MVS. For the simple case that the basic assumption works in MVS data, consistency regularization encourages the model predictions to be consistent between original sample and randomly augmented sample. For further troublesome case that the basic assumption is conflicted in MVS data, we propose a novel style consistency loss to alleviate the negative effect caused by the distribution gap. The visual style of unlabeled sample is transferred to labeled sample to shrink the gap, and the model prediction of generated sample is further supervised with the label in original labeled sample. The experimental results in semi-supervised settings of multiple MVS datasets show the superior performance of the proposed method. With the same settings in backbone network, our proposed SDA-MVS outperforms its fully-supervised and unsupervised baselines.

updated: Mon Aug 07 2023 08:33:53 GMT+0000 (UTC)

published: Sun Jul 24 2022 09:37:42 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト