ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild

Wang Zhao; Shaohui Liu; Hengkai Guo; Wenping Wang; Yong-Jin Liu

ParticleSfM：野生の移動カメラをローカライズするための高密度ポイント軌道の活用

単眼ビデオから動くカメラのポーズを推定することは、特に既存のカメラポーズ推定方法のパフォーマンスが幾何学的に一貫していないピクセルの影響を受けやすい動的環境に動くオブジェクトが存在するため、難しい問題です。この課題に取り組むために、ペアワイズオプティカルフローから初期化された高密度対応に基づくビデオの堅牢で高密度の間接的な運動からの構造の方法を提示します。私たちの重要なアイデアは、長距離ビデオの対応を密な点の軌跡として最適化し、それを使用してモーションセグメンテーションのロバスト推定を学習することです。不規則な点軌道データを処理するために、新しいニューラルネットワークアーキテクチャが提案されています。次に、カメラのポーズが推定され、静的として分類される長距離のポイント軌道の部分に対するグローバルバンドル調整で最適化されます。 MPI Sintelデータセットでの実験は、私たちのシステムが既存の最先端の方法と比較して非常に正確なカメラ軌道を生成することを示しています。さらに、私たちの方法は、完全に静的なシーンでカメラのポーズの妥当な精度を維持することができます。これは、エンドツーエンドの深層学習を備えた強力な最先端の密な対応ベースの方法を一貫して上回り、密な間接的な方法の可能性を示していますオプティカルフローとポイント軌道に基づいています。ポイント軌道表現は一般的であるため、動的オブジェクトの複雑な動きを伴う野生の単眼ビデオの結果と比較をさらに示します。コードはhttps://github.com/bytedance/particle-sfmで入手できます。

Estimating the pose of a moving camera from monocular video is a challenging problem, especially due to the presence of moving objects in dynamic environments, where the performance of existing camera pose estimation methods are susceptible to pixels that are not geometrically consistent. To tackle this challenge, we present a robust dense indirect structure-from-motion method for videos that is based on dense correspondence initialized from pairwise optical flow. Our key idea is to optimize long-range video correspondence as dense point trajectories and use it to learn robust estimation of motion segmentation. A novel neural network architecture is proposed for processing irregular point trajectory data. Camera poses are then estimated and optimized with global bundle adjustment over the portion of long-range point trajectories that are classified as static. Experiments on MPI Sintel dataset show that our system produces significantly more accurate camera trajectories compared to existing state-of-the-art methods. In addition, our method is able to retain reasonable accuracy of camera poses on fully static scenes, which consistently outperforms strong state-of-the-art dense correspondence based methods with end-to-end deep learning, demonstrating the potential of dense indirect methods based on optical flow and point trajectories. As the point trajectory representation is general, we further present results and comparisons on in-the-wild monocular videos with complex motion of dynamic objects. Code is available at https://github.com/bytedance/particle-sfm.

updated: Tue Jul 19 2022 09:19:45 GMT+0000 (UTC)

published: Tue Jul 19 2022 09:19:45 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト