Scalable Scene Flow from Point Clouds in the Real World

Philipp Jund; Chris Sweeney; Nichola Abdo; Zhifeng Chen; Jonathon Shlens

実世界の点群からのスケーラブルなシーンフロー

自動運転車は非常に動的な環境で動作するため、シーンのどの側面が移動しているか、どこに移動しているかを正確に評価する必要があります。シーンフローと呼ばれる3Dモーション推定への一般的なアプローチは、連続するLiDARスキャンからの3Dポイントクラウドデータを使用することですが、そのようなアプローチは、実世界の注釈付きLiDARデータのサイズが小さいために制限されています。この作業では、対応する追跡された3Dオブジェクトから派生したシーンフロー推定用の新しい大規模データセットを紹介します。これは、注釈付きフレームの数に関して、以前の実世界のデータセットよりも約1,000倍大きくなります。利用可能な実際のLiDARデータの量に基づいて以前の作業がどのように制限されたかを示し、最先端の予測パフォーマンスを達成するには、より大きなデータセットが必要であることを示唆しています。さらに、ダウンサンプリングなどのポイントクラウドで操作するための以前のヒューリスティックがパフォーマンスを大幅に低下させ、フルポイントクラウドで扱いやすい新しいクラスのモデルを動機付ける方法を示します。この問題に対処するために、フルポイントクラウドでリアルタイムの推論を提供するFastFlow3Dアーキテクチャを導入します。さらに、エゴモーションを考慮し、オブジェクトタイプごとの内訳を提供することで、現実世界の側面をより適切にキャプチャする、人間が解釈できるメトリックを設計します。このデータセットが、実世界のシーンフローシステムを開発するための新しい機会を提供することを願っています。

Autonomous vehicles operate in highly dynamic environments necessitating an accurate assessment of which aspects of a scene are moving and where they are moving to. A popular approach to 3D motion estimation, termed scene flow, is to employ 3D point cloud data from consecutive LiDAR scans, although such approaches have been limited by the small size of real-world, annotated LiDAR data. In this work, we introduce a new large-scale dataset for scene flow estimation derived from corresponding tracked 3D objects, which is ∼1,000× larger than previous real-world datasets in terms of the number of annotated frames. We demonstrate how previous works were bounded based on the amount of real LiDAR data available, suggesting that larger datasets are required to achieve state-of-the-art predictive performance. Furthermore, we show how previous heuristics for operating on point clouds such as down-sampling heavily degrade performance, motivating a new class of models that are tractable on the full point cloud. To address this issue, we introduce the FastFlow3D architecture which provides real time inference on the full point cloud. Additionally, we design human-interpretable metrics that better capture real world aspects by accounting for ego-motion and providing breakdowns per object type. We hope that this dataset may provide new opportunities for developing real world scene flow systems.

updated: Mon Oct 25 2021 22:46:02 GMT+0000 (UTC)

published: Mon Mar 01 2021 20:56:05 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト