End-to-end Autonomous Driving with Semantic Depth Cloud Mapping and Multi-agent

Oskar Natan; Jun Miura

セマンティックデプスクラウドマッピングとマルチエージェントによるエンドツーエンドの自動運転

自動運転車のポイントツーポイントナビゲーションのタスクに焦点を当て、知覚タスクと制御タスクの両方を同時に実行するためのエンドツーエンドおよびマルチタスク学習方法でトレーニングされた新しい深層学習モデルを提案します。このモデルは、グローバルプランナーによって定義された一連のルートをたどることにより、自我車両を安全に運転するために使用されます。モデルの知覚部分は、セマンティックセグメンテーション、セマンティックデプスクラウド（SDC）マッピング、信号機の状態と一時停止の標識の予測を実行しながら、RGBDカメラによって提供される高次元の観測データをエンコードするために使用されます。次に、制御部分は、GPSと速度計によって提供される追加情報とともに、エンコードされた特徴をデコードして、潜在的な特徴空間に伴うウェイポイントを予測します。さらに、2つのエージェントを使用してこれらの出力を処理し、最終アクションとしてステアリング、スロットル、およびブレーキのレベルを決定する制御ポリシーを作成します。モデルは、CARLAシミュレーターで評価され、実際の状況を模倣するために、通常の敵対的な状況とさまざまな天候で作成されたさまざまなシナリオが使用されます。さらに、運転の複数の側面でのパフォーマンスを正当化するために、いくつかの最近のモデルとの比較研究を行っています。さらに、SDCマッピングとマルチエージェントに関するアブレーション研究も実施して、それらの役割と動作を理解します。その結果、私たちのモデルは、より少ないパラメーターと計算負荷でさえ、最高の運転スコアを達成します。将来の研究をサポートするために、https：//github.com/oskarnatan/end-to-end-drivingでコードを共有しています。

Focusing on the task of point-to-point navigation for an autonomous driving vehicle, we propose a novel deep learning model trained with end-to-end and multi-task learning manners to perform both perception and control tasks simultaneously. The model is used to drive the ego vehicle safely by following a sequence of routes defined by the global planner. The perception part of the model is used to encode high-dimensional observation data provided by an RGBD camera while performing semantic segmentation, semantic depth cloud (SDC) mapping, and traffic light state and stop sign prediction. Then, the control part decodes the encoded features along with additional information provided by GPS and speedometer to predict waypoints that come with a latent feature space. Furthermore, two agents are employed to process these outputs and make a control policy that determines the level of steering, throttle, and brake as the final action. The model is evaluated on CARLA simulator with various scenarios made of normal-adversarial situations and different weathers to mimic real-world conditions. In addition, we do a comparative study with some recent models to justify the performance in multiple aspects of driving. Moreover, we also conduct an ablation study on SDC mapping and multi-agent to understand their roles and behavior. As a result, our model achieves the highest driving score even with fewer parameters and computation load. To support future studies, we share our codes at https://github.com/oskarnatan/end-to-end-driving.

updated: Wed Jun 22 2022 04:21:30 GMT+0000 (UTC)

published: Tue Apr 12 2022 03:57:01 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト