Contingencies from Observations: Tractable Contingency Planning with Learned Behavior Models

Nicholas Rhinehart; Jeff He; Charles Packer; Matthew A. Wright; Rowan McAllister; Joseph E. Gonzalez; Sergey Levine

観察からの不測の事態：学習された行動モデルによる実行可能な不測の事態の計画

人間は、他のエージェントの将来の行動や精神状態など、将来の出来事について正確に推論することによって決定を下す驚くべき能力を持っています。混雑した交差点を車で運転することを検討してください。車の物理的性質、他のドライバーの意図、および自分の意図に対する彼らの信念について推論する必要があります。方向転換の合図をすると、別のドライバーがあなたに譲る可能性があります。または、追い越し車線に入ると、別のドライバーが減速して、前に合流する余地を与える可能性があります。有能なドライバーは、次の行動を起こす前に、他のエージェントのさまざまな潜在的な将来の行動に安全に対応する方法を計画する必要があります。これには、緊急時対応計画が必要です。将来のイベントの確率的結果に依存する一連の条件付きアクションを明示的に計画します。この作業では、高次元のシーン観測と低次元の行動観測を使用してエンドツーエンドで学習される汎用の緊急時対応プランナーを開発します。条件付き自己回帰フローモデルを使用して、コンパクトな緊急時対応計画スペースを作成し、このモデルが行動観察から緊急時対応を扱いやすく学習する方法を示します。ドライビングシミュレーター（CARLA）で現実的なマルチエージェントシナリオの閉ループ制御ベンチマークを開発しました。このベンチマークでは、いくつかの最先端技術を含む、マルチエージェントの将来の動作を推論するさまざまな非偶発的方法と比較します。ディープラーニングベースの計画アプローチ。これらの非偶発的計画手法は基本的にこのベンチマークに失敗することを示し、私たちの深い緊急時対応計画手法が大幅に優れたパフォーマンスを達成することを発見しました。ベンチマークを実行して結果を再現するためのコードは、https：//sites.google.com/view/contingency-planningで入手できます。

Humans have a remarkable ability to make decisions by accurately reasoning about future events, including the future behaviors and states of mind of other agents. Consider driving a car through a busy intersection: it is necessary to reason about the physics of the vehicle, the intentions of other drivers, and their beliefs about your own intentions. If you signal a turn, another driver might yield to you, or if you enter the passing lane, another driver might decelerate to give you room to merge in front. Competent drivers must plan how they can safely react to a variety of potential future behaviors of other agents before they make their next move. This requires contingency planning: explicitly planning a set of conditional actions that depend on the stochastic outcome of future events. In this work, we develop a general-purpose contingency planner that is learned end-to-end using high-dimensional scene observations and low-dimensional behavioral observations. We use a conditional autoregressive flow model to create a compact contingency planning space, and show how this model can tractably learn contingencies from behavioral observations. We developed a closed-loop control benchmark of realistic multi-agent scenarios in a driving simulator (CARLA), on which we compare our method to various noncontingent methods that reason about multi-agent future behavior, including several state-of-the-art deep learning-based planning approaches. We illustrate that these noncontingent planning methods fundamentally fail on this benchmark, and find that our deep contingency planning method achieves significantly superior performance. Code to run our benchmark and reproduce our results is available at https://sites.google.com/view/contingency-planning

updated: Wed Apr 21 2021 14:30:20 GMT+0000 (UTC)

published: Wed Apr 21 2021 14:30:20 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト