RAIN: Reinforced Hybrid Attention Inference Network for Motion Forecasting

Jiachen Li; Fan Yang; Hengbo Ma; Srikanth Malla; Masayoshi Tomizuka; Chiho Choi

雨：動き予測のための強化されたハイブリッド注意推論ネットワーク

モーション予測は、さまざまなドメイン（自動運転、人間とロボットの相互作用など）で重要な役割を果たします。これは、一連の過去の観測から将来のモーションシーケンスを予測することを目的としています。ただし、観察された要素の重要性のレベルは異なる場合があります。一部の情報は、特定の状況では、予測とは無関係であるか、気が散ることさえあります。この問題に対処するために、ハイブリッド注意メカニズムに基づく動的なキー情報の選択とランク付けを備えた一般的なモーション予測フレームワーク（RAINという名前）を提案します。一般的なフレームワークは、マルチエージェントの軌道予測タスクと人間の動きの予測タスクをそれぞれ処理するためにインスタンス化されます。前者のタスクでは、モデルは、グラフ表現を使用してエージェント間の関係を認識し、それらの相対的な重要性を判断することを学習します。後者のタスクでは、モデルは長期的な人間の動きの時間的近接性と依存性をキャプチャすることを学習します。また、フレームワークのさまざまなモジュールのパラメーターを最適化するための交互のトレーニング戦略を備えた効果的な2段階のトレーニングパイプラインを提案します。さまざまなドメインでの合成シミュレーションとモーション予測ベンチマークの両方でフレームワークを検証し、私たちの方法が最先端の予測パフォーマンスを達成するだけでなく、解釈可能で合理的なハイブリッド注意の重みを提供することを示します。

Motion forecasting plays a significant role in various domains (e.g., autonomous driving, human-robot interaction), which aims to predict future motion sequences given a set of historical observations. However, the observed elements may be of different levels of importance. Some information may be irrelevant or even distracting to the forecasting in certain situations. To address this issue, we propose a generic motion forecasting framework (named RAIN) with dynamic key information selection and ranking based on a hybrid attention mechanism. The general framework is instantiated to handle multi-agent trajectory prediction and human motion forecasting tasks, respectively. In the former task, the model learns to recognize the relations between agents with a graph representation and to determine their relative significance. In the latter task, the model learns to capture the temporal proximity and dependency in long-term human motions. We also propose an effective double-stage training pipeline with an alternating training strategy to optimize the parameters in different modules of the framework. We validate the framework on both synthetic simulations and motion forecasting benchmarks in different domains, demonstrating that our method not only achieves state-of-the-art forecasting performance, but also provides interpretable and reasonable hybrid attention weights.

updated: Tue Aug 03 2021 06:30:30 GMT+0000 (UTC)

published: Tue Aug 03 2021 06:30:30 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト