Regularizing Neural Networks for Future Trajectory Prediction via Inverse Reinforcement Learning Framework

Dooseop Choi; Kyoungwook Min; Jeongdan Choi

逆強化学習フレームワークを介した将来の軌道予測のためのニューラルネットワークの正則化

エージェントの将来の軌跡は、過去の軌跡だけでなく、シーンのコンテキストによっても影響を受けるため、動的なシーンにおけるエージェントの遠い将来の軌跡を予測することは簡単な問題ではありません。この問題に取り組むために、リカレントニューラルネットワーク（RNN）に基づくモデルと、モデルをトレーニングするための新しい方法を提案します。提案されたモデルは、エンコーダーが入力（過去の軌跡とシーンコンテキスト情報）をエンコードし、デコーダーがエンコーダーによって与えられたコンテキストベクトルから軌跡を生成するエンコーダーデコーダーアーキテクチャに基づいています。提案されたモデルのネットワークをトレーニングして、実際の軌道に最も近い将来の軌道を作成し、報酬関数からの報酬を最大化します。報酬関数も同時に訓練され、地上真実の軌跡からの報酬とその推定値の間のマージンを最大化します。報酬関数は、提案されたモデルの正則化の役割を果たします。そのため、訓練されたネットワークは、予測タスクのためにシーンコンテキスト情報をより活用できます。提案されたモデルをいくつかの公開データセットで評価しました。実験結果は、提案されたモデルの予測性能が正則化によって大幅に改善され、精度の点で最新技術を上回ることを示しています。実装コードはhttps://github.com/d1024choi/traj-pred-irl/で入手できます。

Predicting distant future trajectories of agents in a dynamic scene is not an easy problem because the future trajectory of an agent is affected by not only his/her past trajectory but also the scene contexts. To tackle this problem, we propose a model based on recurrent neural networks (RNNs) and a novel method for training the model. The proposed model is based on an encoder-decoder architecture where the encoder encodes inputs (past trajectories and scene context information) while the decoder produces a trajectory from the context vector given by the encoder. We train the networks of the proposed model to produce a future trajectory, which is the closest to the true trajectory, while maximizing a reward from a reward function. The reward function is also trained at the same time to maximize the margin between the rewards from the ground-truth trajectory and its estimate. The reward function plays the role of a regularizer for the proposed model so the trained networks are able to better utilize the scene context information for the prediction task. We evaluated the proposed model on several public datasets. Experimental results show that the prediction performance of the proposed model is much improved by the regularization, which outperforms the-state-of-the-arts in terms of accuracy. The implementation codes are available at https://github.com/d1024choi/traj-pred-irl/.

updated: Thu Dec 26 2019 00:32:53 GMT+0000 (UTC)

published: Wed Jul 10 2019 06:00:51 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト