Is attention to bounding boxes all you need for pedestrian action prediction?

Lina Achaji; Julien Moreau; Thibault Fouqueray; Francois Aioun; Francois Charpillet

Human drivers are no longer the only ones who must handle the complexity of driving scenarios; autonomous vehicles (AVs) are increasingly involved as well. The development of AVs in urban areas raises essential safety concerns for vulnerable road users (VRUs) such as pedestrians. To make roads safer, it is therefore critical to classify and predict pedestrians' future behavior. In this paper, we present a framework based on multiple variations of the Transformer model that predicts a pedestrian's street-crossing decision from the dynamics of their initiated trajectory. We show that using only bounding boxes as input features can outperform the previous state-of-the-art results, reaching a prediction accuracy of 91% and an F1-score of 0.83 on the PIE dataset. In addition, we introduce a large simulated dataset (CP2A), generated with CARLA, for action prediction. Our model similarly reaches high accuracy (91%) and F1-score (0.91) on this dataset. Interestingly, we show that pre-training our Transformer model on the CP2A dataset and then fine-tuning it on the PIE dataset is beneficial for the action prediction task. Finally, our model's results are supported by the "human attention to bounding boxes" experiment, which we created to test humans' ability to predict pedestrian actions without environmental context. The code for the dataset and the models is available at: https://github.com/linaashaji/Action_Anticipation
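The abstract reports both accuracy and F1-score, and the two can diverge when one class is rarer than the other. The sketch below computes both metrics for a binary crossing / not-crossing task using their standard definitions. The function name `accuracy_and_f1` and the toy labels are hypothetical illustrations; they are not taken from the paper, its code, or the PIE and CP2A datasets.

```python
def accuracy_and_f1(y_true, y_pred):
    """Return (accuracy, F1) for binary labels, where 1 = crossing."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives

    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return accuracy, f1


# Hypothetical toy labels (not real data): 8 non-crossing, 2 crossing samples.
y_true = [0] * 8 + [1] * 2
# A classifier that misses one of the two crossing samples.
y_pred = [0] * 8 + [1, 0]

acc, f1 = accuracy_and_f1(y_true, y_pred)
print(f"accuracy={acc:.2f}, F1={f1:.2f}")
```

In this toy case a single missed crossing sample leaves accuracy high while F1 drops noticeably, which is why reporting both metrics is informative for crossing prediction.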

updated: Mon Apr 11 2022 09:54:57 GMT+0000 (UTC)

published: Fri Jul 16 2021 17:47:32 GMT+0000 (UTC)

arXiv
