A Follow-the-Leader Strategy using Hierarchical Deep Neural Networks with Grouped Convolutions

Jose Solomon; Francois Charette

グループ化された畳み込みを伴う階層型ディープニューラルネットワークを使用したフォローザリーダー戦略

リーダーに従うタスクは、階層型ディープニューラルネットワーク（DNN）のエンドツーエンドの運転モデルを使用して実装され、対象の歩行者の方向と速度に一致します。モデルは分類子DNNを使用して、歩行者がカメラセンサーの視野内にいるかどうかを判断します。歩行者がいる場合、カメラからの画像ストリームは回帰DNNに送られ、自動運転車のステアリングとスロットルを同時に調整して、歩行者とのケイデンスを維持します。歩行者が見えない場合、車両は簡単な探索的検索戦略を使用して追跡目的を再取得します。分類器と回帰DNNは、グループ化された畳み込みを組み込んで、モデルのパフォーマンスを向上させるだけでなく、パラメーター数と計算の待ち時間を大幅に削減します。モデルは、インテリジェンスプロセッシングユニット（IPU）でトレーニングされ、トレーニングまでの時間を最小限に抑えるために、そのきめ細かい計算機能を活用します。この結果は、自動運転車のステアリングとスロットルのプロファイルに関して非常に堅牢な追跡動作を示していますが、生成するデータ収集は最小限で済みます。トレーニングサンプルの処理に関するスループットは、IPUをグループ化された畳み込みと組み合わせて使用することにより、分類器のトレーニングの場合は約3.5倍、回帰ネットワークの場合は約7倍に向上しました。歩行者を追跡する車両の記録が作成され、Webで入手できます。これは、SN ComputerScienceに掲載された記事のプレプリントです。最終的な認証済みバージョンは、https：//doi.org/https：//doi.org/10.1007/s42979-021-00572-1からオンラインで入手できます。

The task of following-the-leader is implemented using a hierarchical Deep Neural Network (DNN) end-to-end driving model to match the direction and speed of a target pedestrian. The model uses a classifier DNN to determine if the pedestrian is within the field of view of the camera sensor. If the pedestrian is present, the image stream from the camera is fed to a regression DNN which simultaneously adjusts the autonomous vehicle's steering and throttle to keep cadence with the pedestrian. If the pedestrian is not visible, the vehicle uses a straightforward exploratory search strategy to reacquire the tracking objective. The classifier and regression DNNs incorporate grouped convolutions to boost model performance as well as to significantly reduce parameter count and compute latency. The models are trained on the Intelligence Processing Unit (IPU) to leverage its fine-grain compute capabilities in order to minimize time-to-train. The results indicate very robust tracking behavior on the part of the autonomous vehicle in terms of its steering and throttle profiles, while requiring minimal data collection to produce. The throughput in terms of processing training samples has been boosted by the use of the IPU in conjunction with grouped convolutions by a factor ~3.5 for training of the classifier and a factor of ~7 for the regression network. A recording of the vehicle tracking a pedestrian has been produced and is available on the web. This is a preprint of an article published in SN Computer Science. The final authenticated version is available online at: https://doi.org/https://doi.org/10.1007/s42979-021- 00572-1.

updated: Sat Apr 17 2021 17:21:00 GMT+0000 (UTC)

published: Wed Nov 04 2020 16:04:42 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト