Reducing Overconfidence Predictions for Autonomous Driving Perception

Gledson Melotti; Cristiano Premebida; Jordan J. Bird; Diego R. Faria; Nuno Gonçalves

自動運転知覚に対する自信過剰予測の削減

オブジェクト認識のための最先端の深層学習では、SoftMax関数とSigmoid関数が最も一般的に予測子出力として使用されます。このようなレイヤーは、適切な確率スコアではなく、自信過剰な予測を生成することが多く、自動運転やロボット工学に適用される「重要な」知覚システムの意思決定に悪影響を与える可能性があります。これを考慮して、この作業の実験は、事前にトレーニングされたネットワークのロジット層スコアから計算された分布に基づく確率論的アプローチを提案します。最尤法（ML）および最大事後確率（MAP）関数は、オブジェクト認識のSoftMaxおよびSigmoidベースの予測よりも確率的解釈に適していることを示します。 KITTIおよびLyftレベル5データセットからのRGB画像およびLiDAR（RV：範囲ビュー）データを介して個別のセンサーモダリティを調査します。このアプローチでは、通常のSoftMaxおよびSigmoidレイヤーと比較して有望なパフォーマンスが示され、解釈可能な確率が可能になります。予測。このホワイトペーパーで紹介したアプローチのもう1つの利点は、MLおよびMAP機能を既存のトレーニング済みネットワークに実装できることです。つまり、このアプローチは、事前トレーニング済みネットワークのロジットレイヤーの出力から恩恵を受けます。したがって、MLおよびMAP関数はテスト/予測フェーズで使用されるため、新しいトレーニングフェーズを実行する必要はありません。

In state-of-the-art deep learning for object recognition, SoftMax and Sigmoid functions are most commonly employed as the predictor outputs. Such layers often produce overconfident predictions rather than proper probabilistic scores, which can thus harm the decision-making of `critical' perception systems applied in autonomous driving and robotics. Given this, the experiments in this work propose a probabilistic approach based on distributions calculated out of the Logit layer scores of pre-trained networks. We demonstrate that Maximum Likelihood (ML) and Maximum a-Posteriori (MAP) functions are more suitable for probabilistic interpretations than SoftMax and Sigmoid-based predictions for object recognition. We explore distinct sensor modalities via RGB images and LiDARs (RV: range-view) data from the KITTI and Lyft Level-5 datasets, where our approach shows promising performance compared to the usual SoftMax and Sigmoid layers, with the benefit of enabling interpretable probabilistic predictions. Another advantage of the approach introduced in this paper is that the ML and MAP functions can be implemented in existing trained networks, that is, the approach benefits from the output of the Logit layer of pre-trained networks. Thus, there is no need to carry out a new training phase since the ML and MAP functions are used in the test/prediction phase.

updated: Thu May 12 2022 03:32:33 GMT+0000 (UTC)

published: Wed Feb 16 2022 01:59:55 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト