Rethinking and Designing a High-performing Automatic License Plate Recognition Approach

Yi Wang; Zhen-Peng Bian; Yunhao Zhou; Lap-Pui Chau

In this paper, we propose a real-time and accurate automatic license plate recognition (ALPR) approach. Our study motivates the design with four insights: (1) a resampling-based cascaded framework benefits both speed and accuracy; (2) highly efficient license plate recognition should abandon additional character segmentation and recurrent neural networks (RNNs) and instead adopt a plain convolutional neural network (CNN); (3) for a CNN, exploiting vertex information on license plates improves recognition performance; and (4) a weight-sharing character classifier addresses the lack of training images in small-scale datasets. Based on these insights, we propose a novel ALPR approach, termed VSNet. Specifically, VSNet includes two CNNs, i.e., VertexNet for license plate detection and SCR-Net for license plate recognition, integrated in a resampling-based cascaded manner. In VertexNet, we propose an efficient integration block to extract the spatial features of license plates. Using vertex supervisory information, we add a vertex-estimation branch to VertexNet so that license plates can be rectified before being passed to SCR-Net as input images. In SCR-Net, we introduce a horizontal encoding technique for left-to-right feature extraction and propose a weight-sharing classifier for character recognition. Experimental results show that the proposed VSNet reduces the error rate of state-of-the-art methods by more than 50% in relative terms, achieving > 99% recognition accuracy on the CCPD and AOLP datasets at an inference speed of 149 FPS. Moreover, our method shows strong generalization when evaluated on the unseen PKUData and CLPD datasets.
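The rectification step mentioned above, in which a plate is resampled from its four estimated vertices into an upright image for the recognizer, can be illustrated with a plain perspective warp. The following is a minimal sketch and not the authors' implementation: the function names, the vertex order (top-left, top-right, bottom-right, bottom-left), the 96×32 output size, and the nearest-neighbour sampling are all assumptions made for illustration.

```python
import numpy as np


def homography_from_vertices(src, dst):
    """Solve for the 3x3 homography mapping each src point to its dst point.

    Hypothetical helper: uses the standard 8-unknown linear system for
    exactly four point correspondences, with the last entry fixed to 1.
    """
    rows = []
    for (x, y), (u, v) in zip(src, dst):
        rows.append([x, y, 1, 0, 0, 0, -u * x, -u * y])
        rows.append([0, 0, 0, x, y, 1, -v * x, -v * y])
    A = np.array(rows, dtype=float)
    b = np.array(dst, dtype=float).reshape(-1)  # [u0, v0, u1, v1, ...]
    h = np.linalg.solve(A, b)
    return np.append(h, 1.0).reshape(3, 3)


def rectify_plate(image, vertices, out_w=96, out_h=32):
    """Resample the quadrilateral given by `vertices` into an upright image.

    `vertices` are (x, y) pixel coordinates, assumed to be ordered
    top-left, top-right, bottom-right, bottom-left.
    """
    corners = [(0, 0), (out_w - 1, 0), (out_w - 1, out_h - 1), (0, out_h - 1)]
    # Map output pixels back to input pixels (inverse warping).
    H = homography_from_vertices(corners, vertices)
    us, vs = np.meshgrid(np.arange(out_w), np.arange(out_h))
    pts = np.stack([us.ravel(), vs.ravel(), np.ones(us.size)])
    mapped = H @ pts
    # Nearest-neighbour sampling, clipped to the image bounds.
    xs = np.rint(mapped[0] / mapped[2]).astype(int).clip(0, image.shape[1] - 1)
    ys = np.rint(mapped[1] / mapped[2]).astype(int).clip(0, image.shape[0] - 1)
    return image[ys, xs].reshape(out_h, out_w, *image.shape[2:])
```

In a cascaded pipeline of this kind, the detector's vertex estimates would be passed as `vertices`, and the rectified crop would then be fed to the recognition network. A production system would more likely use bilinear interpolation than nearest-neighbour sampling.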

updated: Thu Jun 17 2021 07:05:00 GMT+0000 (UTC)

published: Mon Nov 30 2020 16:03:57 GMT+0000 (UTC)

arXiv
