Vehicle-Rear: A New Dataset to Explore Feature Fusion for Vehicle Identification Using Convolutional Neural Networks

Icaro O. de Oliveira; Rayson Laroca; David Menotti; Keiko V. O. Fonseca; Rodrigo Minetto

Vehicle-Rear：畳み込みニューラルネットワークを使用した車両識別のための機能融合を調査するための新しいデータセット

この作業は、重なり合わないカメラによる車両識別の問題に対処します。主な貢献として、車両識別用の新しいデータセットであるVehicle-Rearを紹介します。このデータセットには、3時間以上の高解像度ビデオが含まれており、約3,000台の車両のメーカー、モデル、色、年式に関する正確な情報が含まれています。ナンバープレートの位置と識別に。データセットを探索するために、利用可能な最も特徴的で永続的な2つの機能である車両の外観とナンバープレートを同時に使用する2ストリームCNNを設計します。これは、主要な問題に取り組む試みです。類似したデザインの車両または非常に近いナンバープレート識別子によって引き起こされる誤警報です。最初のネットワークストリームでは、形状の類似性は、2つの異なるカメラによって記録された低解像度の車両パッチのペアを使用するシャムCNNによって識別されます。 2番目のストリームでは、OCRのCNNを使用して、高解像度のナンバープレートパッチのペアからテキスト情報、信頼スコア、および文字列の類似性を抽出します。次に、両方のストリームの機能が、完全に接続された一連のレイヤーによってマージされ、決定されます。私たちの実験では、2ストリームネットワークを、単一または複数の車両機能を使用するいくつかのよく知られたCNNアーキテクチャと比較しました。アーキテクチャ、トレーニング済みモデル、およびデータセットは、https：//github.com/icarofua/vehicle-rearで公開されています。

This work addresses the problem of vehicle identification through non-overlapping cameras. As our main contribution, we introduce a novel dataset for vehicle identification, called Vehicle-Rear, that contains more than three hours of high-resolution videos, with accurate information about the make, model, color and year of nearly 3,000 vehicles, in addition to the position and identification of their license plates. To explore our dataset we design a two-stream CNN that simultaneously uses two of the most distinctive and persistent features available: the vehicle's appearance and its license plate. This is an attempt to tackle a major problem: false alarms caused by vehicles with similar designs or by very close license plate identifiers. In the first network stream, shape similarities are identified by a Siamese CNN that uses a pair of low-resolution vehicle patches recorded by two different cameras. In the second stream, we use a CNN for OCR to extract textual information, confidence scores, and string similarities from a pair of high-resolution license plate patches. Then, features from both streams are merged by a sequence of fully connected layers for decision. In our experiments, we compared the two-stream network against several well-known CNN architectures using single or multiple vehicle features. The architectures, trained models, and dataset are publicly available at https://github.com/icarofua/vehicle-rear.

updated: Sun Jul 25 2021 11:39:59 GMT+0000 (UTC)

published: Wed Nov 13 2019 15:23:04 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト