Evaluating the Transferability and Adversarial Discrimination of Convolutional Neural Networks for Threat Object Detection and Classification within X-Ray Security Imagery

Yona Falinie A. Gaus; Neelanjan Bhowmik; Samet Akcay; Toby P. Breckon

X-ray imagery security screening is essential to maintaining transport security against a varying profile of threat or prohibited items. Of particular interest is the automatic detection and classification of weapons such as firearms and knives within complex and cluttered X-ray security imagery. Here, we address this problem by exploring various end-to-end object detection Convolutional Neural Network (CNN) architectures. We evaluate several leading variants spanning the Faster R-CNN, Mask R-CNN, and RetinaNet architectures to explore the transferability of such models between X-ray scanners with differing imaging geometries, image resolutions and material colour profiles. Whilst the limited availability of X-ray threat imagery can pose a challenge, we employ a transfer learning approach to evaluate whether such inter-scanner generalisation exists over a multi-class detection problem. Overall, we achieve maximal detection performance using a Faster R-CNN architecture with a ResNet_101 classification network, obtaining a mean Average Precision (mAP) of 0.88 and 0.86 for three-class and two-class item detection, respectively, from varying X-ray imaging sources. Our results exhibit a remarkable degree of generalisability in terms of cross-scanner performance (mAP: 0.87, firearm detection: 0.94 AP). In addition, we examine the inherent adversarial discriminative capability of such networks using a specifically generated adversarial dataset for firearms detection. With a variable but low false positive rate, as low as 5%, this shows both the challenge and promise of such threat detection within X-ray security imagery.
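The AP and mAP figures quoted above are ranking-based detection metrics. As a minimal sketch of how such figures are commonly computed, the Python below derives an all-point interpolated Average Precision for one class from scored detections and averages per-class values into a mAP. The abstract does not state which AP protocol or IoU threshold the authors used, so the interpolation scheme, the function names `average_precision` and `mean_average_precision`, and the input format (each detection already matched to ground truth as true or false positive) are all assumptions made for illustration.

```python
def average_precision(detections, num_ground_truth):
    """All-point interpolated AP for a single class (illustrative sketch).

    detections: list of (confidence_score, is_true_positive) pairs, where the
        true/false-positive flag is assumed to come from a prior IoU-based
        matching step that is not shown here.
    num_ground_truth: number of ground-truth objects of this class.
    """
    if num_ground_truth == 0:
        return 0.0

    # Rank detections from most to least confident.
    ranked = sorted(detections, key=lambda d: d[0], reverse=True)

    true_pos = 0
    false_pos = 0
    recalls = []
    precisions = []
    for _, is_tp in ranked:
        if is_tp:
            true_pos += 1
        else:
            false_pos += 1
        recalls.append(true_pos / num_ground_truth)
        precisions.append(true_pos / (true_pos + false_pos))

    # Interpolate: precision at each rank becomes the maximum precision
    # found at that rank or any later (higher-recall) rank.
    for i in range(len(precisions) - 2, -1, -1):
        precisions[i] = max(precisions[i], precisions[i + 1])

    # Sum the area under the interpolated precision-recall curve.
    ap = 0.0
    previous_recall = 0.0
    for recall, precision in zip(recalls, precisions):
        ap += (recall - previous_recall) * precision
        previous_recall = recall
    return ap


def mean_average_precision(per_class_ap):
    """Mean of per-class AP values, given as a {class_name: ap} dict."""
    return sum(per_class_ap.values()) / len(per_class_ap)
```

Under this scheme, a class whose detections are all correct and cover every ground-truth object scores an AP of 1.0, and each false positive ranked above a true positive pulls the value down. The mAP is then the unweighted mean over the classes being evaluated.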

updated: Wed Nov 20 2019 15:29:12 GMT+0000 (UTC)

published: Wed Nov 20 2019 15:29:12 GMT+0000 (UTC)

arXiv
