Towards Real-World Prohibited Item Detection: A Large-Scale X-ray Benchmark

Boying Wang; Libo Zhang; Longyin Wen; Xianglong Liu; Yanjun Wu

Automatic security inspection using computer vision is challenging in real-world scenarios because of factors such as intra-class variance, class imbalance, and occlusion. Because large-scale datasets are lacking, most previous methods rarely address cases in which prohibited items are deliberately hidden among cluttered objects, which restricts their application in real-world scenarios. Towards real-world prohibited item detection, we collect a large-scale dataset, named PIDray, which covers a variety of real-world cases, especially deliberately hidden items. Collected with considerable effort, the dataset contains 12 categories of prohibited items in 47,677 X-ray images with high-quality annotated segmentation masks and bounding boxes. To the best of our knowledge, it is the largest prohibited item detection dataset to date. Meanwhile, we design the selective dense attention network (SDANet) as a strong baseline; it consists of a dense attention module and a dependency refinement module. The dense attention module, formed by spatial and channel-wise dense attention, is designed to learn discriminative features that boost performance. The dependency refinement module exploits the dependencies among multi-scale features. Extensive experiments on the collected PIDray dataset demonstrate that the proposed method performs favorably against state-of-the-art methods, especially in detecting deliberately hidden items.
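The abstract does not specify how the dense attention module is built. As a rough illustration of the general idea of combining channel-wise and spatial attention on a feature map, the sketch below uses a simplified, parameter-free gating scheme in plain Python. The function names, the sigmoid-of-mean gates, and the toy feature map are all assumptions made for illustration; they are not the SDANet design, which would use learned weights and dense connections.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def channel_attention(feat):
    """Scale each channel by a gate derived from its global average.

    feat is a list of C channels, each an H x W list of lists.
    Hypothetical, parameter-free stand-in for a learned channel gate.
    """
    out = []
    for channel in feat:
        values = [v for row in channel for v in row]
        gate = sigmoid(sum(values) / len(values))  # one scalar per channel
        out.append([[v * gate for v in row] for row in channel])
    return out

def spatial_attention(feat):
    """Scale each location by a gate derived from its mean over channels.

    Hypothetical, parameter-free stand-in for a learned spatial gate.
    """
    num_c, height, width = len(feat), len(feat[0]), len(feat[0][0])
    gates = [
        [sigmoid(sum(feat[c][i][j] for c in range(num_c)) / num_c)
         for j in range(width)]
        for i in range(height)
    ]
    return [
        [[feat[c][i][j] * gates[i][j] for j in range(width)]
         for i in range(height)]
        for c in range(num_c)
    ]

# Toy feature map: 2 channels of size 2 x 2 (made-up values).
feat = [
    [[1.0, 2.0], [3.0, 4.0]],
    [[0.0, 0.0], [0.0, 0.0]],
]

# Apply channel attention first, then spatial attention.
attended = spatial_attention(channel_attention(feat))
```

Both gates lie strictly between 0 and 1, so this toy version can only attenuate features; a learned module would decide which channels and locations to emphasize from training data.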

updated: Mon Aug 16 2021 11:14:16 GMT+0000 (UTC)

published: Mon Aug 16 2021 11:14:16 GMT+0000 (UTC)

arXiv
