PIDray: A Large-scale X-ray Benchmark for Real-World Prohibited Item Detection

Libo Zhang; Lutao Jiang; Ruyi Ji; Heng Fan

PIDray: 実世界の禁止アイテム検出のための大規模 X 線ベンチマーク

コンピュータービジョンテクノロジに依存する自動セキュリティ検査は、クラス内分散、クラスの不均衡、閉塞などの多くの要因により、実際のシナリオでは困難なタスクです。これまでのほとんどの方法は、大規模なデータセットが不足しているため、禁止されたアイテムが乱雑なオブジェクトに意図的に隠されている場合にほとんど触れず、その適用を妨げていました。この問題に対処し、関連する研究を促進するために、PIDray という名前の大規模なデータセットを提示します。これは、禁止されたアイテムの検出、特に意図的に隠されたアイテムの実際のシナリオでのさまざまなケースをカバーしています。具体的には、PIDray は 12 カテゴリの禁止品目の 124,486 枚の X 線画像を収集し、各画像には慎重な検査によって手動で注釈が付けられます。これにより、これまでで最大の禁止品目検出データセットになります。一方、PIDray のベースラインアルゴリズムを開発するための一般的な分割統治パイプラインを提案します。具体的には、ツリーのような構造を採用して、PIDray データセットのロングテールの問題の影響を抑制します。最初のコースグレインノードには、ヘッドカテゴリの影響を軽減するためのバイナリ分類が割り当てられます。グレインノードは、テールカテゴリの特定のタスク専用です。このシンプルでありながら効果的なスキームに基づいて、オブジェクト検出、インスタンスセグメンテーション、マルチラベル分類タスク全体で強力なタスク固有のベースラインを提供し、一般的なデータセット (COCO や PASCAL VOC など) で一般化機能を検証します。 PIDray に関する広範な実験は、提案された方法が現在の最先端の方法に対して、特に意図的に隠されたアイテムに対して有利に機能することを示しています。ベンチマークとコードは、https://github.com/lutao2021/PIDray でリリースされます。

Automatic security inspection relying on computer vision technology is a challenging task in real-world scenarios due to many factors, such as intra-class variance, class imbalance, and occlusion. Most previous methods rarely touch the cases where the prohibited items are deliberately hidden in messy objects because of the scarcity of large-scale datasets, hindering their applications. To address this issue and facilitate related research, we present a large-scale dataset, named PIDray, which covers various cases in real-world scenarios for prohibited item detection, especially for deliberately hidden items. In specific, PIDray collects 124,486 X-ray images for 12 categories of prohibited items, and each image is manually annotated with careful inspection, which makes it, to our best knowledge, to largest prohibited items detection dataset to date. Meanwhile, we propose a general divide-and-conquer pipeline to develop baseline algorithms on PIDray. Specifically, we adopt the tree-like structure to suppress the influence of the long-tailed issue in the PIDray dataset, where the first course-grained node is tasked with the binary classification to alleviate the influence of head category, while the subsequent fine-grained node is dedicated to the specific tasks of the tail categories. Based on this simple yet effective scheme, we offer strong task-specific baselines across object detection, instance segmentation, and multi-label classification tasks and verify the generalization ability on common datasets (e.g., COCO and PASCAL VOC). Extensive experiments on PIDray demonstrate that the proposed method performs favorably against current state-of-the-art methods, especially for deliberately hidden items. Our benchmark and codes will be released at https://github.com/lutao2021/PIDray.

updated: Sat Nov 19 2022 18:31:34 GMT+0000 (UTC)

published: Sat Nov 19 2022 18:31:34 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト