Large-Scale Open-Set Classification Protocols for ImageNet

Jesus Andres Palechor Anacona; Annesha Bhoumik; Manuel Günther

ImageNet 用の大規模オープンセット分類プロトコル

Open-Set Classification (OSC) は、クローズドセット分類モデルを実世界のシナリオに適応させることを目的としています。この場合、分類器は既知のクラスのサンプルに正しくラベルを付ける必要がありますが、これまでに見られなかった未知のサンプルを拒否します。ごく最近になって、これらの未知のサンプルを正しく処理できるアルゴリズムの調査が開始されました。これらのアプローチの一部は、分類器が拒否することを学習したネガティブサンプルをトレーニングセットに含めることで OSC に対処します。これらのデータが、未知のクラスに対する分類器の堅牢性を高めることを期待しています。これらのアプローチのほとんどは、MNIST、SVHN、または CIFAR などの小規模で低解像度の画像データセットで評価されるため、現実世界への適用可能性を評価し、それらを相互に比較することは困難です。既知のクラスと未知のクラスの間で異なるレベルの類似性を持つ自然画像の豊富なデータセットを提供する 3 つのオープンセットプロトコルを提案します。プロトコルは、実際のシナリオに近いトレーニングおよびテストデータを提供するために選択された ImageNet クラスのサブセットで構成されています。さらに、深層学習モデルのトレーニングが既知のサンプルの分類と未知のサンプルの拒否の両方に対応しているかどうかを評価するために使用できる新しい検証メトリックを提案します。プロトコルを使用して、2 つのベースラインオープンセットアルゴリズムのパフォーマンスを標準の SoftMax ベースラインと比較し、アルゴリズムがトレーニング中に見られたネガティブサンプルと部分的に分布外検出タスクでうまく機能することを発見しましたが、ドロップします。以前に見られなかった未知のクラスからのサンプルの存在下でのパフォーマンス。

Open-Set Classification (OSC) intends to adapt closed-set classification models to real-world scenarios, where the classifier must correctly label samples of known classes while rejecting previously unseen unknown samples. Only recently, research started to investigate on algorithms that are able to handle these unknown samples correctly. Some of these approaches address OSC by including into the training set negative samples that a classifier learns to reject, expecting that these data increase the robustness of the classifier on unknown classes. Most of these approaches are evaluated on small-scale and low-resolution image datasets like MNIST, SVHN or CIFAR, which makes it difficult to assess their applicability to the real world, and to compare them among each other. We propose three open-set protocols that provide rich datasets of natural images with different levels of similarity between known and unknown classes. The protocols consist of subsets of ImageNet classes selected to provide training and testing data closer to real-world scenarios. Additionally, we propose a new validation metric that can be employed to assess whether the training of deep learning models addresses both the classification of known samples and the rejection of unknown samples. We use the protocols to compare the performance of two baseline open-set algorithms to the standard SoftMax baseline and find that the algorithms work well on negative samples that have been seen during training, and partially on out-of-distribution detection tasks, but drop performance in the presence of samples from previously unseen unknown classes.

updated: Thu Oct 13 2022 07:01:34 GMT+0000 (UTC)

published: Thu Oct 13 2022 07:01:34 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト