Measuring Human Perception to Improve Open Set Recognition

Jin Huang; Student Member; Derek Prijatelj; Justin Dulay; Walter Scheirer

人間の知覚を測定してオープンセットの認識を改善する

オブジェクトが既知であるか新規であるかを認識する人間の能力は、現在、すべてのオープンセット認識アルゴリズムよりも優れています。心理学からの視覚心理物理学の方法と手順によって測定される人間の知覚は、コンピュータービジョンの視覚認識タスクの新規性を管理するための追加のデータストリームを提供できます。たとえば、人間の被験者から測定された反応時間は、既知のクラスのサンプルが新しいクラスのサンプルと混同される可能性があるかどうかについての洞察を提供できます。この作業では、オブジェクト認識に関連する 200,000 を超える人間の反応時間測定値を収集する大規模な行動実験を設計および実行しました。収集されたデータは、サンプルレベルでオブジェクト間で有意に異なる反応時間を示しました。したがって、さまざまな画像に対して可変の反応時間を示す深いネットワークでの人間の行動との一貫性を強制する、新しい心理物理学的損失関数を設計しました。生物学的視覚の場合と同様に、このアプローチにより、ラベル付けされたトレーニングデータが限られているレジームで優れたオープンセット認識パフォーマンスを達成できます。 ImageNet のデータを使用した実験を通じて、この新しい定式化でマルチスケール DenseNet をトレーニングすると、大幅な改善が観察されました。損失関数でトレーニングされたモデルは、トップ 1 の検証精度を 7%、既知のサンプルでのトップ 1 のテスト精度を 18% 大幅に改善しました。、未知のサンプルでのトップ 1 のテスト精度は 33% です。私たちの方法を文献からの 10 のオープンセット認識方法と比較しましたが、これらはすべて複数の指標で優れていました。

The human ability to recognize when an object is known or novel currently outperforms all open set recognition algorithms. Human perception as measured by the methods and procedures of visual psychophysics from psychology can provide an additional data stream for managing novelty in visual recognition tasks in computer vision. For instance, measured reaction time from human subjects can offer insight as to whether a known class sample may be confused with a novel one. In this work, we designed and performed a large-scale behavioral experiment that collected over 200,000 human reaction time measurements associated with object recognition. The data collected indicated reaction time varies meaningfully across objects at the sample level. We therefore designed a new psychophysical loss function that enforces consistency with human behavior in deep networks which exhibit variable reaction time for different images. As in biological vision, this approach allows us to achieve good open set recognition performance in regimes with limited labeled training data. Through experiments using data from ImageNet, significant improvement is observed when training Multi-Scale DenseNets with this new formulation: models trained with our loss function significantly improved top-1 validation accuracy by 7%, top-1 test accuracy on known samples by 18%, and top-1 test accuracy on unknown samples by 33%. We compared our method to 10 open set recognition methods from the literature, which were all outperformed on multiple metrics.

updated: Thu Sep 08 2022 01:19:36 GMT+0000 (UTC)

published: Thu Sep 08 2022 01:19:36 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト