Evaluating Visual Number Discrimination in Deep Neural Networks

Ivana Kajić; Aida Nematzadeh

深層ニューラルネットワークにおける視覚的数弁別の評価

大量と少量を区別する能力は、人間と動物の両方における基本的な数的能力の中核的な側面です。この作業では、視覚用に設計された最先端のニューラルネットワークがこの基本的な能力をどの程度発揮するかを調べます。動物と幼児の数値認知の研究に動機付けられて、数値二分法を使用して、神経アーキテクチャのさまざまなファミリーでの数値弁別をテストします。私たちの結果は、視覚固有の誘導バイアスが数の識別に役立つことを示唆しています。そのようなバイアスを持つモデルは、タスクのテストエラーが最も少なく、多くの場合、タスクを実行する人間や動物のものに定性的に似た心理測定曲線を持っているからです。ただし、パフォーマンスの標準メトリクスで測定された最も強力なモデルでさえ、さまざまなトレーニングおよびテスト条件を使用した転送実験で量を区別できず、そのような誘導バイアスが十分ではない可能性があることを示しています。

The ability to discriminate between large and small quantities is a core aspect of basic numerical competence in both humans and animals. In this work, we examine the extent to which the state-of-the-art neural networks designed for vision exhibit this basic ability. Motivated by studies in animal and infant numerical cognition, we use the numerical bisection procedure to test number discrimination in different families of neural architectures. Our results suggest that vision-specific inductive biases are helpful in numerosity discrimination, as models with such biases have lowest test errors on the task, and often have psychometric curves that qualitatively resemble those of humans and animals performing the task. However, even the strongest models, as measured on standard metrics of performance, fail to discriminate quantities in transfer experiments with differing training and testing conditions, indicating that such inductive biases might not be sufficient.

updated: Mon Mar 13 2023 15:14:26 GMT+0000 (UTC)

published: Mon Mar 13 2023 15:14:26 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト