Modeling Gestalt Visual Reasoning on the Raven's Progressive Matrices Intelligence Test Using Generative Image Inpainting Techniques

Tianyu Hua; Maithilee Kunda

ジェネレイティブイメージインペインティングテクニックを使用したレイヴンのプログレッシブマトリックスインテリジェンステストでのゲシュタルト視覚推論のモデリング

心理学者は、レイヴンのプログレッシブ行列が一般的な人間の知能の非常に効果的なテストであると認識しています。 AIコミュニティでは、テストに関するさまざまなトップダウンの審議的推論を調査するために多くの計算モデルが開発されていますが、人間のテストでも重要な、ゲシュタルト画像の完成などのボトムアップの知覚プロセスに関する研究はあまりありませんパフォーマンス。この作業では、コンピュータービジョンの生成画像修復手法を使用して、Ravenのテストでのゲシュタルトの視覚的推論をどのようにモデル化できるかを調査します。私たちは、オブジェクトのフォトリアリスティックな画像のみで訓練された自己教師付き修復モデルが、9歳の子供の平均パフォーマンスに対応する色付きプログレッシブマトリックスで27/36のスコアを達成することを示します。また、他のデータセット（顔、場所、テクスチャ）でトレーニングされたモデルも同様に機能しないことを示します。私たちの結果は、実世界の画像で視覚的な規則性を学習することが、人工的なテスト刺激についての成功した推論にどのように変換できるかを示しています。反対に、我々の結果はそのような転送の限界も強調しており、それはレイヴンのような知能テストがしばしば個人の社会文化的背景に敏感である理由を説明するかもしれません。

Psychologists recognize Raven's Progressive Matrices as a very effective test of general human intelligence. While many computational models have been developed by the AI community to investigate different forms of top-down, deliberative reasoning on the test, there has been less research on bottom-up perceptual processes, like Gestalt image completion, that are also critical in human test performance. In this work, we investigate how Gestalt visual reasoning on the Raven's test can be modeled using generative image inpainting techniques from computer vision. We demonstrate that a self-supervised inpainting model trained only on photorealistic images of objects achieves a score of 27/36 on the Colored Progressive Matrices, which corresponds to average performance for nine-year-old children. We also show that models trained on other datasets (faces, places, and textures) do not perform as well. Our results illustrate how learning visual regularities in real-world images can translate into successful reasoning about artificial test stimuli. On the flip side, our results also highlight the limitations of such transfer, which may explain why intelligence tests like the Raven's are often sensitive to people's individual sociocultural backgrounds.

updated: Tue Nov 26 2019 08:32:20 GMT+0000 (UTC)

published: Mon Nov 18 2019 16:16:55 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト