Flexible Compositional Learning of Structured Visual Concepts

Yanli Zhou; Brenden M. Lake

構造化された視覚的概念の柔軟な構成学習

人間は非常に効率的な学習者であり、ほんの数例から新しい概念の意味を理解する能力があります。一般的なコンピュータビジョンシステムとは異なり、人間は視覚世界の構成構造を柔軟に活用して、新しい概念を既存の概念の組み合わせとして理解することができます。現在の論文では、豊かな関係構造を持つ抽象的な視覚形式を使用して、人々がさまざまなタイプの視覚的構成をどのように学習するかを研究しています。さまざまなシナリオのほんの数例から、人々が意味のある構成の一般化を行うことができることがわかり、行動データに密接に適合するベイズプログラム帰納モデルを開発します。構成性の特殊なケースを調べる過去の作業とは異なり、私たちの作業は、単一の計算アプローチが多くの異なるタイプの構成一般化をどのように説明できるかを示しています。

Humans are highly efficient learners, with the ability to grasp the meaning of a new concept from just a few examples. Unlike popular computer vision systems, humans can flexibly leverage the compositional structure of the visual world, understanding new concepts as combinations of existing concepts. In the current paper, we study how people learn different types of visual compositions, using abstract visual forms with rich relational structure. We find that people can make meaningful compositional generalizations from just a few examples in a variety of scenarios, and we develop a Bayesian program induction model that provides a close fit to the behavioral data. Unlike past work examining special cases of compositionality, our work shows how a single computational approach can account for many distinct types of compositional generalization.

updated: Thu May 20 2021 15:48:05 GMT+0000 (UTC)

published: Thu May 20 2021 15:48:05 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト