A HINT from Arithmetic: On Systematic Generalization of Perception, Syntax, and Semantics

Qing Li; Siyuan Huang; Yining Hong; Yixin Zhu; Ying Nian Wu; Song-Chun Zhu

算術からのヒント：知覚、構文、および意味論の体系的な一般化について

算数を習得し、目に見えない問題に一般化する人間の驚くべき能力に触発されて、知覚、構文、および意味論の3つの異なるレベルで一般化可能な概念を学習するマシンの能力を研究するための新しいデータセットHINTを提示します。特に、数字と演算子の両方を含むHINTの概念は、弱く監視された方法で学習する必要があります。手書き式の最終結果のみが監視として提供されます。学習エージェントは、画像などの生の信号から概念がどのように認識されるか（つまり、知覚）、複数の概念が構造的に組み合わされて有効な式（つまり、構文）を形成する方法、および概念が実現されてさまざまな推論タスクを実行する方法（つまり、、セマンティクス）。体系的な一般化に焦点を当てて、学習した概念の内挿と外挿の両方を評価するために、5つのテストセットを慎重に設計します。この困難な問題に取り組むために、ニューラルネットワークを文法解析およびプログラム合成と統合することによってニューラルシンボリックシステムを提案します。これは、新しい演繹-拉致戦略によって学習されます。実験では、提案されたニューラルシンボリックシステムは強力な一般化機能を示し、RNNやTransformerなどのエンドツーエンドのニューラルメソッドを大幅に上回っています。結果は、構文とセマンティクスの外挿に対する再帰的事前確率の重要性も示しています。

Inspired by humans' remarkable ability to master arithmetic and generalize to unseen problems, we present a new dataset, HINT, to study machines' capability of learning generalizable concepts at three different levels: perception, syntax, and semantics. In particular, concepts in HINT, including both digits and operators, are required to learn in a weakly-supervised fashion: Only the final results of handwriting expressions are provided as supervision. Learning agents need to reckon how concepts are perceived from raw signals such as images (i.e., perception), how multiple concepts are structurally combined to form a valid expression (i.e., syntax), and how concepts are realized to afford various reasoning tasks (i.e., semantics). With a focus on systematic generalization, we carefully design a five-fold test set to evaluate both the interpolation and the extrapolation of learned concepts. To tackle this challenging problem, we propose a neural-symbolic system by integrating neural networks with grammar parsing and program synthesis, learned by a novel deduction--abduction strategy. In experiments, the proposed neural-symbolic system demonstrates strong generalization capability and significantly outperforms end-to-end neural methods like RNN and Transformer. The results also indicate the significance of recursive priors for extrapolation on syntax and semantics.

updated: Tue Mar 02 2021 01:32:54 GMT+0000 (UTC)

published: Tue Mar 02 2021 01:32:54 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト