How Modular Should Neural Module Networks Be for Systematic Generalization?

Vanessa D'Amario; Tomotake Sasaki; Xavier Boix

Neural Module Networks (NMNs) approach Visual Question Answering (VQA) by composing modules that each tackle a sub-task. NMNs are a promising strategy for achieving systematic generalization, i.e., overcoming biasing factors in the training distribution. However, the aspects of NMNs that facilitate systematic generalization are not fully understood. In this paper, we demonstrate that the stage and the degree at which modularity is defined have a large influence on systematic generalization. In a series of experiments on three VQA datasets (MNIST with multiple attributes, SQOOP, and CLEVR-CoGenT), our results reveal that tuning the degree of modularity in the network, especially at the image encoder stage, yields substantially higher systematic generalization. These findings lead to new NMN architectures that outperform previous ones in terms of systematic generalization.
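To make "degree of modularity at the image encoder stage" concrete, here is a minimal sketch of one way that degree could be varied: a single encoder shared by all sub-tasks, one encoder per group of related sub-tasks, or one encoder per sub-task. The sub-task names, the grouping, and the function names are hypothetical and chosen only for illustration; the abstract does not specify them, and this is not the authors' implementation.

```python
from typing import Dict

# Hypothetical sub-tasks and grouping (assumptions, not from the paper).
SUBTASKS = ["is_red", "is_blue", "is_large", "is_small"]
GROUPS: Dict[str, str] = {
    "is_red": "color",
    "is_blue": "color",
    "is_large": "size",
    "is_small": "size",
}


def encoder_key(subtask: str, degree: str) -> str:
    """Return the name of the image-encoder module a sub-task would use."""
    if degree == "all":
        # Lowest modularity: every sub-task shares one encoder.
        return "shared"
    if degree == "group":
        # Intermediate modularity: one encoder per group of sub-tasks.
        return GROUPS[subtask]
    if degree == "subtask":
        # Highest modularity: one encoder per sub-task.
        return subtask
    raise ValueError(f"unknown degree of modularity: {degree!r}")


def build_encoder_table(degree: str) -> Dict[str, str]:
    """Map every sub-task to its encoder module for the given degree."""
    return {s: encoder_key(s, degree) for s in SUBTASKS}


def count_encoders(degree: str) -> int:
    """Number of distinct image-encoder modules the network would hold."""
    return len(set(build_encoder_table(degree).values()))
```

Under these assumptions, moving from `"all"` to `"group"` to `"subtask"` increases the number of separate encoder modules, which is the kind of knob the abstract describes tuning.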

updated: Tue Jun 15 2021 14:13:47 GMT+0000 (UTC)

published: Tue Jun 15 2021 14:13:47 GMT+0000 (UTC)

arXiv
