TraVLR: Now You See It, Now You Don't! A Bimodal Dataset for Evaluating Visio-Linguisic Reasoning

Keng Ji Chow; Samson Tan; Min-Yen Kan

TraVLR: 見えるようになりましたが、見えなくなりました!視覚言語推論を評価するためのバイモーダルデータセット

多数の視覚言語 (V+L) 表現学習方法が開発されていますが、既存のデータセットでは、それらが統一された空間で視覚的および言語的概念をどの程度表現しているかを適切に評価していません。クロスモーダル転送を含む、V + L モデルのいくつかの新しい評価設定を提案します。さらに、既存の V+L ベンチマークでは、データセット全体のグローバル精度スコアが報告されることが多く、モデルが失敗して成功する特定の推論タスクを特定することが困難になります。 4 つの V+L 推論タスクを含む合成データセットである TraVLR を紹介します。 TraVLR の総合的な性質により、タスク関連の次元に沿ってトレーニングとテストの分布を制約することができ、分布外の一般化の評価が可能になります。 TraVLR の各例では、シーンを 2 つのモダリティで重複してエンコードし、関連情報を失うことなく、トレーニングまたはテスト中に削除または追加できます。最先端の 4 つの V+L モデルのパフォーマンスを比較すると、同じモダリティのテスト例ではうまく機能しますが、クロスモーダル転送ではすべて失敗し、追加または削除に対応する成功は限られています。 1 つのモダリティ。 TraVLR は、研究コミュニティ向けのオープンチャレンジとしてリリースされています。

Numerous visio-linguistic (V+L) representation learning methods have been developed, yet existing datasets do not adequately evaluate the extent to which they represent visual and linguistic concepts in a unified space. We propose several novel evaluation settings for V+L models, including cross-modal transfer. Furthermore, existing V+L benchmarks often report global accuracy scores on the entire dataset, making it difficult to pinpoint the specific reasoning tasks that models fail and succeed at. We present TraVLR, a synthetic dataset comprising four V+L reasoning tasks. TraVLR's synthetic nature allows us to constrain its training and testing distributions along task-relevant dimensions, enabling the evaluation of out-of-distribution generalisation. Each example in TraVLR redundantly encodes the scene in two modalities, allowing either to be dropped or added during training or testing without losing relevant information. We compare the performance of four state-of-the-art V+L models, finding that while they perform well on test examples from the same modality, they all fail at cross-modal transfer and have limited success accommodating the addition or deletion of one modality. We release TraVLR as an open challenge for the research community.

updated: Sat Mar 04 2023 12:57:41 GMT+0000 (UTC)

published: Sun Nov 21 2021 07:22:44 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト