Toward a Human-Level Video Understanding Intelligence

Yu-Jung Heo; Minsu Lee; Seongho Choi; Woo Suk Choi; Minjung Shin; Minjoon Jung; Jeh-Kwang Ryu; Byoung-Tak Zhang

インテリジェンスを理解する人間レベルのビデオに向けて

ビデオクリップを見たり、ビデオストーリーについて人間と会話したりできるAIエージェントの開発を目指しています。ビデオ理解インテリジェンスの開発は非常に困難な作業であり、AIエージェントの進行状況を適切に測定および分析するための評価方法も不足しています。この論文では、ビデオ理解インテリジェンスの効果的かつ実用的な評価とAIエージェントの人間らしさの評価を提供するビデオチューリングテストを提案します。ビデオチューリングテストの一般的な形式と手順を定義し、提案されたテストの有効性と有用性を確認するためのケーススタディを提示します。

We aim to develop an AI agent that can watch video clips and have a conversation with human about the video story. Developing video understanding intelligence is a significantly challenging task, and evaluation methods for adequately measuring and analyzing the progress of AI agent are lacking as well. In this paper, we propose the Video Turing Test to provide effective and practical assessments of video understanding intelligence as well as human-likeness evaluation of AI agents. We define a general format and procedure of the Video Turing Test and present a case study to confirm the effectiveness and usefulness of the proposed test.

updated: Mon Oct 18 2021 01:46:06 GMT+0000 (UTC)

published: Fri Oct 08 2021 15:41:18 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト