Tiny Video Networks

AJ Piergiovanni; Anelia Angelova; Michael S. Ryoo

Video understanding is a challenging problem with great impact on the abilities of autonomous agents working in the real world. Yet solutions so far have been computationally intensive, with the fastest algorithms running for more than half a second per video snippet on powerful GPUs. We propose Tiny Video Networks, a novel approach to video architecture learning that automatically designs highly efficient models for video understanding. The tiny video models achieve competitive performance while running in as little as 37 milliseconds per video on a CPU and 10 milliseconds on a standard GPU.

updated: Wed Jun 30 2021 01:25:26 GMT+0000 (UTC)

published: Tue Oct 15 2019 17:55:37 GMT+0000 (UTC)

arXiv
