Discerning Generic Event Boundaries in Long-Form Wild Videos

Ayush K Rai; Tarun Krishna; Julia Dietlmeier; Kevin McGuinness; Alan F Smeaton; Noel E O'Connor

Detecting generic, taxonomy-free event boundaries in videos represents a major stride forward towards holistic video understanding. In this paper we present a technique for generic event boundary detection based on a two-stream inflated 3D convolutions architecture, which can learn spatio-temporal features from videos. Our work is inspired by the Generic Event Boundary Detection Challenge (part of the CVPR 2021 Long Form Video Understanding (LOVEU) Workshop). Throughout the paper we provide an in-depth analysis of the experiments performed along with an interpretation of the results obtained.

updated: Fri Jun 18 2021 12:28:19 GMT+0000 (UTC)

published: Fri Jun 18 2021 12:28:19 GMT+0000 (UTC)

arXiv
