Video Imprint

Zhanning Gao; Le Wang; Nebojsa Jojic; Zhenxing Niu; Nanning Zheng; Gang Hua

ビデオインプリント

Video Imprint

新しい統合ビデオ分析フレームワーク (ER3) は、提案されたビデオインプリント表現に基づいて、複雑なイベントの検索、認識、および再カウントのために提案され、ビデオフレーム全体の画像特徴間の時間的相関を利用します。ビデオインプリント表現を使用すると、マップをビデオフレームの時間的および空間的位置に戻すことができ、各フレーム内のキーフレームの識別とキー領域の位置特定の両方が可能になります。提案されたフレームワークでは、テンソル表現、つまりビデオインプリントを生成するために、フレーム全体で冗長性を除去するために専用の特徴アラインメントモジュールが組み込まれています。その後、ビデオインプリントは、それぞれイベント認識/再カウントおよびイベント検索タスクのために、推論ネットワークと機能集約モジュールの両方に個別に供給されます。言語モデリングで使用されるメモリネットワークに触発された注意メカニズムのおかげで、提案された推論ネットワークは、イベントカテゴリの認識と、イベントの再カウントのための重要な証拠のローカリゼーションを同時に行うことができます。さらに、私たちの推論ネットワークの潜在的な構造は、イベントの再集計に直接使用できるビデオインプリントの領域を強調しています。イベント検索タスクでは、ビデオインプリントから集約されたコンパクトなビデオ表現により、既存の最先端の方法よりも優れた検索結果が得られます。

A new unified video analytics framework (ER3) is proposed for complex event retrieval, recognition and recounting, based on the proposed video imprint representation, which exploits temporal correlations among image features across video frames. With the video imprint representation, it is convenient to reverse map back to both temporal and spatial locations in video frames, allowing for both key frame identification and key areas localization within each frame. In the proposed framework, a dedicated feature alignment module is incorporated for redundancy removal across frames to produce the tensor representation, i.e., the video imprint. Subsequently, the video imprint is individually fed into both a reasoning network and a feature aggregation module, for event recognition/recounting and event retrieval tasks, respectively. Thanks to its attention mechanism inspired by the memory networks used in language modeling, the proposed reasoning network is capable of simultaneous event category recognition and localization of the key pieces of evidence for event recounting. In addition, the latent structure in our reasoning network highlights the areas of the video imprint, which can be directly used for event recounting. With the event retrieval task, the compact video representation aggregated from the video imprint contributes to better retrieval results than existing state-of-the-art methods.

updated: Mon Jun 07 2021 00:32:47 GMT+0000 (UTC)

published: Mon Jun 07 2021 00:32:47 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト