Diffusing Surrogate Dreams of Video Scenes to Predict Video Memorability

Lorin Sweeney; Graham Healy; Alan F. Smeaton

ビデオの記憶力を予測するためのビデオシーンの代理夢の拡散

MediaEval 2022 のビデオの記憶力の予測タスクの一環として、視覚的な記憶力、それを特徴付ける視覚的表現、およびその視覚的表現によって描写される基本的な概念の間の関係を探ります。私たちは、代用の夢の画像のみでトレーニングおよびテストされたモデルを使用して、最先端の記憶力予測パフォーマンスを実現し、概念を記憶力の基礎となる機能の状態にまで高め、視覚コンテンツの本質的な記憶力が特定の視覚的表現に関係なく、その根底にある概念または意味に蒸留されます。

As part of the MediaEval 2022 Predicting Video Memorability task we explore the relationship between visual memorability, the visual representation that characterises it, and the underlying concept portrayed by that visual representation. We achieve state-of-the-art memorability prediction performance with a model trained and tested exclusively on surrogate dream images, elevating concepts to the status of a cornerstone memorability feature, and finding strong evidence to suggest that the intrinsic memorability of visual content can be distilled to its underlying concept or meaning irrespective of its specific visual representational.

updated: Mon Dec 19 2022 09:10:23 GMT+0000 (UTC)

published: Mon Dec 19 2022 09:10:23 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト