DTVNet+: A High-Resolution Scenic Dataset for Dynamic Time-lapse Video Generation

Jiangning Zhang; Chao Xu; Yong Liu; Yunliang Jiang

DTVNet +：動的タイムラプスビデオ生成用の高解像度シーニックデータセット

この論文では、正規化されたモーションベクトルを条件とする単一の風景画像から多様なタイムラプスビデオを生成するための、DTVNetという名前の新しいエンドツーエンドの動的タイムラプスビデオ生成フレームワークを紹介します。提案されているDTVNetは、オプティカルフローエンコーダー（OFE）とダイナミックビデオジェネレーター（DVG）の2つのサブモジュールで構成されています。 OFEは、オプティカルフローマップのシーケンスを、生成されたビデオのモーション情報をエンコードする正規化されたモーションベクトルにマッピングします。 DVGには、モーションベクトルと単一の横向き画像から学習するモーションストリームとコンテンツストリームが含まれています。さらに、共有コンテンツ機能を学習するためのエンコーダーと、対応するモーションでビデオフレームを構築するためのデコーダーが含まれています。具体的には、モーションストリームは、オブジェクトのモーションを制御するためのマルチレベルのモーション情報を統合するために、複数のアダプティブインスタンス正規化（AdaIN）レイヤーを導入します。テスト段階では、コンテンツは同じであるがさまざまなモーション情報を持つビデオは、1つの入力画像のみに基づいて異なる正規化されたモーションベクトルによって生成できます。また、Quick-Sky-Timeという名前の高解像度の風景タイムラプスビデオデータセットを提案して、さまざまなアプローチを評価します。これは、高品質の風景画像およびビデオ生成タスクの新しいベンチマークと見なすことができます。さらに、Sky Time-lapse、Beach、およびQuick-Sky-Timeデータセットで実験を行います。結果は、高品質でさまざまな動的ビデオを生成するための最先端の方法に対する私たちのアプローチの優位性を示しています。

This paper presents a novel end-to-end dynamic time-lapse video generation framework, named DTVNet, to generate diversified time-lapse videos from a single landscape image conditioned on normalized motion vectors. The proposed DTVNet consists of two submodules: Optical Flow Encoder (OFE) and Dynamic Video Generator (DVG). The OFE maps a sequence of optical flow maps to a normalized motion vector that encodes the motion information of the generated video. The DVG contains motion and content streams to learn from the motion vector and the single landscape image. Besides, it contains an encoder to learn shared content features and a decoder to construct video frames with corresponding motion. Specifically, the motion stream introduces multiple adaptive instance normalization (AdaIN) layers to integrate multi-level motion information for controlling the object motion. In the testing stage, videos with the same content but various motion information can be generated by different normalized motion vectors based on only one input image. Also, we propose a high-resolution scenic time-lapse video dataset, named Quick-Sky-Time, to evaluate different approaches, which can be viewed as a new benchmark for high-quality scenic image and video generation tasks. We further conduct experiments on Sky Time-lapse, Beach, and Quick-Sky-Time datasets. The results demonstrate the superiority of our approach over state-of-the-art methods for generating high-quality and various dynamic videos.

updated: Fri Dec 17 2021 15:39:34 GMT+0000 (UTC)

published: Tue Aug 11 2020 15:26:10 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト