High Frame Rate Video Reconstruction based on an Event Camera

Liyuan Pan; Richard Hartley; Cedric Scheerlinck; Miaomiao Liu; Xin Yu; Yuchao Dai

Event-based cameras measure intensity changes (called 'events') with microsecond accuracy under high-speed motion and challenging lighting conditions. With the 'active pixel sensor' (APS), the 'Dynamic and Active-pixel Vision Sensor' (DAVIS) outputs intensity frames and events simultaneously. However, the output images are captured at a relatively low frame rate and often suffer from motion blur. A blurred image can be regarded as the integral of a sequence of latent images, while events indicate the changes between those latent images. Thus, we are able to model the blur-generation process by associating event data with a latent sharp image. Based on the abundant event data alongside low frame rate, easily blurred images, we propose a simple yet effective approach to reconstruct high-quality, high frame rate sharp videos. Starting with a single blurred frame and its event data from DAVIS, we propose the Event-based Double Integral (EDI) model and solve it by adding regularization terms. Then, we extend it to the multiple Event-based Double Integral (mEDI) model to obtain smoother results based on multiple images and their events. Furthermore, we provide a new and more efficient solver to minimize the proposed energy model. By optimizing the energy function, we achieve significant improvements in removing blur and in reconstructing a high temporal resolution video. The video generation is based on solving a simple non-convex optimization problem in a single scalar variable. Experimental results on both synthetic and real datasets demonstrate the superiority of our mEDI model and optimization method compared to the state-of-the-art.
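The relationship between a blurred frame, the latent images, and the events can be illustrated for a single pixel. The following is a minimal sketch, not the authors' implementation. It assumes the common event-generation model in which each event changes the log intensity by a fixed contrast threshold `c`, so that a latent value at time t equals the value at a reference time multiplied by exp(c * E(t)), where E(t) is the signed sum of event polarities between the reference time and t. The names `event_sum` and `edi_deblur_pixel`, and all parameter names, are hypothetical and chosen for illustration only; regularization and the multi-frame (mEDI) extension are not shown.

```python
import math


def event_sum(t, t_ref, times, polarities):
    """Signed sum of event polarities between t_ref and t (assumed E(t))."""
    if t >= t_ref:
        return sum(p for s, p in zip(times, polarities) if t_ref < s <= t)
    # Going backwards in time undoes the events that occurred in (t, t_ref].
    return -sum(p for s, p in zip(times, polarities) if t < s <= t_ref)


def edi_deblur_pixel(blurred, times, polarities, c,
                     t_start, t_end, t_ref, n_samples=1000):
    """Estimate the sharp intensity at t_ref for one pixel.

    Assumed model: blurred = mean over the exposure of
    latent(t_ref) * exp(c * E(t)), so dividing the blurred value by the
    mean of exp(c * E(t)) gives latent(t_ref).
    """
    step = (t_end - t_start) / n_samples
    # Midpoint sampling of the exposure interval [t_start, t_end].
    sample_times = [t_start + (k + 0.5) * step for k in range(n_samples)]
    mean_gain = sum(
        math.exp(c * event_sum(t, t_ref, times, polarities))
        for t in sample_times
    ) / n_samples
    return blurred / mean_gain


def latent_at(latent_ref, t, t_ref, times, polarities, c):
    """Propagate the sharp estimate at t_ref to another time t."""
    return latent_ref * math.exp(c * event_sum(t, t_ref, times, polarities))
```

Under these assumptions, calling `latent_at` at many timestamps inside the exposure yields one sharp value per timestamp, which is how a single blurred frame plus its events can give a high frame rate sequence. The only unknown in this sketch is the scalar `c`, which is consistent with the abstract's statement that video generation reduces to an optimization problem in a single scalar variable.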

updated: Wed Nov 11 2020 01:49:56 GMT+0000 (UTC)

published: Tue Mar 12 2019 02:34:23 GMT+0000 (UTC)

arXiv
