Self-Supervised Scene Dynamic Recovery from Rolling Shutter Images and Events

Yangguang Wang; Xiang Zhang; Mingyuan Lin; Lei Yu; Boxin Shi; Wen Yang; Gui-Song Xia

Scene Dynamic Recovery (SDR), which inverts distorted Rolling Shutter (RS) images into an undistorted high-frame-rate Global Shutter (GS) video, is a severely ill-posed problem, particularly when prior knowledge about camera/object motion is unavailable. Commonly used artificial assumptions on motion linearity and data-specific characteristics, made about the temporal dynamics information embedded in the RS scanlines, are prone to producing sub-optimal solutions in real-world scenarios. To address this challenge, we propose an event-based RS2GS framework within a self-supervised learning paradigm that leverages the extremely high temporal resolution of event cameras to provide accurate inter/intra-frame information. Within this framework, real-world events and RS images can be exploited to alleviate the performance degradation caused by the domain gap between synthesized and real data. Specifically, an Event-based Inter/intra-frame Compensator (E-IC) is proposed to predict the per-pixel dynamics between arbitrary time intervals, including the temporal transition and spatial translation. Exploring connections in terms of RS-RS, RS-GS, and GS-RS, we explicitly formulate mutual constraints with the proposed E-IC, resulting in supervision without ground-truth GS images. Extensive evaluations on synthetic and real datasets demonstrate that the proposed method achieves state-of-the-art results and shows remarkable performance for event-based RS2GS inversion in real-world scenarios. The dataset and code are available at https://w3un.github.io/selfunroll/.
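
To make the RS-to-GS timing gap concrete, the sketch below computes how far each RS scanline sits in time from a chosen GS timestamp. It is only an illustration of the general rolling-shutter timing model, not the authors' E-IC: it assumes rows are read top to bottom at a uniform rate, and the function names and the parameters `t_start`, `readout_time`, and `t_target` are hypothetical.

```python
def scanline_times(height, t_start, readout_time):
    """Timestamp of each row, assuming uniform top-to-bottom readout.

    readout_time is taken here as the time between the first and the
    last row (an assumption made for this illustration).
    """
    if height < 1:
        raise ValueError("height must be positive")
    if height == 1:
        return [t_start]
    step = readout_time / (height - 1)
    return [t_start + row * step for row in range(height)]


def offsets_to_gs(height, t_start, readout_time, t_target):
    """Per-row time gap between each RS scanline and the target GS timestamp."""
    return [t_target - t for t in scanline_times(height, t_start, readout_time)]
```

For a 5-row image read out over 4 time units with the GS target at the midpoint, `offsets_to_gs(5, 0.0, 4.0, 2.0)` gives one offset per row. Rows with a positive offset were captured before the target time and rows with a negative offset after it, which is why a different temporal transition is needed for every scanline.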

updated: Fri Apr 14 2023 05:30:02 GMT+0000 (UTC)

published: Fri Apr 14 2023 05:30:02 GMT+0000 (UTC)

arXiv
