LAN-HDR: Luminance-based Alignment Network for High Dynamic Range Video Reconstruction

Haesoo Chung; Nam Ik Cho



As demand for high-quality video continues to rise, high-resolution and high dynamic range (HDR) imaging techniques are drawing attention. A critical step in generating an HDR video from low dynamic range (LDR) images is motion compensation between LDR frames, for which most existing works employ optical flow. However, these methods suffer from flow estimation errors when saturation or complicated motion is present. In this paper, we propose an end-to-end HDR video composition framework that aligns LDR frames in the feature space and then merges the aligned features into an HDR frame, without relying on pixel-domain optical flow. Specifically, we propose a luminance-based alignment network for HDR (LAN-HDR) consisting of an alignment module and a hallucination module. The alignment module aligns a frame to the adjacent reference frame by evaluating luminance-based attention, which excludes color information. The hallucination module generates sharp details, especially in areas washed out by saturation. The aligned and hallucinated features are then blended adaptively so that they complement each other. Finally, we merge the features to generate the final HDR frame. In training, we adopt a temporal loss in addition to frame reconstruction losses to enhance temporal consistency and thus reduce flickering. Extensive experiments demonstrate that our method performs better than or comparably to state-of-the-art methods on several benchmarks.
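
The luminance-based attention idea behind the alignment module can be illustrated with a toy sketch. This is not the authors' network: LAN-HDR operates on learned features, whereas the code below attends over raw per-pixel luminance and uses a negative squared distance as the similarity score. The Rec. 709 luma weights, the `temperature` parameter, and the function names are assumptions made for illustration, and the two frames are assumed to be already exposure-normalized.

```python
import numpy as np

# Assumed luma weights (Rec. 709); the abstract does not say which are used.
REC709 = np.array([0.2126, 0.7152, 0.0722])


def luminance(rgb):
    """Convert an (H, W, 3) RGB array to an (H, W) luminance map."""
    return rgb @ REC709


def luminance_attention_align(ref_rgb, nbr_rgb, temperature=1.0):
    """Toy global attention that warps a neighbor frame toward a reference.

    Queries come from the reference luminance, keys from the neighbor
    luminance, and values are the neighbor's RGB pixels. Color never
    enters the attention weights, only the values being gathered.
    """
    h, w, _ = ref_rgb.shape
    q = luminance(ref_rgb).reshape(-1, 1)   # (N, 1) queries
    k = luminance(nbr_rgb).reshape(-1, 1)   # (N, 1) keys
    v = nbr_rgb.reshape(-1, 3)              # (N, 3) values

    # Similarity: negative squared luminance difference (illustrative choice).
    scores = -((q - k.T) ** 2) / temperature
    scores -= scores.max(axis=1, keepdims=True)  # numerical stability
    attn = np.exp(scores)
    attn /= attn.sum(axis=1, keepdims=True)      # row-wise softmax

    aligned = (attn @ v).reshape(h, w, 3)
    return aligned, attn


# Usage: a neighbor frame that is the reference shifted by one pixel.
ref = np.repeat((np.arange(16) / 16.0).reshape(4, 4, 1), 3, axis=2)
nbr = np.roll(ref, 1, axis=1)
aligned, attn = luminance_attention_align(ref, nbr, temperature=1e-4)
```

Because queries and keys are built from luminance only, a chroma offset in the neighbor frame that leaves its luminance unchanged does not alter the attention weights; the gathered values still carry the neighbor's color.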

updated: Tue Aug 22 2023 01:43:00 GMT+0000 (UTC)

published: Tue Aug 22 2023 01:43:00 GMT+0000 (UTC)

arXiv
