Efficient Halftoning via Deep Reinforcement Learning

Haitian Jiang; Dongliang Xiong; Xiaowen Jiang; Li Ding; Liang Chen; Kai Huang

深層強化学習による効率的なハーフトーン処理

ハーフトーン処理は、強度が 2 つの離散レベルに制限されているピクセルを使用して、連続トーンイメージを再現することを目的としています。この技術はすべてのプリンターに導入されており、その大部分は、ハーフトーンの品質を決定する構造の詳細をレンダリングできない高速な方法 (順序付けられたディザリング、誤差拡散など) を採用しています。反対に、最適なハーフトーンソリューションを検索することによって視覚的な楽しみを追求する他の従来の方法は、計算コストが高いという欠点があります。この論文では、データ駆動型アプローチによる高速で構造を意識したハーフトーン処理方法を提案します。具体的には、ハーフトーン処理を強化学習問題として定式化します。この問題では、各バイナリピクセルの値が、共有の完全畳み込みニューラルネットワーク (CNN) ポリシーを持つ仮想エージェントによって選択されたアクションと見なされます。オフラインフェーズでは、効果的な勾配推定器を使用して、1 つのアクションステップで高品質のハーフトーンを生成できるようにエージェントをトレーニングします。その後、1 回の高速 CNN 推論によってハーフトーンをオンラインで生成できます。さらに、望ましいブルーノイズ特性をもたらす、新しい異方性抑制損失関数を提案します。最後に、SSIM を最適化すると平坦な領域に穴ができることがわかりました。これは、コントーンのコントラストマップでメトリックを重み付けすることで回避できます。実験では、フレームワークが以前の構造認識方法よりも 15 倍高速な軽量 CNN を効果的にトレーニングして、満足のいく視覚品質のブルーノイズハーフトーンを生成できることが示されています。また、手法の拡張性を実証するために、ディープマルチトーニングのプロトタイプも示します。

Halftoning aims to reproduce a continuous-tone image with pixels whose intensities are constrained to two discrete levels. This technique has been deployed on every printer, and the majority of them adopt fast methods (e.g., ordered dithering, error diffusion) that fail to render structural details, which determine halftone's quality. Other prior methods of pursuing visual pleasure by searching for the optimal halftone solution, on the contrary, suffer from their high computational cost. In this paper, we propose a fast and structure-aware halftoning method via a data-driven approach. Specifically, we formulate halftoning as a reinforcement learning problem, in which each binary pixel's value is regarded as an action chosen by a virtual agent with a shared fully convolutional neural network (CNN) policy. In the offline phase, an effective gradient estimator is utilized to train the agents in producing high-quality halftones in one action step. Then, halftones can be generated online by one fast CNN inference. Besides, we propose a novel anisotropy suppressing loss function, which brings the desirable blue-noise property. Finally, we find that optimizing SSIM could result in holes in flat areas, which can be avoided by weighting the metric with the contone's contrast map. Experiments show that our framework can effectively train a light-weight CNN, which is 15x faster than previous structure-aware methods, to generate blue-noise halftones with satisfactory visual quality. We also present a prototype of deep multitoning to demonstrate the extensibility of our method.

updated: Fri Oct 13 2023 03:40:42 GMT+0000 (UTC)

published: Mon Apr 24 2023 15:03:37 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト