Learning to Estimate Kernel Scale and Orientation of Defocus Blur with Asymmetric Coded Aperture

Jisheng Li; Qi Dai; Jiangtao Wen

非対称符号化開口によるデフォーカスブラーのカーネルスケールと方向の推定の学習

一貫性のある焦点の合った入力画像は、マシンビジョンシステムが動的環境を認識するための必須の前提条件です。焦点ぼけは、視覚システムの性能を著しく低下させます。この問題に取り組むために、我々は、レンズの焦点を迅速に調整するために、カーネルスケールと焦点ぼけの方向を推定する深層学習ベースのフレームワークを提案します。私たちのパイプラインは、入力スタックから最適なスライスを選択するために、可変数の入力仮説に3DConvNetを利用します。ネットワークパフォーマンスを向上させるために、ランダムシャッフルとGumbel-softmaxを使用します。また、トレーニングを容易にするために、さまざまな非対称符号化開口を備えた合成デフォーカス画像を生成することを提案します。私たちのフレームワークの有効性を実証するために実験が行われます。

Consistent in-focus input imagery is an essential precondition for machine vision systems to perceive the dynamic environment. A defocus blur severely degrades the performance of vision systems. To tackle this problem, we propose a deep-learning-based framework estimating the kernel scale and orientation of the defocus blur to adjust lens focus rapidly. Our pipeline utilizes 3D ConvNet for a variable number of input hypotheses to select the optimal slice from the input stack. We use random shuffle and Gumbel-softmax to improve network performance. We also propose to generate synthetic defocused images with various asymmetric coded apertures to facilitate training. Experiments are conducted to demonstrate the effectiveness of our framework.

updated: Wed Mar 10 2021 03:12:15 GMT+0000 (UTC)

published: Wed Mar 10 2021 03:12:15 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト