FisheyeHDK: Hyperbolic Deformable Kernel Learning for Ultra-Wide Field-of-View Image Recognition

Ola Ahmad; Freddy Lecue

Conventional convolutional neural networks (CNNs) trained on narrow field-of-view (FoV) images are the state-of-the-art approach for object recognition tasks. Some methods have proposed adapting CNNs to ultra-wide FoV images by learning deformable kernels. However, these methods are limited by Euclidean geometry, and their accuracy degrades under the strong distortions caused by fisheye projections. In this work, we demonstrate that learning the shape of convolution kernels in non-Euclidean spaces performs better than existing deformable kernel methods. In particular, we propose a new approach that learns deformable kernel parameters (positions) in hyperbolic space. FisheyeHDK is a hybrid CNN architecture that combines hyperbolic and Euclidean convolution layers for position and feature learning. First, we provide an intuition of hyperbolic space for wide FoV images. Using synthetic distortion profiles, we demonstrate the effectiveness of our approach. We select two datasets of perspective images, Cityscapes and BDD100K 2020, and transform them to fisheye equivalents at different scaling factors (analogous to focal lengths). Finally, we provide an experiment on data collected by a real fisheye camera. Validations and experiments show that our approach improves on existing deformable kernel methods for CNN adaptation to fisheye images.
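To give a concrete sense of what it means to represent kernel positions in hyperbolic space, the sketch below implements the standard exponential and logarithmic maps at the origin of the Poincaré ball. This is only an illustration of the general geometry, not the FisheyeHDK layers themselves: the abstract does not specify which hyperbolic model or parameterization is used, so the choice of the Poincaré ball, the curvature parameter `c`, and the function names `exp_map_origin` and `log_map_origin` are all assumptions made for this example.

```python
import math


def exp_map_origin(v, c=1.0):
    """Map a Euclidean tangent vector at the origin into the Poincare ball.

    Assumption: the ball has curvature -c (c > 0), so every mapped point has
    Euclidean norm strictly below 1 / sqrt(c).
    """
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        return [0.0 for _ in v]
    sqrt_c = math.sqrt(c)
    scale = math.tanh(sqrt_c * norm) / (sqrt_c * norm)
    return [scale * x for x in v]


def log_map_origin(y, c=1.0):
    """Inverse of exp_map_origin: map a ball point back to the tangent space."""
    norm = math.sqrt(sum(x * x for x in y))
    if norm == 0.0:
        return [0.0 for _ in y]
    sqrt_c = math.sqrt(c)
    # atanh is only defined inside the ball, i.e. for sqrt_c * norm < 1.
    scale = math.atanh(sqrt_c * norm) / (sqrt_c * norm)
    return [scale * x for x in y]


# Hypothetical example: the 3x3 grid of offsets of a regular convolution kernel.
grid_offsets = [[float(dx), float(dy)] for dy in (-1, 0, 1) for dx in (-1, 0, 1)]

# The same offsets expressed as points of the Poincare ball.
ball_points = [exp_map_origin(offset) for offset in grid_offsets]

# Mapping back recovers the Euclidean offsets (up to floating-point error).
recovered = [log_map_origin(point) for point in ball_points]
```

In this sketch, offsets far from the kernel center are compressed toward the boundary of the ball, whereas offsets near the center are almost unchanged. Very large offsets saturate `tanh` to 1.0 in floating point, so a practical implementation would clip points slightly inside the boundary before applying `log_map_origin`.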

updated: Mon Mar 14 2022 16:37:54 GMT+0000 (UTC)

published: Mon Mar 14 2022 16:37:54 GMT+0000 (UTC)

arXiv
