Self-Supervised Learning of Image Scale and Orientation

Jongmin Lee; Yoonwoo Jeong; Minsu Cho

画像のスケールと向きの自己監視学習

関心のある画像領域に特徴的なポーズ、つまりスケールと向きを割り当てることを学習する問題を研究します。その明らかな単純さにもかかわらず、問題は自明ではありません。モデルが直接学習する明示的なポーズ注釈を含む大規模な画像領域のセットを取得することは困難です。この問題に取り組むために、ヒストグラムアラインメント手法を使用した自己監視学習フレームワークを提案します。ランダムな再スケーリング/回転によって画像パッチのペアを生成し、次に推定器をトレーニングして、それらの相対的な差が使用される再スケーリング/回転と一致するように、それらのスケール/方向の値を予測します。推定者は、監視なしでスケール/方向のノンパラメトリックヒストグラム分布を予測することを学習します。実験によると、スケール/方向の推定において以前の方法を大幅に上回り、パッチポーズをマッチングプロセスに組み込むことで、画像マッチングと6DoFカメラポーズ推定も改善されます。

We study the problem of learning to assign a characteristic pose, i.e., scale and orientation, for an image region of interest. Despite its apparent simplicity, the problem is non-trivial; it is hard to obtain a large-scale set of image regions with explicit pose annotations that a model directly learns from. To tackle the issue, we propose a self-supervised learning framework with a histogram alignment technique. It generates pairs of image patches by random rescaling/rotating and then train an estimator to predict their scale/orientation values so that their relative difference is consistent with the rescaling/rotating used. The estimator learns to predict a non-parametric histogram distribution of scale/orientation without any supervision. Experiments show that it significantly outperforms previous methods in scale/orientation estimation and also improves image matching and 6 DoF camera pose estimation by incorporating our patch poses into a matching process.

updated: Wed Jun 15 2022 02:43:39 GMT+0000 (UTC)

published: Wed Jun 15 2022 02:43:39 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト