Learning View Generalization Functions

Thomas M. Breuel

学習ビューの一般化関数

3D視覚オブジェクト認識のビューからオブジェクトモデルを学習することは、通常、オブジェクトのビュー多様体を記述する関数の関数近似問題として、またはクラス条件付き密度の学習として定式化されます。このホワイトペーパーでは、視覚物体認識で学習するための代替フレームワーク、つまりビュー一般化機能を学習するフレームワークについて説明します。ビュー一般化機能を使用すると、観察者は、個別のモデル取得ステップを必要とせずに、1つ以上の2Dトレーニングビューが与えられたベイズ最適3Dオブジェクト認識を直接実行できます。この論文は、ビューの一般化フレームワークで、2つの広く使用されているメソッド、固有空間とビューの線形結合アプローチを繰り返すことにより、ビューの一般化関数が計算上実用的であることを示しています。この論文は、不均一なぼかしに基づく物体認識の最近の方法へのアプローチに関する。この論文は、シミュレートされた3D「ペーパークリップ」オブジェクトとCOIL-100データベースの実世界の画像の両方の結果を示しており、有用なビュー一般化機能が比較的少数のトレーニング例から現実的に学習できることを示しています。

Learning object models from views in 3D visual object recognition is usually formulated either as a function approximation problem of a function describing the view-manifold of an object, or as that of learning a class-conditional density. This paper describes an alternative framework for learning in visual object recognition, that of learning the view-generalization function. Using the view-generalization function, an observer can perform Bayes-optimal 3D object recognition given one or more 2D training views directly, without the need for a separate model acquisition step. The paper shows that view generalization functions can be computationally practical by restating two widely-used methods, the eigenspace and linear combination of views approaches, in a view generalization framework. The paper relates the approach to recent methods for object recognition based on non-uniform blurring. The paper presents results both on simulated 3D ``paperclip'' objects and real-world images from the COIL-100 database showing that useful view-generalization functions can be realistically be learned from a comparatively small number of training examples.

updated: Sun Dec 02 2007 10:54:40 GMT+0000 (UTC)

published: Sun Dec 02 2007 10:54:40 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト