Fast Wavelet-Based Visual Classification

Guoshen Yu; Jean-Jacques Slotine

高速ウェーブレットベースの視覚分類

Serreらの最近の研究に直接触発された、高速視覚分類への生物学的に動機付けられたアプローチを調査します。具体的には、生物学的精度と計算効率のトレードオフで、ウェーブレットおよびグループレットのような変換を使用して、視覚皮質V1およびV2セルのチューニングを並列処理し、スケールおよび変換の不変性を達成するための最大操作を交互に行います。学習中に特徴選択手順が適用され、認識が加速されます。シンプルな注意のようなフィードバックメカニズムを導入し、複数オブジェクトシーンでの認識と堅牢性を大幅に向上させます。実験では、提案されたアルゴリズムは、物体認識、テクスチャおよび衛星画像の分類、言語の識別、および音の分類に関する最先端の成功率を達成または超えています。

We investigate a biologically motivated approach to fast visual classification, directly inspired by the recent work of Serre et al. Specifically, trading-off biological accuracy for computational efficiency, we explore using wavelet and grouplet-like transforms to parallel the tuning of visual cortex V1 and V2 cells, alternated with max operations to achieve scale and translation invariance. A feature selection procedure is applied during learning to accelerate recognition. We introduce a simple attention-like feedback mechanism, significantly improving recognition and robustness in multiple-object scenes. In experiments, the proposed algorithm achieves or exceeds state-of-the-art success rate on object recognition, texture and satellite image classification, language identification and sound classification.

updated: Sun Jun 08 2008 10:15:04 GMT+0000 (UTC)

published: Sun Jun 08 2008 10:15:04 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト