On representation of natural image patches

Cheng Guo

Starting from first principles, I derive an unsupervised learning method named even code to model the local statistics of natural images. The first version uses orthogonal bases with independent states to model a simple probability distribution over a few pixels. The second version uses a microscopic loss function to learn a nonlinear sparse binary representation of image patches. Distance in the binary representation space reflects image patch similarity. The learned model also has local edge-detecting and orientation-selective units, like those in early visual systems.

updated: Mon Oct 24 2022 07:50:02 GMT+0000 (UTC)

published: Mon Oct 24 2022 07:50:02 GMT+0000 (UTC)

arXiv
