Inference via Sparse Coding in a Hierarchical Vision Model

Joshua Bowren; Luis Sanchez-Giraldo; Odelia Schwartz

Sparse coding has been incorporated into models of the visual cortex for its computational advantages and its connection to biology, but how the level of sparsity contributes to performance on visual tasks is not well understood. In this work, sparse coding was integrated into an existing hierarchical V2 model (Hosoya and Hyvärinen, 2015) by replacing its independent component analysis (ICA) with explicit sparse coding in which the degree of sparsity can be controlled. After training, the sparse coding basis functions learned with a higher degree of sparsity resembled qualitatively different structures, such as curves and corners. The contributions of the models were assessed with image classification tasks, specifically tasks associated with mid-level vision: figure-ground classification, texture classification, and angle prediction between two line stimuli. In addition, the models were compared against a texture sensitivity measure that has been reported in V2 (Freeman et al., 2013) and evaluated on a deleted-region inference task. The experiments show that although sparse coding performed worse than ICA at classifying images, only sparse coding was able to better match the texture sensitivity level of V2 and to infer deleted image regions, and in both cases this came from increasing the degree of sparsity. Higher degrees of sparsity allowed for inference over larger deleted image regions. The mechanism that allows for this inference capability in sparse coding is described here.
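To make "sparse coding in which the degree of sparsity can be controlled" concrete, here is a minimal sketch, assuming the common L1-penalized formulation solved with ISTA (iterative soft thresholding). The abstract does not state the objective or solver the authors used, so this is an illustration only. The names `ista`, `soft_threshold`, `D`, and `lam` are hypothetical, and the random dictionary stands in for learned basis functions.

```python
import numpy as np

def soft_threshold(v, t):
    """Shrink each entry of v toward zero by t (proximal operator of the L1 norm)."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def objective(x, D, a, lam):
    """L1-penalized reconstruction cost: 0.5 * ||x - D a||^2 + lam * ||a||_1."""
    return 0.5 * np.sum((x - D @ a) ** 2) + lam * np.sum(np.abs(a))

def ista(x, D, lam, n_iter=200):
    """Infer coefficients a for input x under dictionary D (assumed formulation).

    lam is the sparsity penalty: larger values push more coefficients to zero.
    """
    # Step size 1/L, where L is the Lipschitz constant of the gradient.
    L = np.linalg.norm(D, 2) ** 2
    a = np.zeros(D.shape[1])
    for _ in range(n_iter):
        grad = D.T @ (D @ a - x)
        a = soft_threshold(a - grad / L, lam / L)
    return a

# Toy data: a random overcomplete dictionary with unit-norm columns (hypothetical).
rng = np.random.default_rng(0)
D = rng.standard_normal((16, 32))
D /= np.linalg.norm(D, axis=0)
x = rng.standard_normal(16)

a_low = ista(x, D, lam=0.05)   # weak penalty
a_high = ista(x, D, lam=0.5)   # strong penalty

# A penalty above max |D^T x| drives every coefficient to zero.
lam_max = 1.1 * np.max(np.abs(D.T @ x))
a_zero = ista(x, D, lam=lam_max)

print("nonzeros (lam=0.05):", np.count_nonzero(a_low))
print("nonzeros (lam=0.5): ", np.count_nonzero(a_high))
print("nonzeros (lam=lam_max):", np.count_nonzero(a_zero))
```

In this sketch the single parameter `lam` plays the role of the sparsity control: raising it typically leaves fewer active coefficients, at the cost of a looser reconstruction of `x`.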

updated: Sun Jan 16 2022 18:34:26 GMT+0000 (UTC)

published: Tue Aug 03 2021 14:55:33 GMT+0000 (UTC)

arXiv
