U(1) Symmetry-breaking Observed in Generic CNN Bottleneck Layers

Louis-François Bouchard; Mohsen Ben Lazreg; Matthew Toews

U(1) 汎用 CNN ボトルネック層で観察される対称性の破れ

深い畳み込みニューラルネットワーク (CNN) を生物学的視覚および基本的な素粒子物理学にリンクする新しいモデルについて報告します。 CNN での情報伝搬は、光学システムとの類推によってモデル化されます。ここでは、2D 空間解像度が焦点 1×1=1 の周りで崩壊するボトルネックの近くに情報が集中します。 3D 空間 (x,y,t) は、イメージプレーンと CNN 層 t の (x,y) 座標によって定義されます。ここで、主光線 (0,0,t) は、両方の光学層を通る情報伝播の方向に進みます。軸と (x,y)=(0,0) に位置する画像の中心ピクセル。これについては、可能な限り最も鮮明な空間焦点が画像平面内の錯乱円に制限されます。私たちの新しい洞察は、主光線 (0,0,t) を、N チャネル活性化空間の正のオルサント I(x,y) ∈R^N+ の内側ベクトルと幾何学的に等価なものとして、たとえばグレースケールに沿ってモデル化することです。 RGB 色空間の (または輝度) ベクトル (t,t,t)。したがって、情報はエネルギーポテンシャル E(x,y,t)=\|I(x,y,t)\|^2 に集中します。これは、特に一般的な CNN のボトルネック層 t の場合、非常に集中し、空間原点 (0,0,t) であり、ボソン粒子のよく知られた「ソンブレロ」ポテンシャルを示します。この対称性は分類で破られ、一般的な事前トレーニング済み CNN モデルのボトルネック層は、画像平面と活性化特徴空間で同時に定義された角度 θ∈U(1) に向かって一貫したクラス固有のバイアスを示します。初期の観察では、トレーニングやチューニングを行わずに、一般的な事前トレーニング済みの CNN アクティベーションマップと最小限のメモリベースの分類スキームから仮説を検証します。ワンホット + U(1) 損失の組み合わせを使用したゼロからのトレーニングにより、ImageNet を含むテストされたすべてのタスクの分類が改善されます。

We report on a novel model linking deep convolutional neural networks (CNN) to biological vision and fundamental particle physics. Information propagation in a CNN is modeled via an analogy to an optical system, where information is concentrated near a bottleneck where the 2D spatial resolution collapses about a focal point 1×1=1. A 3D space (x,y,t) is defined by (x,y) coordinates in the image plane and CNN layer t, where a principal ray (0,0,t) runs in the direction of information propagation through both the optical axis and the image center pixel located at (x,y)=(0,0), about which the sharpest possible spatial focus is limited to a circle of confusion in the image plane. Our novel insight is to model the principal optical ray (0,0,t) as geometrically equivalent to the medial vector in the positive orthant I(x,y) ∈R^N+ of a N-channel activation space, e.g. along the greyscale (or luminance) vector (t,t,t) in RGB colour space. Information is thus concentrated into an energy potential E(x,y,t)=\|I(x,y,t)\|^2, which, particularly for bottleneck layers t of generic CNNs, is highly concentrated and symmetric about the spatial origin (0,0,t) and exhibits the well-known "Sombrero" potential of the boson particle. This symmetry is broken in classification, where bottleneck layers of generic pre-trained CNN models exhibit a consistent class-specific bias towards an angle θ∈U(1) defined simultaneously in the image plane and in activation feature space. Initial observations validate our hypothesis from generic pre-trained CNN activation maps and a bare-bones memory-based classification scheme, with no training or tuning. Training from scratch using combined one-hot + U(1) loss improves classification for all tasks tested including ImageNet.

updated: Wed Aug 31 2022 14:35:09 GMT+0000 (UTC)

published: Sun Jun 05 2022 16:54:04 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト