Towards glass-box CNNs

Piduguralla Manaswini; Jignesh S. Bhatt

グラスボックス CNN に向けて

畳み込みニューラルネットワーク (CNN) は、視覚的に複雑なタスクをトレーニングして再学習する能力で人気のある脳にヒントを得たアーキテクチャです。増分的でスケーラブルです。ただし、CNN はほとんどがブラックボックスとして扱われ、何度も試行錯誤を繰り返します。 CNN は、最先端のパフォーマンスを達成するのに役立つ強力な内部表現を構築することがわかります。ここでは、2 クラスの画像分類問題に対して、3 層のグラスボックス (分析) CNN を提案します。 1 つ目は、入力画像のクラス情報 (群不変) と対称変換 (群同変) の両方を含む表現層です。次に、次元削減層 (PCA) を通過します。最後に、コンパクトでありながら完全な表現が分類子に提供されます。分析機械学習分類器と多層パーセプトロンを使用して、感度を評価します。提案されたグラスボックス CNN は、より良い理解と結果の普及のために、AlexNet (CNN) 内部表現の等分散と比較されます。将来的には、マルチクラスの視覚的に複雑なタスクのためのグラスボックス CNN を構築したいと考えています。

Convolution neural networks (CNNs) are brain-inspired architectures popular for their ability to train and relearn visually complex tasks. It is incremental and scalable; however, CNN is mostly treated as black-box and involves multiple trial & error runs. We observe that CNN constructs powerful internal representations that help achieve state-of-the-art performance. Here we propose three layer glass-box (analytical) CNN for two-class image classifcation problems. First is a representation layer that encompasses both the class information (group invariant) and symmetric transformations (group equivariant) of input images. It is then passed through dimension reduction layer (PCA). Finally the compact yet complete representation is provided to a classifer. Analytical machine learning classifers and multilayer perceptrons are used to assess sensitivity. Proposed glass-box CNN is compared with equivariance of AlexNet (CNN) internal representation for better understanding and dissemination of results. In future, we would like to construct glass-box CNN for multiclass visually complex tasks.

updated: Thu Nov 03 2022 12:04:43 GMT+0000 (UTC)

published: Mon Jan 11 2021 15:00:35 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト