Neural Similarity Learning

Weiyang Liu; Zhen Liu; James M. Rehg; Le Song

ニューラル類似性学習

内積ベースの畳み込みは、畳み込みニューラルネットワーク（CNN）の基礎であり、視覚表現のエンドツーエンドの学習を可能にします。内積を双一次行列で一般化することにより、CNNの学習可能なパラメトリック類似性尺度として機能するニューラル類似性を提案します。神経の類似性は自然に畳み込みを一般化し、柔軟性を高めます。さらに、トレーニングデータから適応的にニューラルの類似性を学習するために、ニューラルの類似性学習（NSL）を検討します。具体的には、ニューラルの類似性を学習する2つの異なる方法、静的NSLと動的NSLを提案します。興味深いことに、動的なニューラル類似性により、CNNは動的な推論ネットワークになります。双線形行列を正則化することにより、NSLはカーネルの形状と類似性の測定を同時に学習していると見なすことができます。理論的な観点からNSLの有効性をさらに正当化します。最も重要なことは、NSLが視覚認識と少数ショット学習で有望なパフォーマンスを示しており、内積ベースの畳み込みカウンターパートに対するNSLの優位性を検証していることです。

Inner product-based convolution has been the founding stone of convolutional neural networks (CNNs), enabling end-to-end learning of visual representation. By generalizing inner product with a bilinear matrix, we propose the neural similarity which serves as a learnable parametric similarity measure for CNNs. Neural similarity naturally generalizes the convolution and enhances flexibility. Further, we consider the neural similarity learning (NSL) in order to learn the neural similarity adaptively from training data. Specifically, we propose two different ways of learning the neural similarity: static NSL and dynamic NSL. Interestingly, dynamic neural similarity makes the CNN become a dynamic inference network. By regularizing the bilinear matrix, NSL can be viewed as learning the shape of kernel and the similarity measure simultaneously. We further justify the effectiveness of NSL with a theoretical viewpoint. Most importantly, NSL shows promising performance in visual recognition and few-shot learning, validating the superiority of NSL over the inner product-based convolution counterparts.

updated: Fri Dec 06 2019 10:39:39 GMT+0000 (UTC)

published: Mon Oct 28 2019 23:06:56 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト