Deep Metric Learning with Density Adaptivity

Yehao Li; Ting Yao; Yingwei Pan; Hongyang Chao; Tao Mei

密度適応性を備えたディープメトリックラーニング

距離メトリック学習の問題は、埋め込みスペースを学習する観点から主に考慮されます。埋め込みスペースでは、例のペア間の距離は類似度メトリックに対応しています。畳み込みニューラルネットワーク（CNN）の台頭と成功により、ディープメトリックラーニング（DML）では、ネットワークをトレーニングして、埋め込み空間への非線形変換を学習します。既存のDMLアプローチは、多くの場合、クラス間の距離を最大化し、クラス内の変動を最小化することで監督を表現します。ただし、特に各クラスのトレーニング例がしっかりと組み込まれ、各クラスの密度が非常に高い場合、結果には過剰適合の問題が生じる可能性があります。この論文では、密度、つまり表現内のデータ集中の測定値をDMLフレームワークの最適化に統合して、エンドツーエンドの方法でアーキテクチャをトレーニングすることにより、クラス間の類似性とクラス内の変動をバランスよく調整します。技術的には、密度の知識は正則化として使用されます。正則化は、コントラスト損失、Nペア損失、トリプレット損失などのさまざまな目的関数を持つDMLアーキテクチャにプラグインできます。 3つのパブリックデータセットに対する広範な実験は、3種類の埋め込みを密度適応性で修正することにより、明確な改善を一貫して実証しています。さらに驚くべきことに、Cars196、CUB-200-2011、Stanford Online ProductsデータセットのRecall @ 1は、それぞれ67.95％から77.62％、52.01％から55.64％、68.20％から70.56％に増加しています。

The problem of distance metric learning is mostly considered from the perspective of learning an embedding space, where the distances between pairs of examples are in correspondence with a similarity metric. With the rise and success of Convolutional Neural Networks (CNN), deep metric learning (DML) involves training a network to learn a nonlinear transformation to the embedding space. Existing DML approaches often express the supervision through maximizing inter-class distance and minimizing intra-class variation. However, the results can suffer from overfitting problem, especially when the training examples of each class are embedded together tightly and the density of each class is very high. In this paper, we integrate density, i.e., the measure of data concentration in the representation, into the optimization of DML frameworks to adaptively balance inter-class similarity and intra-class variation by training the architecture in an end-to-end manner. Technically, the knowledge of density is employed as a regularizer, which is pluggable to any DML architecture with different objective functions such as contrastive loss, N-pair loss and triplet loss. Extensive experiments on three public datasets consistently demonstrate clear improvements by amending three types of embedding with the density adaptivity. More remarkably, our proposal increases Recall@1 from 67.95% to 77.62%, from 52.01% to 55.64% and from 68.20% to 70.56% on Cars196, CUB-200-2011 and Stanford Online Products dataset, respectively.

updated: Mon Sep 09 2019 15:04:26 GMT+0000 (UTC)

published: Mon Sep 09 2019 15:04:26 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト