Deep Momentum Uncertainty Hashing

Chaoyou Fu; Guoli Wang; Xiang Wu; Qian Zhang; Ran He

Combinatorial optimization (CO) has been a hot research topic because of its theoretical and practical importance. As a classic CO problem, deep hashing aims to find an optimal code for each data point from a finite set of discrete possibilities, but this discrete nature poses a major challenge for optimization. Previous methods usually mitigate this challenge by binary approximation, replacing binary codes with real values via activation functions or regularization. However, such approximation introduces uncertainty between the real values and the binary ones, which degrades retrieval performance. In this paper, we propose a novel Deep Momentum Uncertainty Hashing (DMUH) method. It explicitly estimates the uncertainty during training and leverages the uncertainty information to guide the approximation process. Specifically, we model bit-level uncertainty by measuring the discrepancy between the output of a hashing network and that of a momentum-updated network. The discrepancy of each bit indicates the uncertainty of the hashing network about its approximate output for that bit. Meanwhile, the mean discrepancy over all bits in a hash code can be regarded as image-level uncertainty, which reflects the uncertainty of the hashing network about the corresponding input image. Hash bits and images with higher uncertainty receive more attention during optimization. To the best of our knowledge, this is the first work to study uncertainty in hash bits. Extensive experiments on four datasets (CIFAR-10, NUS-WIDE, MS-COCO, and the million-scale dataset Clothing1M) verify the superiority of our method. Our method achieves the best performance on all of the datasets and surpasses existing state-of-the-art methods by a large margin.
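
The uncertainty estimate described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names, the use of an absolute difference as the "discrepancy", the exponential-moving-average form of the momentum update, and the `1 + u` weighting are all assumptions made for illustration, and plain Python lists stand in for network parameters and outputs.

```python
from typing import List


def momentum_update(momentum_params: List[float],
                    params: List[float],
                    m: float = 0.9) -> List[float]:
    """Assumed momentum rule: exponential moving average of the
    hashing network's parameters (hypothetical helper)."""
    return [m * pm + (1.0 - m) * p for pm, p in zip(momentum_params, params)]


def bit_uncertainty(h: List[float], h_momentum: List[float]) -> List[float]:
    """Bit-level uncertainty, taken here as the absolute discrepancy
    between the two networks' outputs for each bit (an assumption)."""
    return [abs(a - b) for a, b in zip(h, h_momentum)]


def image_uncertainty(bit_unc: List[float]) -> float:
    """Image-level uncertainty: mean discrepancy over all bits."""
    return sum(bit_unc) / len(bit_unc)


# Toy real-valued outputs for one image with a 4-bit code.
h = [0.5, -0.25, 1.0, -1.0]           # hashing network output
h_momentum = [0.25, -0.25, 0.5, -1.0]  # momentum-updated network output

u_bits = bit_uncertainty(h, h_momentum)
u_image = image_uncertainty(u_bits)

# Hypothetical weighting: bits with higher uncertainty get a larger
# weight in a per-bit loss term. The real weighting scheme may differ.
bit_weights = [1.0 + u for u in u_bits]

print(u_bits, u_image, bit_weights)
```

In this toy case the first and third bits disagree between the two networks, so they would receive more weight than the two bits on which the networks agree.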

updated: Tue Jul 13 2021 07:25:50 GMT+0000 (UTC)

published: Thu Sep 17 2020 01:57:45 GMT+0000 (UTC)

arXiv
