One Loss for Quantization: Deep Hashing with Discrete Wasserstein Distributional Matching

Khoa D. Doan; Peng Yang; Ping Li

量子化の1つの損失：離散ワッサーシュタイン分布マッチングによるディープハッシュ

画像ハッシュは、画像の大規模なコレクション内のクエリに類似したアイテムを見つけるための原則的な近似最近傍アプローチです。ハッシュは、画像をバイナリベクトルにマッピングするバイナリ出力関数を学習することを目的としています。最適な検索パフォーマンスを得るには、学習段階の連続緩和と推論段階の離散量子化の間のギャップを埋めるために、量子化誤差の少ないバランスの取れたハッシュコードを生成することが重要です。ただし、既存の詳細な監視付きハッシュ手法では、コーディングバランスと低量子化誤差を実現するのは困難であり、いくつかの損失が伴います。これは、これらの方法の既存の量子化アプローチがヒューリスティックに構築されており、これらの目的を達成するのに効果的ではないためであると主張します。このホワイトペーパーでは、量子化の制約を学習するための代替アプローチについて検討します。量子化誤差の少ないバランスの取れたコードを学習するタスクは、連続コードの学習された分布を事前定義された離散的で均一な分布に一致させるものとして再定式化されます。これは、2つの分布間の距離を最小化することと同じです。次に、ハッシュ関数の離散特性を利用して、計算効率の高い分布距離を提案します。この分布距離は有効な距離であり、時間とサンプルの複雑さが少なくなります。提案された単一損失量子化目標は、コードバランスと量子化誤差を改善するために、既存の監視付きハッシュ法に統合することができます。実験により、提案されたアプローチがいくつかの代表的なハッシュ手法のパフォーマンスを大幅に改善することが確認されています。

Image hashing is a principled approximate nearest neighbor approach to find similar items to a query in a large collection of images. Hashing aims to learn a binary-output function that maps an image to a binary vector. For optimal retrieval performance, producing balanced hash codes with low-quantization error to bridge the gap between the learning stage's continuous relaxation and the inference stage's discrete quantization is important. However, in the existing deep supervised hashing methods, coding balance and low-quantization error are difficult to achieve and involve several losses. We argue that this is because the existing quantization approaches in these methods are heuristically constructed and not effective to achieve these objectives. This paper considers an alternative approach to learning the quantization constraints. The task of learning balanced codes with low quantization error is re-formulated as matching the learned distribution of the continuous codes to a pre-defined discrete, uniform distribution. This is equivalent to minimizing the distance between two distributions. We then propose a computationally efficient distributional distance by leveraging the discrete property of the hash functions. This distributional distance is a valid distance and enjoys lower time and sample complexities. The proposed single-loss quantization objective can be integrated into any existing supervised hashing method to improve code balance and quantization error. Experiments confirm that the proposed approach substantially improves the performance of several representative hashing~methods.

updated: Tue May 31 2022 12:11:17 GMT+0000 (UTC)

published: Tue May 31 2022 12:11:17 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト