Sparse-Inductive Generative Adversarial Hashing for Nearest Neighbor Search

Hong Liu

最近傍検索のためのスパース帰納的敵対的生成ハッシュ

教師なしハッシュは、過去 10 年間にわたって広範な研究に焦点を当ててきました。通常、その目的は、ハミング空間で事前に定義されたメトリック (ユークリッドメトリック) を保存することです。この目的を達成するために、既存のハッシュの符号化関数は通常、準等長関数であり、ターゲット計量空間から離散ハミング空間への量子化損失を削減することに専念しています。しかし、前述の 2 つの計量空間は不均一であり、準等長マッピングは非線形であるため、このような誤差を直接最小化することには実際問題があります。前者は機能分布の不一致につながり、後者は最適化に問題が生じます。この論文では、大規模な高次元特徴をバイナリコードにエンコードするための、スパース性誘導生成敵対的ハッシュ (SiGAH) と呼ばれる新しい教師なしハッシュ手法を提案します。これは、敵対的生成トレーニングフレームワークを通じて 2 つの問題をうまく解決します。量子化損失を最小限に抑える代わりに、私たちの主な革新は、学習されたハミング空間が生成モデルを介してターゲット計量空間と同様のデータ分布を持つように強制することにあります。特に、バイナリコードを出力するジェネレーターとして ReLU ベースのニューラルネットワークを定式化し、識別子として MSE 損失ベースの自動エンコーダーネットワークを定式化し、その上で敵対的生成学習を実行してハッシュ関数を訓練します。さらに、ハッシュコードから合成特徴を生成するために、圧縮センシング手順が生成モデルに導入され、バイナリコードの再構築境界が元の特徴の境界と一致するように強制されます。最後に、このような敵対生成フレームワークは、Adam オプティマイザーを介してトレーニングできます。 4 つのベンチマーク、つまり Tiny100K、GIST1M、Deep1M、および MNIST に関する実験結果は、提案された SiGAH が最先端のアプローチよりも優れたパフォーマンスを備えていることを示しています。

Unsupervised hashing has received extensive research focus on the past decade, which typically aims at preserving a predefined metric (i.e. Euclidean metric) in the Hamming space. To this end, the encoding functions of the existing hashing are typically quasi-isometric, which devote to reducing the quantization loss from the target metric space to the discrete Hamming space. However, it is indeed problematic to directly minimize such error, since such mentioned two metric spaces are heterogeneous, and the quasi-isometric mapping is non-linear. The former leads to inconsistent feature distributions, while the latter leads to problematic optimization issues. In this paper, we propose a novel unsupervised hashing method, termed Sparsity-Induced Generative Adversarial Hashing (SiGAH), to encode large-scale high-dimensional features into binary codes, which well solves the two problems through a generative adversarial training framework. Instead of minimizing the quantization loss, our key innovation lies in enforcing the learned Hamming space to have similar data distribution to the target metric space via a generative model. In particular, we formulate a ReLU-based neural network as a generator to output binary codes and an MSE-loss based auto-encoder network as a discriminator, upon which a generative adversarial learning is carried out to train hash functions. Furthermore, to generate the synthetic features from the hash codes, a compressed sensing procedure is introduced into the generative model, which enforces the reconstruction boundary of binary codes to be consistent with that of original features. Finally, such generative adversarial framework can be trained via the Adam optimizer. Experimental results on four benchmarks, i.e., Tiny100K, GIST1M, Deep1M, and MNIST, have shown that the proposed SiGAH has superior performance over the state-of-the-art approaches.

updated: Mon Jun 12 2023 08:07:23 GMT+0000 (UTC)

published: Mon Jun 12 2023 08:07:23 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト