Cross-Scale Context Extracted Hashing for Fine-Grained Image Binary Encoding

Xuetong Xue; Jiaying Shi; Xinxue He; Shenghui Xu; Zhaoming Pan

Deep hashing encodes high-dimensional image data into binary codes and has been widely applied to large-scale image retrieval because of its efficient computation and low storage cost. Since binary codes carry less information than floating-point features, the essence of binary encoding is to preserve the main context so that retrieval quality is maintained. However, existing hashing methods are limited in two respects: they do not suppress redundant background information well, and they rely on a simple sign function that does not encode accurately from Euclidean space to Hamming space. To address these problems, this paper proposes the Cross-Scale Context Extracted Hashing Network (CSCE-Net). First, we design a two-branch framework that captures fine-grained local information while maintaining high-level global semantic information. In addition, an Attention-guided Information Extraction (AIE) module is introduced between the two branches; working together with global sliding windows, it suppresses areas of low context information. Unlike previous methods, CSCE-Net learns a content-related Dynamic Sign Function (DSF) to replace the original simple sign function. The proposed CSCE-Net is therefore context-sensitive and performs well on accurate image binary encoding. We further demonstrate that CSCE-Net is superior to existing hashing methods, improving retrieval performance on standard benchmarks.
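For readers unfamiliar with the baseline the abstract contrasts against, the following is a minimal sketch of hashing with a simple sign function and ranking by Hamming distance. It is not the CSCE-Net method: the abstract does not specify how the DSF or the AIE module is computed, so neither is shown. The function names (`sign_binarize`, `hamming_distance`, `rank_by_hamming`), the convention that non-negative values map to bit 1, and the toy feature vectors are all assumptions made for illustration.

```python
from typing import List, Sequence


def sign_binarize(features: Sequence[float]) -> List[int]:
    """Simple sign-function encoding: bit 1 for non-negative values, else 0.

    The 0/1 convention and the handling of exact zeros are assumptions.
    """
    return [1 if x >= 0 else 0 for x in features]


def hamming_distance(a: Sequence[int], b: Sequence[int]) -> int:
    """Count the positions at which two equal-length binary codes differ."""
    if len(a) != len(b):
        raise ValueError("binary codes must have the same length")
    return sum(x != y for x, y in zip(a, b))


def rank_by_hamming(query_code: Sequence[int],
                    database_codes: Sequence[Sequence[int]]) -> List[int]:
    """Return database indices ordered from nearest to farthest in Hamming space."""
    return sorted(
        range(len(database_codes)),
        key=lambda i: hamming_distance(query_code, database_codes[i]),
    )


# Toy real-valued features (hypothetical; a real system would take these
# from the last layer of a trained network).
query_features = [0.9, -0.2, 0.4, -0.7]
database_features = [
    [-0.3, 0.7, -0.2, 0.1],   # every sign differs from the query
    [0.8, -0.1, 0.3, -0.5],   # same signs as the query
    [-0.6, 0.2, 0.5, -0.9],   # two signs differ from the query
]

query_code = sign_binarize(query_features)
database_codes = [sign_binarize(f) for f in database_features]
ranking = rank_by_hamming(query_code, database_codes)
print(ranking)  # prints [1, 2, 0]
```

Note that the sign function discards magnitude: a feature of 0.01 and a feature of 0.9 receive the same bit. This is the kind of information loss that motivates replacing the fixed threshold with a learned, content-dependent one.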

updated: Fri Oct 14 2022 06:52:40 GMT+0000 (UTC)

published: Fri Oct 14 2022 06:52:40 GMT+0000 (UTC)

arXiv
