Deep Recurrent Quantization for Generating Sequential Binary Codes

Jingkuan Song; Xiaosu Zhu; Lianli Gao; Xin-Shun Xu; Wu Liu; Heng Tao Shen

シーケンシャルバイナリコードを生成するためのディープリカレント量子化

量子化は、その高精度と高速検索速度により、ANN（近似最近傍）検索で効果的なテクノロジーです。さまざまなアプリケーションの要件を満たすために、可変コード長に反映される検索精度と速度の間には常にトレードオフがあります。ただし、データセットを異なるコード長にエンコードするには、既存のメソッドで複数のモデルをトレーニングする必要があり、各モデルは特定のコード長しか生成できません。これにはかなりのトレーニング時間コストがかかり、実際のアプリケーションに展開される量子化手法の柔軟性が大幅に低下します。この問題に対処するために、シーケンシャルバイナリコードを生成できるDeep Recurrent Quantization（DRQ）アーキテクチャを提案します。最後に、モデルをトレーニングすると、バイナリコードのシーケンスを生成でき、反復の繰り返し回数を調整することでコードの長さを簡単に制御できます。共有コードブックとスカラーファクターは、ディープリカレント量子化ブロックで学習可能な重みになるように設計されており、フレームワーク全体をエンドツーエンドでトレーニングできます。私たちの知る限り、これは一度トレーニングしてシーケンシャルバイナリコードを生成できる最初の量子化方法です。ベンチマークデータセットの実験結果は、私たちのモデルが最先端の画像検索と比較して同等またはそれ以上のパフォーマンスを達成していることを示しています。ただし、必要なパラメーターの数とトレーニング時間は大幅に少なくなります。私たちのコードはオンラインで公開されています：https：//github.com/cfm-uestc/DRQ。

Quantization has been an effective technology in ANN (approximate nearest neighbour) search due to its high accuracy and fast search speed. To meet the requirement of different applications, there is always a trade-off between retrieval accuracy and speed, reflected by variable code lengths. However, to encode the dataset into different code lengths, existing methods need to train several models, where each model can only produce a specific code length. This incurs a considerable training time cost, and largely reduces the flexibility of quantization methods to be deployed in real applications. To address this issue, we propose a Deep Recurrent Quantization (DRQ) architecture which can generate sequential binary codes. To the end, when the model is trained, a sequence of binary codes can be generated and the code length can be easily controlled by adjusting the number of recurrent iterations. A shared codebook and a scalar factor is designed to be the learnable weights in the deep recurrent quantization block, and the whole framework can be trained in an end-to-end manner. As far as we know, this is the first quantization method that can be trained once and generate sequential binary codes. Experimental results on the benchmark datasets show that our model achieves comparable or even better performance compared with the state-of-the-art for image retrieval. But it requires significantly less number of parameters and training times. Our code is published online: https://github.com/cfm-uestc/DRQ.

updated: Sat Dec 05 2020 03:18:56 GMT+0000 (UTC)

published: Sun Jun 16 2019 14:28:25 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト