Double-Bit Quantization for Hashing

Authors

  • Weihao Kong Shanghai Jiao Tong University
  • Wu-Jun Li Shanghai Jiao Tong University

DOI:

https://doi.org/10.1609/aaai.v26i1.8208

Abstract

Hashing, which tries to learn similarity-preserving binary codes for data representation, has been widely used for efficient nearest neighbor search in massive databases due to its fast query speed and low storage cost. Because it is NP hard to directly compute the best binary codes for a given data set, mainstream hashing methods typically adopt a two-stage strategy. In the first stage, several projected dimensions of real values are generated. Then in the second stage, the real values will be quantized into binary codes by thresholding. Currently, most existing methods use one single bit to quantize each projected dimension. One problem with this single-bit quantization (SBQ) is that the threshold typically lies in the region of the highest point density and consequently a lot of neighboring points close to the threshold will be hashed to totally different bits, which is unexpected according to the principle of hashing. In this paper, we propose a novel quantization strategy, called double-bit quantization (DBQ), to solve the problem of SBQ. The basic idea of DBQ is to quantize each projected dimension into double bits with adaptively learned thresholds. Extensive experiments on two real data sets show that our DBQ strategy can significantly outperform traditional SBQ strategy for hashing.

Downloads

Published

2021-09-20

How to Cite

Kong, W., & Li, W.-J. (2021). Double-Bit Quantization for Hashing. Proceedings of the AAAI Conference on Artificial Intelligence, 26(1), 634-640. https://doi.org/10.1609/aaai.v26i1.8208