Is Quantized ANN Search Cursed? Case Study of Quantifying Search and Index Quality

Guðmundsson, Gylfi Þór; Jónsson, Björn Þór

doi:10.1007/978-3-031-46994-7_14

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14289)

Included in the following conference series: International Conference on Similarity Search and Applications

Abstract

Traditional evaluation of an approximate high-dimensional index typically consists of running a benchmark with known ground truth, analyzing the performance in terms of traditional result quality and latency measures, and then comparing those measures to competing index structures. Such analysis can give an overall indication of the suitability of the index for the application that the benchmark represents. When the index inevitably fails to return the sought items for some queries, however, this methodology does not help to explain why the index fails in those cases. Furthermore, when considering many different parameter settings, the process of repeatedly indexing the entire collection is prohibitively time-consuming. In this paper, we define three causes for failures in hierarchical quantized search. We show that the two failure cases that relate to the index can be evaluated and quantified using only the index structure and ground-truth data. In our evaluation, we use eCP, a lightweight algorithm that builds the index hierarchy top-down a priori without any costly segmentation of the dataset, and show that significant insight can be gained into the quality of the index structure, or lack thereof.
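The idea of diagnosing an index from its structure and ground truth alone can be illustrated with a small sketch. This is not the authors' code and it does not reproduce the three failure causes defined in the paper, which the abstract does not spell out. As assumptions for illustration only, it uses a single-level index with randomly chosen representatives (eCP itself builds a multi-level hierarchy), brute-force ground truth, and hypothetical names such as `build_index`, `diagnose`, and the search-expansion parameter `b`. It counts the queries whose true nearest neighbour sits in a cluster the search would never scan.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest(point, centers, k=1):
    """Return the indices of the k centers closest to point."""
    order = sorted(range(len(centers)), key=lambda i: sq_dist(point, centers[i]))
    return order[:k]

def build_index(data, n_clusters, rng):
    """Pick random representatives, then assign each point to its nearest one."""
    reps = rng.sample(data, n_clusters)
    assignment = [nearest(p, reps)[0] for p in data]
    return reps, assignment

def diagnose(queries, data, reps, assignment, b):
    """Count queries whose true nearest neighbour lies in a cluster
    that a search scanning the b closest clusters would skip."""
    missed = 0
    for q in queries:
        # Brute-force ground truth: index of the true nearest neighbour.
        true_nn = min(range(len(data)), key=lambda i: sq_dist(q, data[i]))
        scanned = set(nearest(q, reps, k=b))
        if assignment[true_nn] not in scanned:
            missed += 1
    return missed

# Synthetic data, assumed purely for illustration.
rng = random.Random(0)
data = [[rng.random() for _ in range(8)] for _ in range(500)]
queries = [[rng.random() for _ in range(8)] for _ in range(50)]

reps, assignment = build_index(data, n_clusters=20, rng=rng)
misses_by_b = {b: diagnose(queries, data, reps, assignment, b) for b in (1, 5, 20)}
print(misses_by_b)
```

Because the diagnostic needs only the representatives, the cluster assignment, and the ground truth, it can be rerun for different values of `b` without rebuilding the index. With `b` equal to the number of clusters every cluster is scanned, so the miss count drops to zero.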

Author information

Authors and Affiliations

Reykjavik University, Menntavegur 1, 102 Reykjavik, Iceland
Gylfi Þór Guðmundsson & Björn Þór Jónsson

Corresponding author

Correspondence to Gylfi Þór Guðmundsson.

Editor information

Editors and Affiliations

University of A Coruña, Coruña, Spain
Oscar Pedreira
Pompeu Fabra University, Barcelona, Spain
Vladimir Estivill-Castro

About this paper

Cite this paper

Guðmundsson, G.Þ., Jónsson, B.Þ. (2023). Is Quantized ANN Search Cursed? Case Study of Quantifying Search and Index Quality. In: Pedreira, O., Estivill-Castro, V. (eds) Similarity Search and Applications. SISAP 2023. Lecture Notes in Computer Science, vol 14289. Springer, Cham. https://doi.org/10.1007/978-3-031-46994-7_14

DOI: https://doi.org/10.1007/978-3-031-46994-7_14
Published: 27 October 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-46993-0
Online ISBN: 978-3-031-46994-7
eBook Packages: Computer Science, Computer Science (R0)
