DOI: 10.5555/846218.847242

The Hybrid Tree: An Index Structure for High Dimensional Feature Spaces

Published: 23 March 1999

Abstract

Feature-based similarity search is emerging as an important search paradigm in database systems. The technique maps the data items to points in a high dimensional feature space, which is indexed using a multidimensional data structure. Similarity search then corresponds to a range search over the data structure. Although several data structures have been proposed for feature indexing, none of them is known to scale beyond 10-15 dimensional spaces. This paper introduces the hybrid tree, a multidimensional data structure for indexing high dimensional feature spaces. Unlike other multidimensional data structures, the hybrid tree cannot be classified as either a pure data partitioning (DP) index structure (e.g., R-tree, SS-tree, SR-tree) or a pure space partitioning (SP) one (e.g., KDB-tree, hB-tree); rather, it "combines" positive aspects of the two types of index structures in a single data structure to achieve search performance that scales better to high dimensionalities than either of the above techniques (hence the name "hybrid"). Furthermore, unlike many data structures (e.g., distance-based index structures like the SS-tree and SR-tree), the hybrid tree can support queries based on arbitrary distance functions. Our experiments on "real" high dimensional, large feature databases demonstrate that the hybrid tree scales well to high dimensionality and large database sizes. For large databases, it significantly outperforms both purely DP-based and SP-based index mechanisms, as well as linear scan, at all dimensionalities.
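To make the query model in the abstract concrete, the following is a minimal Python sketch of similarity search as a range search over feature vectors, with the distance function passed in as a parameter. It is a linear scan, i.e., the baseline the abstract compares against, not the hybrid tree itself. The names `minkowski`, `range_search`, and `features`, and the sample data, are assumptions made for illustration and do not come from the paper.

```python
from typing import Callable, List, Sequence

Point = Sequence[float]
Distance = Callable[[Point, Point], float]


def minkowski(p: float) -> Distance:
    """Return an L_p distance function (p=1 is Manhattan, p=2 is Euclidean)."""
    def dist(a: Point, b: Point) -> float:
        return sum(abs(x - y) ** p for x, y in zip(a, b)) ** (1.0 / p)
    return dist


def range_search(points: Sequence[Point], query: Point,
                 radius: float, dist: Distance) -> List[int]:
    """Linear-scan baseline: indices of all points within `radius` of `query`.

    An index structure would aim to return the same answer while
    examining far fewer points.
    """
    return [i for i, pt in enumerate(points) if dist(pt, query) <= radius]


# Hypothetical 3-dimensional feature vectors, one per data item.
features = [(0.0, 0.0, 0.0), (1.0, 1.0, 1.0), (0.2, 0.1, 0.0), (3.0, 0.0, 0.0)]
query = (0.0, 0.0, 0.0)

# The same radius selects different items under different distance functions.
print(range_search(features, query, 2.0, minkowski(2)))  # Euclidean
print(range_search(features, query, 2.0, minkowski(1)))  # Manhattan
```

Because the distance function is an argument rather than something built into the search, the same routine serves any metric the caller supplies, which is the kind of flexibility the abstract claims for the hybrid tree.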

Published In

ICDE '99: Proceedings of the 15th International Conference on Data Engineering
March 1999
ISBN: 0769500714

Publisher

IEEE Computer Society

United States

Author Tags

  1. high dimensional feature spaces
  2. multidimensional indexing
  3. multimedia
  4. similarity queries

Qualifiers

  • Article

Cited By

  • (2018) Randomized Algorithms Accelerated over CPU-GPU for Ultra-High Dimensional Similarity Search. Proceedings of the 2018 International Conference on Management of Data, pp. 889-903. DOI: 10.1145/3183713.3196925. Online publication date: 27-May-2018
  • (2016) An Index Scheme for Similarity Search on Cloud Computing using MapReduce over Docker Container. Proceedings of the 10th International Conference on Ubiquitous Information Management and Communication, pp. 1-6. DOI: 10.1145/2857546.2857607. Online publication date: 4-Jan-2016
  • (2015) EMIF. Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, pp. 543-546. DOI: 10.1145/2671188.2749346. Online publication date: 22-Jun-2015
  • (2012) Indexing RFID data using the VG-curve. Proceedings of the Twenty-Third Australasian Database Conference - Volume 124, pp. 117-126. DOI: 10.5555/2483739.2483754. Online publication date: 31-Jan-2012
  • (2012) Time-series data mining. ACM Computing Surveys 45:1, pp. 1-34. DOI: 10.1145/2379776.2379788. Online publication date: 7-Dec-2012
  • (2011) Variable granularity space filling curve for indexing multidimensional data. Proceedings of the 15th international conference on Advances in databases and information systems, pp. 111-124. DOI: 10.5555/2041746.2041759. Online publication date: 20-Sep-2011
  • (2009) A revised r*-tree in comparison with related index structures. Proceedings of the 2009 ACM SIGMOD International Conference on Management of data, pp. 799-812. DOI: 10.1145/1559845.1559929. Online publication date: 29-Jun-2009
  • (2009) The C-ND tree. Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, pp. 462-471. DOI: 10.1145/1516360.1516414. Online publication date: 24-Mar-2009
  • (2007) The MM-tree. Proceedings of the 11th East European conference on Advances in databases and information systems, pp. 157-171. DOI: 10.5555/1780119.1780138. Online publication date: 29-Sep-2007
  • (2007) The Concentration of Fractional Distances. IEEE Transactions on Knowledge and Data Engineering 19:7, pp. 873-886. DOI: 10.1109/TKDE.2007.1037. Online publication date: 1-Jul-2007
