Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Towards a Painless Index for Spatial Objects

Published: 07 October 2014 Publication History
  • Get Citation Alerts
  • Abstract

    Conventional spatial indexes, represented by the R-tree, employ multidimensional tree structures that are complicated and require enormous efforts to implement in a full-fledged database management system (DBMS). An alternative approach for supporting spatial queries is mapping-based indexing, which maps both data and queries into a one-dimensional space such that data can be indexed and queries can be processed through a one-dimensional indexing structure such as the B+. Mapping-based indexing requires implementing only a few mapping functions, incurring much less effort in implementation compared to conventional spatial index structures. Yet, a major concern about using mapping-based indexes is their lower efficiency than conventional tree structures.
    In this article, we propose a mapping-based spatial indexing scheme called Size Separation Indexing (SSI). SSI is equipped with a suite of techniques including size separation, data distribution transformation, and more efficient mapping algorithms. These techniques overcome the drawbacks of existing mapping-based indexes and significantly improve the efficiency of query processing. We show through extensive experiments that, for window queries on spatial objects with nonzero extents, SSI has two orders of magnitude better performance than existing mapping-based indexes and competitive performance to the R-tree as a standalone implementation. We have also implemented SSI on top of two off-the-shelf DBMSs, PostgreSQL and a commercial platform, both having R-tree implementation. In this case, SSI is up to two orders of magnitude faster than their provided spatial indexes. Therefore, we achieve a spatial index more efficient than the R-tree in a DBMS implementation that is at the same time easy to implement. This result may upset a common perception that has existed for a long time in this area that the R-tree is the best choice for indexing spatial objects.

    References

    [1]
    W. G. Aref and I. F. Ilyas. 2001. Sp-gist: An extensible database index for supporting space partitioning trees. J. Intell. Inf. Syst. 17, 2--3, 215--240.
    [2]
    M. Arya, W. F. Cody, C. Faloutsos, J. Richardson, and A. Toya. 1994. Qbism: Extending a dbms to support 3d medical images. In Proceedings of the 10th International Conference on Data Engineering (ICDE'94). 314--325.
    [3]
    N. Beckmann, H.-P. Kriegel, R. Schneider, and B. Seeger. 1990. The R*-tree: An efficient and robust access method for points and rectangles. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD'90). 322--331.
    [4]
    J. L. Bentley. 1975. Multidimensional binary search trees used for associative searching. Comm. ACM 18, 9, 509--517.
    [5]
    S. Berchtold, C. Bohm, and H.-P. Kriegel. 1998. The pyramid-technique: Towards breaking the curse of dimensionality. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD'98). 142--153.
    [6]
    C. Bohm, G. Klump, and H.-P. Kriegel. 1999. XZ-ordering: A space-filling curve for objects with spatial extension. In Proceedings of the 6th International Symposium on Advances in Spatial Databases (SSD'99). 75--90.
    [7]
    A. Butz. 1971. Alternative algorithm for hilbert's space-filling curve. IEEE Trans. Comput. 100, 20, 424--426.
    [8]
    D. Comer. 1979. The ubiquitous b-tree. ACM Comput. Surv. 11, 2, 121--137.
    [9]
    C. Faloutsos and Y. Rong. 1991. DOT: A spatial access method using fractals. In Proceedings of the 7th International Conference on Data Engineering (ICDE'91). 152--159.
    [10]
    C. Faloutsos and S. Roseman. 1989. Fractals for secondary key retrieval. In Proceedings of the 8th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS'89). 247--252.
    [11]
    R. A. Finkel and J. L. Bentley. 1974. Quad trees: A data structure for retrieval on composite keys. Acta Informatica 4, 1, 1--9.
    [12]
    V. Gaede and O. Gunther 1998. Multidimensional access methods. ACM Comput. Surv. 30, 2, 170--231.
    [13]
    Google. 2012. Google search. http://en.wikipedia.org/wiki/GoogleSearch.
    [14]
    A. Guttman. 1984. R-trees: A dynamic index structure for spatial searching. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD'84). 47--57.
    [15]
    J. M. Hellerstein, J. F. Naughton, and A. Pfeffer. 1995. Generalized search trees for database systems. In Proceedings of the 21st International Conference on Very Large Data Bases (VLDB'95). 562--573.
    [16]
    H. Jagadish, B. C. Ooi, K.-L. Tan, C. Yu, and R. Zhang. 2005. iDistance: An adaptive B+-tree based indexing method for nearest neighbor search. ACM Trans. Database Syst. 30, 2, 364--397.
    [17]
    C. S. Jensen, D. Lin, B. C. Ooi, and R. Zhang. 2006. Effective density queries on continuously moving objects. In Proceedings of the 22nd International Conference on Data Engineering (ICDE'06). 71.
    [18]
    N. Koudas, B. C. Ooi, K.-L. Tan, and R. Zhang. 2004. Approximate nn queries on streams with guaranteed error/performance bounds. In Proceedings of the 13th International Conference on Very Large Data Bases (VLDB'04), vol. 30. 804--815.
    [19]
    N. Koudas and K. C. Sevcik. 1997. Size separation spatial join. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD'97). 324--335.
    [20]
    J. K. Lawder and P. J. H. King. 2001. Querying multi-dimensional data index using the hilbert space-filling curve. ACM SIGMOD Rec. 30, 1, 19--24.
    [21]
    Microsoft. 2008. Microsoft SQL server 2008. http://www.microsoft.com/sqlserver/2008/en/us/spatial-data.aspx.
    [22]
    B. Moon, H. V. Jagadish, C. Faloutsos, and J. H. Saltz. 2001. Analysis of the clustering properties of the hilbert space-filling curve. IEEE Trans. Knowl. Data Engin. 13, 1, 124--141.
    [23]
    S. Nutanong, R. Zhang, E. Tanin, and L. Kulik. 2008. The V*-diagram: A query-dependent approach to moving knn queries. Proc. VLDB Endow. 1, 1, 1095--1106.
    [24]
    Oracle. 2007. Oracle spatial technologies. http://download.oracle.com/otndocs/products/spatial/pdf/spatial_features_jsirev.pdf.
    [25]
    J. A. Orenstein and T. H. Merrett. 1984. A class of data structures for associative searching. In Proceedings of the 3rd ACM SIGACT-SIGMOD Symposium on Principles of Database Systems (PODS'84). 181--190.
    [26]
    F. Ramsak, V. Markl, R. Fenk, M. Zirkel, K. Elhardt, and R. Bayer. 2000. Integrating the UB-tree into a database system kernel. In Proceedings of the 26th International Conference on Very Large Data Bases (VLDB'00). 263--272.
    [27]
    E. Stefanakis, T. Theodoridis, T. K. Sellis, and Y. Cheung Lee. 1997. Point representation of spatial objects and query window extension: A new technique for spatial access methods. Int. J. Geograph. Inf. Sci. 11, 6, 529--554.
    [28]
    Y. Tao, D. Papadias, and J. Sun. 2003. The TPR*-tree: An optimized spatio-temporal access method for predictive queries. In Proceedings of the 29th International Conference on Very Large Data Bases (VLDB'03). Vol. 29. 790--801.
    [29]
    R. Weber, H.-J. Schek, and S. Blott. 1998. A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces. In Proceedings of the 24th International Conference on Very Large Data Bases (VLDB'98). 194--205.
    [30]
    P. Xu and S. Tirthapura. 2012. On the optimality of clustering properties of space filling curves. In Proceedings of the 31st Symposium on Principles of Database Systems (PODS'12). 215--224.
    [31]
    M. L. Yiu, Y. Tao, and N. Mamoulis. 2008. The Bdual -tree: Indexing moving objects by space filling curves in the dual space. The VLDB J. 17, 3, 379--400.
    [32]
    R. J. Zauhar, G. Moyna, L. Tian, Z. Li, and W. J. Welsh. 2003. Shape signatures: A new approach to computer-aided ligand and receptor-based drug design. J. Med. Chem. 46, 26, 5674--5690.
    [33]
    R. Zhang, P. Kalnis, B. C. Ooi, and K.-L. Tan. 2005. Generalized multidimensional data mapping and query processing. ACM Trans. Database Syst. 30, 3, 661--697.
    [34]
    R. Zhang, B. C. Ooi, and K.-L. Tan. 2004. Making the pyramid technique robust to query types and workloads. In Proceedings of the 20th International Conference on Data Engineering (ICDE'04). 313--324.

    Cited By

    View all

    Index Terms

    1. Towards a Painless Index for Spatial Objects

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Database Systems
      ACM Transactions on Database Systems  Volume 39, Issue 3
      September 2014
      264 pages
      ISSN:0362-5915
      EISSN:1557-4644
      DOI:10.1145/2676651
      Issue’s Table of Contents
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 07 October 2014
      Accepted: 01 January 2014
      Revised: 01 November 2013
      Received: 01 April 2013
      Published in TODS Volume 39, Issue 3

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Spatial databases
      2. mapping-based indexing
      3. space-filling curves
      4. window queries

      Qualifiers

      • Research-article
      • Research
      • Refereed

      Funding Sources

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)22
      • Downloads (Last 6 weeks)2
      Reflects downloads up to 26 Jul 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2022)The “AI + R” - tree: An Instance-optimized R - tree2022 23rd IEEE International Conference on Mobile Data Management (MDM)10.1109/MDM55031.2022.00023(9-18)Online publication date: Jul-2022
      • (2022)IndexingEncyclopedia of Big Data Technologies10.1007/978-3-319-63962-8_217-2(1-6)Online publication date: 15-Jun-2022
      • (2021)RSIMS: Large-Scale Heterogeneous Remote Sensing Images Management SystemRemote Sensing10.3390/rs1309181513:9(1815)Online publication date: 6-May-2021
      • (2021) Novel approaches on bulk‐loading of large scale spatial datasets Concurrency and Computation: Practice and Experience10.1002/cpe.659634:9Online publication date: 10-Sep-2021
      • (2020)Effectively learning spatial indicesProceedings of the VLDB Endowment10.14778/3407790.340782913:12(2341-2354)Online publication date: 14-Sep-2020
      • (2020)LAZY R-treeJournal of Information Science10.1177/016555151982861646:2(243-257)Online publication date: 1-Apr-2020
      • (2019)An Adaptive Construction Method of Hierarchical Spatio-Temporal Index for Vector Data under Peer-to-Peer NetworksISPRS International Journal of Geo-Information10.3390/ijgi81105128:11(512)Online publication date: 12-Nov-2019
      • (2019)U2-TreeIEEE/ACM Transactions on Networking10.1109/TNET.2019.289100827:1(201-213)Online publication date: 1-Feb-2019
      • (2019)An efficient and scalable multi-dimensional indexing scheme for modular data centersData & Knowledge Engineering10.1016/j.datak.2019.101729123:COnline publication date: 1-Sep-2019
      • (2019)IndexingEncyclopedia of Big Data Technologies10.1007/978-3-319-77525-8_217(1015-1019)Online publication date: 20-Feb-2019
      • Show More Cited By

      View Options

      Get Access

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media