Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3274895.3274984acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
poster

R-Grove: growing a family of R-trees in the big-data forest

Published: 06 November 2018 Publication History
  • Get Citation Alerts
  • Abstract

    The rapid growth of big spatial data urged the research community to develop several big spatial data systems. Regardless of their architecture, one of the fundamental requirements of all these systems is to partition the data efficiently across machines. A widely-used technique for big spatial indexing is to reuse existing search trees asis, e.g., the R-tree family, by building a temporary tree for a sample of the input and use its leaf nodes as partition boundaries. However, we show in this paper that this approach has major limitations that make it unsuitable for the big data environment. This paper studies the use of three popular trees from the R-tree family to index big spatial data, namely, the original R-tree by Guttman, R*-tree, and RR*-tree. We show that the entire family of R-trees is not ready to grow in the big data forest due to fundamental limitations in their design. To overcome these limitations, we propose three new indexes, namely, R-Grove, R*-Grove, and RR*-Grove, which are fundamentally modified to work with big data while inheriting the main characteristics of their traditional index counterparts. With all the proposed indexes publicly available as open source, we hope that these new indexes will be adopted by the community to better serve big spatial data research.

    References

    [1]
    Norbert Beckmann, Hans-Peter Kriegel, Ralf Schneider, and Bernhard Seeger. 1990. The R*-Tree: An Efficient and Robust Access Method for Points and Rectangles. In SIGMOD. Atlantic City, NJ, 322--331.
    [2]
    Norbert Beckmann and Bernhard Seeger. 2009. A Revised R*-tree in Comparison with Related Index Structures. In SIGMOD. Providence, RI, 799--812.
    [3]
    Ahmed Eldawy and Mohamed F. Mokbel. 2015. SpatialHadoop: A MapReduce framework for spatial data. In ICDE. Seoul, South Korea, 1352--1363.
    [4]
    Antonin Guttman. 1984. R-Trees: A Dynamic Index Structure for Spatial Searching. In SIGMOD. Boston, MA, 47--57.
    [5]
    Scott T Leutenegger, Mario A Lopez, and Jeffrey Edgington. 1997. STR: A simple and efficient algorithm for R-tree packing. In Data Engineering, 1997. Proceedings. 13th International Conference on. IEEE, 497--506.

    Cited By

    View all
    • (2024)A learning-based framework for spatial join processing: estimation, optimization and tuningThe VLDB Journal10.1007/s00778-024-00836-133:4(1155-1177)Online publication date: 13-Feb-2024
    • (2023)GL-Tree: A Hierarchical Tree Structure for Efficient Retrieval of Massive Geographic LocationsSensors10.3390/s2304224523:4(2245)Online publication date: 16-Feb-2023
    • (2023)Comparison of LSM indexing techniques for storing spatial dataJournal of Big Data10.1186/s40537-023-00734-310:1Online publication date: 23-Apr-2023
    • Show More Cited By

    Index Terms

    1. R-Grove: growing a family of R-trees in the big-data forest

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      SIGSPATIAL '18: Proceedings of the 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
      November 2018
      655 pages
      ISBN:9781450358897
      DOI:10.1145/3274895
      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 06 November 2018

      Check for updates

      Author Tags

      1. big data
      2. indexing
      3. spatial partitioning

      Qualifiers

      • Poster

      Conference

      SIGSPATIAL '18
      Sponsor:

      Acceptance Rates

      SIGSPATIAL '18 Paper Acceptance Rate 30 of 150 submissions, 20%;
      Overall Acceptance Rate 220 of 1,116 submissions, 20%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)10
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 26 Jul 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)A learning-based framework for spatial join processing: estimation, optimization and tuningThe VLDB Journal10.1007/s00778-024-00836-133:4(1155-1177)Online publication date: 13-Feb-2024
      • (2023)GL-Tree: A Hierarchical Tree Structure for Efficient Retrieval of Massive Geographic LocationsSensors10.3390/s2304224523:4(2245)Online publication date: 16-Feb-2023
      • (2023)Comparison of LSM indexing techniques for storing spatial dataJournal of Big Data10.1186/s40537-023-00734-310:1Online publication date: 23-Apr-2023
      • (2023)A distributed framework for large-scale semantic trajectory similarity joinMultimedia Tools and Applications10.1007/s11042-023-15236-w83:6(16205-16229)Online publication date: 13-Jul-2023
      • (2022)Incremental partitioning for efficient spatial data analyticsProceedings of the VLDB Endowment10.14778/3494124.349415015:3(713-726)Online publication date: 4-Feb-2022
      • (2022)Angular Quantization Online Hashing for Image RetrievalIEEE Access10.1109/ACCESS.2021.309536710(72577-72589)Online publication date: 2022
      • (2021)BeastProceedings of the 30th ACM International Conference on Information & Knowledge Management10.1145/3459637.3481897(3796-3807)Online publication date: 26-Oct-2021
      • (2020)R*-Grove: Balanced Spatial Partitioning for Large-Scale DatasetsFrontiers in Big Data10.3389/fdata.2020.000283Online publication date: 28-Aug-2020
      • (2020)Using Deep Learning for Big Spatial Data PartitioningACM Transactions on Spatial Algorithms and Systems10.1145/34021267:1(1-37)Online publication date: 12-Aug-2020
      • (2019)UCR-STARSIGSPATIAL Special10.1145/3377000.337700511:2(34-40)Online publication date: 17-Dec-2019
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media