Abstract
The prevention of forgery and alternation of important data of blockchain technology is contributing widely to the expanding usage of this technology to areas and industries such as real estate and agriculture. Despite the high utilization of the blockchain, its write-intensive feature causes a large amount of disk I/Os when trying to index and process queries over the data. Among previous studies, the hierarchical quadrant spatial LSM tree (i.e., HQ-sLSM tree) was proposed as an effective structure to index large amounts of geospatial point data from the blockchain and process queries while triggering a low number of disk I/Os. However, geospatial data exist in forms such as lines and polygons inside cadastral maps and survey information. In this paper, we propose an extended version of the HQ-sLSM tree which indexes geospatial line and polygon data. The extended tree, named the HQ-sLSM\(^{+}\) tree, inherits and adapts some common features and the low disk I/O algorithms of the original HQ-sLSM tree, fitting them to the line and polygon data types. Furthermore, an algorithm to process the spatial join query over two HQ-sLSM\(^{+}\) trees is proposed. A concept of a spatial join filter is introduced to access disk components efficiently. Experiments confirmed that the number of disk I/Os triggered when spatially joining two HQ-sLSM\(^{+}\) trees was much less compared to existing baseline index trees such as the R-tree and the LSM R-tree.
Similar content being viewed by others
Data availability
No datasets were generated or analyzed during the current study.
References
Bal M (2017) Securing property rights in India through distributed ledger technology. Working Papers id:11567, eSocialSciences (2017). https://ideas.repec.org/p/ess/wpaper/id11567.html
KRAS (2014) KRAS, Sejong, Korea. https://www.kras.go.kr/
Land I (2022) Transport: one stop service for korea land use information. http://www.molit.go.kr/english/USR/BORD0201/m_28286/DTL.jsp?id=eng0301 &cate= &mode=view &idx=3090 &key=content &search= &search_regdate_s= &search_regdate_e= &order= &desc=asc &srch_prc_stts= &item_num=0 &search_dept_id= &search_dept_nm= &srch_usr_nm=N &srch_usr_titl=N &srch_usr_ctnt=N &srch_mng_nm=N &old_dept_nm= &search_gbn= &search_section= &source= &search1= &lcmspage=1
Lantmateriet (2018) Lantmateriet, Gavle, Sweden. https://www.lantmateriet.se/en/
Samet H (2005) Foundations of multidimensional and metric data structures (the Morgan Kaufmann series in computer graphics and geometric modeling). Morgan Kaufmann Publishers Inc., San Francisco
Gaede V, Günther O (1998) Multidimensional access methods. ACM Comput Surv 30(2):170–231. https://doi.org/10.1145/280277.280279
Leinonen H (2016) In: Górka J (ed) Decentralised Blockchained and Centralised Real-Time Payment Ledgers: Development Trends and Basic Requirements, pp 236–261. Palgrave Macmillan UK, London. https://doi.org/10.1057/9781137541215_8
Wood G et al (2014) Ethereum: a secure decentralised generalised transaction ledger. Ethereum Project Yellow Paper 151(2014):1–32
Skrodzki M (2019) The k-d tree data structure and a proof for neighborhood computation in expected logarithmic time. arXiv:1903.04936
Finkel R, Bentley J (1974) Quad trees: a data structure for retrieval on composite keys. Acta Inf 4:1–9. https://doi.org/10.1007/BF00288933
FOAM (2018) FOAM location. Brooklyn, New York. https://foam.space
Lee J, Kwon T, Seo M, Jung S (2023) Hierarchical quadrant spatial lsm tree for indexing blockchain-based geospatial point data. IEEE Access 11:118088–118104. https://doi.org/10.1109/ACCESS.2023.3326256
Clementini E, Di Felice P (1995) A comparison of methods for representing topological relationships. Inf Sci - Appl 3(3):149–178. https://doi.org/10.1016/1069-0115(94)00033-X
Bouros P, Mamoulis N (2019) Spatial joins: what’s next? SIGSPATIAL Special 11(1):13–21. https://doi.org/10.1145/3355491.3355494
Moon W (2012) Geographic information systems and science (3rd edition) by P. A. Longley, M. F. Goodchild, D. J. Maguire and D. W. Rhind (book review). The Leading Edge (Society of Exploration Geophysicists) 31, 975–976
Ester M, Kriegel H-P, Sander J, Xu X (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining. KDD’96. AAAI Press, Portland, Oregon, pp 226–231
Cao H, Mamoulis N, Cheung DW (2007) Discovery of periodic patterns in spatiotemporal sequences. IEEE Trans Knowl Data Eng 19(4):453–467. https://doi.org/10.1109/TKDE.2007.1002
Markram H, Meier K, Lippert T, Grillner S, Frackowiak R, Dehaene S, Knoll A, Sompolinsky H, Verstreken K, Defelipe J, Grant S, Changeux J-P, Saria A (2011) Introducing the human brain project. Procedia Comput Sci 7:39–42. https://doi.org/10.1016/j.procs.2011.12.015
Ablimit A, Fusheng W, Joel HS (2012) Towards building a high performance spatial query system for large scale medical imaging data. ACM SIGSPATIAL 7:309–318. https://doi.org/10.1145/2424321.2424361
Jacox EH, Samet H (2007) Spatial join techniques. ACM Trans Database Syst 32(1):7. https://doi.org/10.1145/1206049.1206056
Mamoulis N (2009) Spatial join. In: Liu L, Özsu MT (eds) Encyclopedia of database systems, pp 2707–2714. Springer, Boston. https://doi.org/10.1007/978-0-387-39940-9_356
Sabek I, Mokbel MF (2017) On spatial joins in mapreduce. In: Proceedings of the 25th ACM SIGSPATIAL international conference on advances in geographic information systems. SIGSPATIAL ’17. Association for Computing Machinery, New York.https://doi.org/10.1145/3139958.3139967
Heyrani-Nobari S, Qu Q, Jensen CS (2017) In-memory spatial join: the data matters! In: International Conference on Extending Database Technology. https://api.semanticscholar.org/CorpusID:39365787
McKenney M, Frye R, Dellamano M, Anderson K, Harris J (2017) Multi-core parallelism for plane sweep algorithms as a foundation for GIS operations. Geoinformatica 21(1):151–174
Brinkhoff T, Kriegel H-P, Seeger B (1993) Efficient processing of spatial joins using r-trees. SIGMOD Rec 22(2):237–246. https://doi.org/10.1145/170036.170075
Gurret C, Rigaux P (2000) The sort/sweep algorithm: a new method for r-tree based spatial joins. In: Proceedings. 12th International Conference on Scientific and Statistica Database Management, pp 153–165. https://doi.org/10.1109/SSDM.2000.869785
Huang Y-W, Jing N, Rundensteiner EA (1997) Spatial joins using r-trees: breadth-first traversal with global optimizations. In: Proceedings of the 23rd International Conference on Very Large Data Bases. VLDB ’97. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, pp 396–405
Alsubaiee S, Behm A, Borkar V, Heilbron Z, Kim Y-S, Carey MJ, Dreseler M, Li C (2014) Storage management in AsterixDB. Proc VLDB Endow 7(10):841–852. https://doi.org/10.14778/2732951.2732958
Kim Y (2016) Transactional and spatial query processing in the big data era. Ph.D. thesis, University of California, Irvine, USA. http://www.escholarship.org/uc/item/3815k9kz
Shin J, Wang J, Aref WG (2021) The LSM RUM-tree: a log structured merge R-tree for update-intensive spatial workloads. In: 2021 IEEE 37th International Conference on Data Engineering (ICDE), pp 2285–2290. https://doi.org/10.1109/ICDE51399.2021.00238
Liu J, Li H, Gao Y, Yu H, Jiang D (2014) A geohash-based index for spatial data management in distributed memory. In: 2014 22nd International Conference on Geoinformatics, pp 1–4. https://doi.org/10.1109/GEOINFORMATICS.2014.6950819
Wood J, Kim S (2008) Plane sweep algorithm. In: Shekhar S, Xiong H (eds) Encyclopedia of GIS, pp 876–878. Springer, Boston. https://doi.org/10.1007/978-0-387-35973-1_989
O’Neil PE, Cheng E, Gawlick D, O’Neil EJ (1996) The log-structured merge-tree (LSM-tree). Acta Inform 33(4):351–385. https://doi.org/10.1007/s002360050048
OpenStreetMap contributors: Planet dump retrieved from https://planet.osm.org. https://www.openstreetmap.org (2017)
Acknowledgments
This work was supported in part by the Institute of Information and Communications Technology Planning and Evaluation (IITP) through the Korean Government [Ministry of Science and ICT (MSIT)], Development of High-Speed Analysis of Distributed Large-Scale Data for Wide Usability of Various Industries, under Grant 2021-0-00180, and in part by the Korea Institute for Advanced of Technology (KIAT) funded by the Korean Government [Ministry of Trade, Industry and Energy (MOTIE)] through the Human Resource Development (HRD) Program for Industrial Innovation under Grant P0020535.
Author information
Authors and Affiliations
Contributions
T.K and J.L. wrote the main manuscript text. T.K revised the paper. J.L. improved and implemented the HQ-sLSM+ tree indexing method and its spatial join algorithm. J.L. also did the extensive experiments for the performance analysis given in the paper. T.K. developed the initial version of HQ-sLSM+ tree indexing scheme and its spatial join algorithm. S.J. proposed the basic idea of HQ-sLSM+ tree indexing scheme and its spatial join algorithm. S.J. guided J.L and T.K for the development of HQ-sLSM+ tree indexing scheme and its spatial join algorithm. S.J. also guided J.L. and T.K in writing this paper. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no conflict of interest.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Lee, J., Kwon, T. & Jung, S. An effective spatial join method for blockchain-based geospatial data using hierarchical quadrant spatial LSM+ tree. J Supercomput 80, 17492–17523 (2024). https://doi.org/10.1007/s11227-024-06134-5
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-024-06134-5