Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3557915.3560944acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
research-article
Public Access

CoMiner: nationwide behavior-driven unsupervised spatial coordinate mining from uncertain delivery events

Published: 22 November 2022 Publication History
  • Get Citation Alerts
  • Abstract

    Geocoding, associating textual addresses with corresponding GPS coordinates, is vital for many location-based services (e.g., logistics, ridesharing, and social networks). One of the most common Geocoding solutions is using commercial map services (e.g., Google Maps) by uploading textual addresses to obtain corresponding coordinates. However, this is typically not practical for some location-based service providers due to real-world challenges like commercial competition and high costs (recurring fees). In this paper, we design a new cost-effective Geocoding framework to automatically infer the geographic coordinates from textual addresses for service providers. To achieve this, we take the E-Commerce logistics service as a concrete scenario and design CoMiner, an unsupervised coordinate inference framework based on textual address data, delivery event data, and courier trajectory data. There are three main components in CoMiner. (1) A POI-level clustering model by modeling customers' shopping patterns at different spatial granularities; (2) A Delivery Mobility Graph (DMG) by modeling couriers' delivery events and geographic coordinates; (3) A behavior-driven address ranking model by mining couriers' uncertain reporting behaviors to further infer coordinates on DMG. We extensively verify the performance of CoMiner with a three-phase evaluation from data-driven experiments to real-world deployment. (i) We conduct extensive experiments on three large-scale datasets where CoMiner achieves an average accuracy of 95.1%, which outperforms the state-of-the-art methods by 20.3%. (ii) We deploy CoMiner in JD Logistics, inferring coordinates for over 30 million addresses with an average accuracy of 93.3%. (iii) We utilize CoMiner for two Geocoding-based applications, i.e., parcel re-routing optimization and abnormal delivery event detection.

    References

    [1]
    2020. Cross-border E-commerce LogisticsTrends. 2020. https://www.cnlogistics.com.hk/doc/CNL_intel_202011.pdf.
    [2]
    2021. Baidu Maps. 2021. https://map.baidu.com/.
    [3]
    2021. COVID-19 has reshaped last-mile logistics in 2020. https://www.weforum.org/press/2021/04/covid-19-has-reshaped-last-mile-logistics-with\-e-commerce-deliveries-rising-25-in-2020/.
    [4]
    2021. GeoNames. 2021. http://www.geonames.org/.
    [5]
    2021. OpenStreetMap. 2021. http://nominatim.openstreetmap.org/.
    [6]
    Pavel Berkhin, Michael R. Evans, Florin Teodorescu, Wei Wu, and Dragomir Yankov. 2015. A New Approach to Geocoding: BingGC. In Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems (Seattle, Washington) (SIGSPATIAL '15). Association for Computing Machinery, New York, NY, USA, Article 7, 10 pages.
    [7]
    Edward Buckland, Egemen Tanin, Nicholas Geard, Cameron Zachreson, Hairuo Xie, and Hanan Samet. 2021. Managing Trajectories and Interactions During a Pandemic: A Trajectory Similarity-based Approach (Demo Paper). In Proceedings of the 29th International Conference on Advances in Geographic Information Systems. 423--426.
    [8]
    Arnaud Casteigts, Paola Flocchini, Walter Quattrociocchi, and Nicola Santoro. 2012. Time-varying graphs and dynamic networks. International Journal of Parallel, Emergent and Distributed Systems 27, 5 (2012), 387--408.
    [9]
    Abhranil Chatterjee, Janit Anjaria, Sourav Roy, Arnab Ganguli, and Krishanu Seal. 2016. SAGEL: Smart Address Geocoding Engine for Supply-Chain Logistics. In SIGSPATIAL'16. ACM, Article 42, 10 pages.
    [10]
    Anooshmita Das, Emil Stubbe Kolvig-Raun, and Mikkel Baun Kjærgaard. 2020. Accurate Trajectory Prediction in a Smart Building Using Recurrent Neural Networks. In UbiComp-ISWC'20. ACM, 619--628.
    [11]
    Martin Ester, Hans-Peter Kriegel, Jörg Sander, and Xiaowei Xu. 1996. A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In KDD'1996 (KDD'96). AAAI Press, 226--231.
    [12]
    Joachim Gudmundsson, Martin P Seybold, and John Pfeifer. 2021. On Practical Nearest Sub-Trajectory Queries under the Fréchet Distance. In Proceedings of the 29th International Conference on Advances in Geographic Information Systems. 596--605.
    [13]
    Baoshen Guo, Shuai Wang, Yi Ding, Guang Wang, Suining He, Desheng Zhang, and Tian He. 2021. Concurrent Order Dispatch for Instant Delivery with Time-Constrained Actor-Critic Reinforcement Learning. In 2021 IEEE Real-Time Systems Symposium (RTSS). IEEE, 176--187.
    [14]
    Baoshen Guo, Weijian Zuo, Shuai Wang, Wenjun Lyu, Zhiqing Hong, Yi Ding, Tian He, and Desheng Zhang. 2022. WePos: Weak-Supervised Indoor Positioning with Unlabeled WiFi for On-Demand Delivery. 6, 2, Article 54 (jul 2022), 25 pages.
    [15]
    Suining He and Kang G Shin. 2020. Towards fine-grained flow forecasting: A graph attention approach for bike sharing systems. In WWW 2020. 88--98.
    [16]
    Tianfu He, Guochun Chen, Chuishi Meng, Huajun He, Zheyi Pan, Yexin Li, Sijie Ruan, Huimin Ren, Ye Yuan, Ruiyuan Li, et al. 2021. POI Alias Discovery in Delivery Addresses using User Locations. In SIGSPATIAL'21. 225--228.
    [17]
    Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation 9, 8 (1997), 1735--1780.
    [18]
    Huiqi Hu, Yudian Zheng, Zhifeng Bao, Guoliang Li, Jianhua Feng, and Reynold Cheng. 2016. Crowdsourced POI labelling: Location-aware result inference and Task Assignment. In 2016 IEEE 32nd International Conference on Data Engineering (ICDE). 61--72.
    [19]
    Zhiheng Huang, Wei Xu, and Kai Yu. 2015. Bidirectional LSTM-CRF models for sequence tagging. arXiv preprint arXiv:1508.01991 (2015).
    [20]
    Sayali Kulkarni, Shailee Jain, Mohammad Javad Hosseini, Jason Baldridge, Eugene Ie, and Li Zhang. 2021. Multi-Level Gazetteer-Free Geocoding. In Proceedings of Second International Combined Workshop on Spatial Language Understanding and Grounded Communication for Robotics. Association for Computational Linguistics, Online, 79--88.
    [21]
    Mingqian Li, Panrong Tong, Mo Li, Zhongming Jin, Jianqiang Huang, and Xian-Sheng Hua. 2021. Traffic Flow Prediction with Vehicle Trajectories. AAAI 35, 1 (May 2021), 294--302.
    [22]
    Quannan Li, Yu Zheng, Xing Xie, Yukun Chen, Wenyu Liu, and Wei ying Ma. 2008. Mining user similarity based on location history. In In GIS '08: Proceedings of the 16th ACM SIGSPATIAL international conference on Advances in geographic information systems. ACM, 1--10.
    [23]
    Ruiyuan Li, Huajun He, Rubin Wang, Sijie Ruan, Tianfu He, Jie Bao, Junbo Zhang, Liang Hong, and Yu Zheng. 2021. TrajMesa: A Distributed NoSQL-Based Trajectory Data Management System. TKDE (2021), 1--1.
    [24]
    Yingtao Luo, Qiang Liu, and Zhaocheng Liu. 2021. STAN: Spatio-Temporal Attention Network for Next Location Recommendation. In WWW 2021. 2177--2185.
    [25]
    Rada Mihalcea, Courtney Corley, Carlo Strapparava, et al. 2006. Corpus-based and knowledge-based measures of text semantic similarity. In AAAI, Vol. 6. 775--780.
    [26]
    Mashaal Musleh, Sofiane Abbar, Rade Stanojevic, and Mohamed Mokbel. 2021. QARTA: an ML-based system for accurate map services. Proceedings of the VLDB Endowment 14, 11 (2021), 2273--2282.
    [27]
    Suraj Nair, Kiran Javkar, Jiahui Wu, and Vanessa Frias-Martinez. 2019. Understanding Cycling Trip Purpose and Route Choice Using GPS Traces and Open Data. IMWUT. 3, 1, Article 20 (March 2019), 26 pages.
    [28]
    Benjamin J. Radford. 2021. Regressing Location on Text for Probabilistic Geocoding. In Proceedings of the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021). Association for Computational Linguistics, Online, 53--57.
    [29]
    Sina Rashidian, Xinyu Dong, Shubham Kumar Jain, and Fusheng Wang. 2018. EaserGeocoder: Integrative Geocoding with Machine Learning (Demo Paper). In SIGSPATIAL'18. ACM, 572--575.
    [30]
    Sijie Ruan, Cheng Long, Jie Bao, Chunyang Li, Zisheng Yu, Ruiyuan Li, Yuxuan Liang, Tianfu He, and Yu Zheng. 2020. Learning to generate maps from trajectories. In AAAI, Vol. 34. 890--897.
    [31]
    Sijie Ruan, Zi Xiong, Cheng Long, Yiheng Chen, Jie Bao, Tianfu He, Ruiyuan Li, Shengnan Wu, Zhongyuan Jiang, and Yu Zheng. 2020. Doing in One Go: Delivery Time Inference Based on Couriers' Trajectories. In KDD'2020. 2813--2821.
    [32]
    Yatong Song, Jiawei Li, Liying Chen, Shuiping Chen, Renqing He, and Zhizhao Sun. 2021. A Semantic Segmentation Based POI Coordinates Generating Framework for On-Demand Food Delivery Service. In SIGSPATIAL'21. ACM, 379--388.
    [33]
    Vishal Srivastava, Priyam Tejaswin, Lucky Dhakad, Mohit Kumar, and Amar Dani. 2020. A Geocoding Framework Powered by Delivery Data. In SIGSPATIAL'20. ACM, 568--577.
    [34]
    Qin TIAN, Yue GONG, Mengjun KANG, Shening MENG, and Qingyun DU. 2016. A comparative evaluation of online geocoding services in China. 41, 10 (2016), 1351--1358.
    [35]
    Qin Tian, Fu Ren, Tao Hu, Jiangtao Liu, Ruichang Li, and Qingyun Du. 2016. Using an optimized Chinese address matching method to develop a geocoding service: a case study of Shenzhen, China. ISPRS International Journal of Geo-Information 5, 5 (2016), 65.
    [36]
    Panrong Tong, Mingqian Li, Mo Li, Jianqiang Huang, and Xiansheng Hua. 2021. Large-Scale Vehicle Trajectory Reconstruction with Camera Sensing Network. In MobiCom'21. ACM, 188--200.
    [37]
    Tin Vu, Solluna Liu, Renzhong Wang, and Kumarswamy Valegerepura. 2020. Noise Prediction for Geocoding Queries using Word Geospatial Embedding and Bidirectional LSTM. In Proceedings of the 28th International Conference on Advances in Geographic Information Systems. 127--130.
    [38]
    Sheng Wang, Zhifeng Bao, J. Shane Culpepper, and Gao Cong. 2021. A Survey on Trajectory Data Management, Analytics, and Learning. ACM Comput. Surv. 54, 2, Article 39 (March 2021), 36 pages.
    [39]
    Zhengcong Yin, Andong Ma, and Daniel W Goldberg. 2019. A deep learning approach for rooftop geocoding. Transactions in GIS 23, 3 (2019), 495--514.
    [40]
    Yukun Yuan, Desheng Zhang, Fei Miao, John A. Stankovic, Tian He, George Pappas, and Shan Lin. 2021. eRoute: Mobility-Driven Integration of Heterogeneous Urban Cyber-Physical Systems under Disruptive Events. IEEE Transactions on Mobile Computing (2021), 1--1.
    [41]
    Hanyuan Zhang, Xingyu Zhang, Qize Jiang, Baihua Zheng, Zhenbang Sun, and Weiwei Sun. 2020. Trajectory similarity learning with auxiliary supervision and optimal matching. In IJCAI'2020. 11--17.
    [42]
    Ping Zhang, Zhifeng Bao, Yuchen Li, Guoliang Li, Yipeng Zhang, and Zhiyong Peng. 2018. Trajectory-driven influential billboard placement. In KDD'2018. 2748--2757.
    [43]
    Rui Zhang, Conrad Albrecht, Wei Zhang, Xiaodong Cui, Ulrich Finkler, David Kung, and Siyuan Lu. 2020. Map generation from large scale incomplete and inaccurate data labels. In KDD'2020. 2514--2522.
    [44]
    Yan Zhao, Shuo Shang, YuWang, Bolong Zheng, Quoc Viet Hung Nguyen, and Kai Zheng. 2018. Rest: A reference-based framework for spatio-temporal trajectory compression. In KDD'2018. 2797--2806.
    [45]
    Yu Zheng. 2015. Trajectory Data Mining: An Overview. ACM Trans. Intell. Syst. Technol. 6, 3, Article 29 (May 2015), 41 pages.

    Cited By

    View all
    • (2024)SmallMap: Low-cost Community Road Map Sensing with Uncertain Delivery BehaviorProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/36595968:2(1-26)Online publication date: 15-May-2024
    • (2024)Robust Route Planning under Uncertain Pickup Requests for Last-mile DeliveryProceedings of the ACM on Web Conference 202410.1145/3589334.3645595(3022-3030)Online publication date: 13-May-2024
    • (2023)VeLP: Vehicle Loading Plan Learning from Human Behavior in Nationwide Logistics SystemProceedings of the VLDB Endowment10.14778/3626292.362630517:2(241-249)Online publication date: 1-Oct-2023
    • Show More Cited By

    Index Terms

    1. CoMiner: nationwide behavior-driven unsupervised spatial coordinate mining from uncertain delivery events

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        SIGSPATIAL '22: Proceedings of the 30th International Conference on Advances in Geographic Information Systems
        November 2022
        806 pages
        ISBN:9781450395298
        DOI:10.1145/3557915
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 22 November 2022

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. coordinate inference
        2. human behavior
        3. spatio-temporal data mining

        Qualifiers

        • Research-article

        Funding Sources

        Conference

        SIGSPATIAL '22
        Sponsor:

        Acceptance Rates

        Overall Acceptance Rate 220 of 1,116 submissions, 20%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)219
        • Downloads (Last 6 weeks)57
        Reflects downloads up to 27 Jul 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2024)SmallMap: Low-cost Community Road Map Sensing with Uncertain Delivery BehaviorProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/36595968:2(1-26)Online publication date: 15-May-2024
        • (2024)Robust Route Planning under Uncertain Pickup Requests for Last-mile DeliveryProceedings of the ACM on Web Conference 202410.1145/3589334.3645595(3022-3030)Online publication date: 13-May-2024
        • (2023)VeLP: Vehicle Loading Plan Learning from Human Behavior in Nationwide Logistics SystemProceedings of the VLDB Endowment10.14778/3626292.362630517:2(241-249)Online publication date: 1-Oct-2023
        • (2023)AutoBuild: Automatic Community Building Labeling for Last-mile DeliveryProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614658(4623-4630)Online publication date: 21-Oct-2023
        • (2023)Attention Enhanced Package Pick-Up Time Prediction via Heterogeneous Behavior ModelingAlgorithms and Architectures for Parallel Processing10.1007/978-981-97-0862-8_12(189-208)Online publication date: 20-Oct-2023

        View Options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Get Access

        Login options

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media