Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3274895.3274929acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
demonstration
Public Access

EaserGeocoder: integrative geocoding with machine learning (demo paper)

Published: 06 November 2018 Publication History
  • Get Citation Alerts
  • Abstract

    Increased availability of large amounts of address data provides opportunities for data driven studies to improve decision making in business applications and support precision public health with high resolution geolocations. Geocoding large number of addresses is challenging due to high cost and often disclosure of sensitive data to vendors over the Web. Most geocoders take advantage of Web APIs which require sending private addresses over the Internet, which may not be an option for many applications with sensitive data including public health and geo-medicine. Meanwhile, the cost for geocoding massive number of addresses could be high and becomes a major hurdle for many users. To overcome these challenges, we developed an open source on-premise geocoding software EaserGeocoder, which uses a novel integrative geocoding model to achieve high accuracy through integrating multiple open data sources. EaserGeocoder takes advantage of machine learning based approaches to determine best answers from multiple data sources. EaserGeocoder can also be easily parallelized to achieve high scalability through parallelized search and distributed computing. EaserGeocoder is on a par with commercial geocoding systems, outperforms open source systems, and is available for free.

    References

    [1]
    2016. Physician and Other Supplier Data CY 2013. Retrieved August 11, 2016 from https://www.cms.gov/
    [2]
    2016. SPARCS. https://www.health.ny.gov/statistics/sparcs/
    [3]
    2017. Google Maps Geocoding API. https://developers.google.com/maps/
    [4]
    2017. NYS GIS Clearinghouse - NYS Address Points. Retrieved May 17, 2017 from http://gis.ny.gov/gisdata/inventories/details.cfm?DSID=921
    [5]
    2017. OpenAddresses. Retrieved Feb, 2017 from http://results.openaddresses.io/
    [6]
    2017. OpenStreetMap Nominatim. Retrieved May, 2017 from http://nominatim.openstreetmap.org
    [7]
    2017. TIGER Products - Geography - U.S. Census Bureau. Retrieved May 26, 2017 from https://www.census.gov/geo/maps-data/data/tiger.html
    [8]
    2018. EaserGeocoder. http://bmidb.cs.stonybrook.edu/easergeocoder/
    [9]
    Tianqi Chen and Carlos Guestrin. 2016. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining. ACM, 785--794.
    [10]
    Xin Chen and Fusheng Wang. 2016. Integrative Spatial Data Analytics for Public Health Studies of New York State. In AMIA Annual Symposium Proceedings, Vol. 2016. American Medical Informatics Association, 391.
    [11]
    New York State Geographic Information Systems Clearinghouse. 2014. NYS GIS Clearinghouse - NYS Tax Parcels. Retrieved Feb 20, 2017 from http://gis.ny.gov/gisdata/inventories/details.cfm?DSID=1300
    [12]
    Daniel W Goldberg and Myles G Cockburn. 2010. Improving geocode accuracy with candidate selection criteria. Transactions in GIS 14, s1 (2010), 149--176.
    [13]
    Geoffrey M Jacquez. 2012. A research agenda: does geocoding positional error matter in health GIS studies? Spatial and spatio-temporal epidemiology 3, 1 (2012), 7--16.
    [14]
    U.S. Department of Health & Human Services. 2015. Health Information Privacy | HHS.gov. https://www.hhs.gov/hipaa/
    [15]
    Sina Rashidian, Xinyu Dong, Amogh Avadhani, Prachi Poddar, and Fusheng Wang. 2017. Effective Scalable and Integrative Geocoding for Massive Address Datasets. In Proceedings of the 25th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. ACM, 26.
    [16]
    Gerard Rushton, Marc P Armstrong, Josephine Gittler, Barry R Greene, Claire E Pavlik, Michele M West, and Dale L Zimmerman. 2006. Geocoding in cancer research: a review. American journal of preventive medicine 30, 2 (2006), S16--S24.
    [17]
    Xuan Shi, Bowei Xue, and Imam M Xierali. 2016. Identifying the Uncertainty in Physician Practice Location through Spatial Analytics and Text Mining. International Journal of Environmental Research and Public Health 13, 9 (2016), 930.
    [18]
    Duck-Hye Yang, Lucy Mackey Bilaver, Oscar Hayes, and Robert Goerge. 2004. Improving geocoding practices: evaluation of geocoding tools. Journal of medical systems 28, 4 (2004), 361--370.
    [19]
    Paul A Zandbergen. 2008. A comparison of address point, parcel and street geocoding techniques. Computers, Environment and Urban Systems 32, 3 (2008), 214--232.
    [20]
    Paul A Zandbergen. 2009. Geocoding quality and implications for spatial analysis. Geography Compass 3, 2 (2009), 647--680.

    Cited By

    View all
    • (2024)Unveiling the impact of machine learning algorithms on the quality of online geocoding services: a case study using COVID-19 dataJournal of Geographical Systems10.1007/s10109-023-00435-8Online publication date: 25-Jan-2024
    • (2022)CoMinerProceedings of the 30th International Conference on Advances in Geographic Information Systems10.1145/3557915.3560944(1-10)Online publication date: 1-Nov-2022
    • (2021)Toward correctness control of postal addresses geocodingInterCarto. InterGIS10.35595/2414-9179-2021-2-27-114-12727:2(114-127)Online publication date: 2021
    • Show More Cited By

    Index Terms

    1. EaserGeocoder: integrative geocoding with machine learning (demo paper)

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      SIGSPATIAL '18: Proceedings of the 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
      November 2018
      655 pages
      ISBN:9781450358897
      DOI:10.1145/3274895
      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 06 November 2018

      Check for updates

      Author Tags

      1. geocoding
      2. geographic information system
      3. text searching

      Qualifiers

      • Demonstration

      Funding Sources

      Conference

      SIGSPATIAL '18
      Sponsor:

      Acceptance Rates

      SIGSPATIAL '18 Paper Acceptance Rate 30 of 150 submissions, 20%;
      Overall Acceptance Rate 220 of 1,116 submissions, 20%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)98
      • Downloads (Last 6 weeks)14
      Reflects downloads up to 26 Jul 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Unveiling the impact of machine learning algorithms on the quality of online geocoding services: a case study using COVID-19 dataJournal of Geographical Systems10.1007/s10109-023-00435-8Online publication date: 25-Jan-2024
      • (2022)CoMinerProceedings of the 30th International Conference on Advances in Geographic Information Systems10.1145/3557915.3560944(1-10)Online publication date: 1-Nov-2022
      • (2021)Toward correctness control of postal addresses geocodingInterCarto. InterGIS10.35595/2414-9179-2021-2-27-114-12727:2(114-127)Online publication date: 2021
      • (2020)Association of Opioid Use Disorder With 2016 Presidential Voting Patterns: A Cross-Sectional Study in New York State at Census Tract Level (Preprint)JMIR Public Health and Surveillance10.2196/23426Online publication date: 29-Aug-2020

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Get Access

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media