DOI: 10.1145/3318464.3393815
keynote

When the Web is your Data Lake: Creating a Search Engine for Datasets on the Web

Published: 31 May 2020
  • Abstract

    There are thousands of data repositories on the Web, providing access to millions of datasets. National and regional governments, scientific publishers and consortia, commercial data providers, and others publish data for fields ranging from social science to life science to high-energy physics to climate science and more. Access to this data is critical to facilitating reproducibility of research results, enabling scientists to build on others' work, and providing data journalists easier access to information and its provenance. In this talk, I will discuss our work on Dataset Search, which provides search capabilities over potentially all dataset repositories on the Web. I will talk about the open ecosystem for describing and citing datasets that we hope to encourage and the technical details on how we went about building Dataset Search. Finally, I will highlight research challenges in building a vibrant, heterogeneous, and open ecosystem where data becomes a first-class citizen.

    Cited By

    • (2023) Cross Modal Data Discovery over Structured and Unstructured Data Lakes. Proceedings of the VLDB Endowment 16:11 (3377-3390). DOI: 10.14778/3611479.3611533. Online publication date: 24-Aug-2023
    • (2020) Loch Prospector: Metadata Visualization for Lakes of Open Data. 2020 IEEE Visualization Conference (VIS) (126-130). DOI: 10.1109/VIS47514.2020.00032. Online publication date: Oct-2020
    • (2020) Using EEG to Distinguish Between Writing and Typing for the Same Cognitive Task. Brain Function Assessment in Learning (66-74). DOI: 10.1007/978-3-030-60735-7_7. Online publication date: 9-Oct-2020

      Published In

      SIGMOD '20: Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data
      June 2020
      2925 pages
      ISBN: 9781450367356
      DOI: 10.1145/3318464
      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Author Tags

      1. knowledge graphs
      2. structured data
      3. web search

      Qualifiers

      • Keynote

      Conference

      SIGMOD/PODS '20

      Acceptance Rates

      Overall Acceptance Rate 785 of 4,003 submissions, 20%
