Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3626246.3654757acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
short-paper

MillenniumDB: A Multi-modal, Multi-model Graph Database

Published: 09 June 2024 Publication History
  • Get Citation Alerts
  • Abstract

    Current knowledge graphs encompass diverse data formats, including images, text, tables, audio files, and videos. Additionally, the graph database ecosystem is required to support multiple co-existing data models. Addressing these challenges is essential for promoting interoperability between data sources. This demo introduces MillenniumDB, a high-performing, open-source graph database handling this diversity of data formats and models.
    MillenniumDB is a multi-modal, multi-model graph database, supporting the popular property graph paradigm, the Semantic Web format RDF, and the multi-layered graph model, which combines and extends the two. In terms of querying, its provides support for a Cypher-like language over property graphs and multilayered graphs, as well as SPARQL 1.1 support over RDF. The engine is build on a solid theoretical foundation and it leverages worst-case optimal join algorithms in combination with traditional relational query optimization. It also support a wide array of graph-specific tasks such as path finding, pattern recognition, and similarity search on multi-modal data. In this demo, we will showcase how MillenniumDB is currently being used to host three public multi-modal knowledge graphs. The first one, a multi-layered graph called TelarKG, was developed at IMFD Chile to track the information about the Chilean constitutional reform. In the second one, called BibKG, we integrate information about Computer Science publications from different sources and make them available as a property graph. Finally, for RDF, we provide a SPARQL endpoint for Wikidata, the largest knowledge graph openly available online. We remark that our endpoints have stable links, allowing the audience to post queries using their Web browser with no restrictions, and will be available during the review process and during the demo.

    References

    [1]
    Renzo Angles, Marcelo Arenas, Pablo Barceló, Aidan Hogan, Juan L. Reutter, and Domagoj Vrgovc. 2017. Foundations of Modern Query Languages for Graph Databases. ACM Comput. Surv., Vol. 50, 5 (2017). https://doi.org/10.1145/3104031
    [2]
    Renzo Angles, Aidan Hogan, Ora Lassila, Carlos Rojas, Daniel Schwabe, Pedro A. Szekely, and Domagoj Vrgoc. 2022. Multilayer graphs: a unified data model for graph databases. In GRADES-NDA'22. ACM, 11:1--11:6.
    [3]
    Mayank Bawa, Tyson Condie, and Prasanna Ganesan. 2005. LSH forest: self-tuning indexes for similarity search. In WWW'05. 651--660.
    [4]
    World Wide Web Consortium et al. 2013. SPARQL 1.1 overview. (2013).
    [5]
    World Wide Web Consortium et al. 2014. RDF 1.1 concepts and syntax. (2014).
    [6]
    Orri Erling. 2012. Virtuoso, a Hybrid RDBMS/Graph Column Store. IEEE Data Eng. Bull., Vol. 35, 1 (2012), 3--8. http://sites.computer.org/debull/A12mar/vicol.pdf
    [7]
    Alin Deutsch et. al. 2022. Graph Pattern Matching in GQL and SQL/PGQ. In SIGMOD '22. https://doi.org/10.1145/3514221.3526057
    [8]
    Nadime Francis et. al. 2018. Cypher: An Evolving Query Language for Property Graphs. In SIGMOD 2019. https://doi.org/10.1145/3183713.3190657
    [9]
    The Wikimedia Foundation. 2021. Wikidata:Database download. https://www.wikidata.org/wiki/Wikidata:Database_download
    [10]
    Michael J. Freitag, Maximilian Bandle, Tobias Schmidt, Alfons Kemper, and Thomas Neumann. 2020. Adopting Worst-Case Optimal Joins in Relational Database Systems. Proc. VLDB Endow., Vol. 13, 11 (2020), 1891--1904.
    [11]
    Aidan Hogan, Cristian Riveros, Carlos Rojas, and Adriá n Soto. 2019. A Worst-Case Optimal Join Algorithm for SPARQL. In ISWC'19. Springer, 258--275.
    [12]
    Stanislav Malyshev, Markus Krö tzsch, Larry Gonzá lez, Julius Gonsior, and Adrian Bielefeldt. 2018. Getting the Most Out of Wikidata: Semantic Technology Usage in Wikipedia's Knowledge Graph. In ISWC 2019.
    [13]
    Jena Team. 2021. Jena TDB. https://jena.apache.org/documentation/tdb/
    [14]
    Bryan Thompson, Mike Personick, and Martyn Cutcher. 2014. The Bigdata® RDF Graph Database. In Linked Data Management. Chapman and Hall/CRC, 193--237.
    [15]
    Todd L. Veldhuizen. 2014. Triejoin: A Simple, Worst-Case Optimal Join Algorithm. In ICDT'14. 96--106.
    [16]
    Denny Vrandecic and Markus Krö tzsch. 2014. Wikidata: a free collaborative knowledgebase. Commun. ACM, Vol. 57, 10 (2014), 78--85.
    [17]
    Domagoj Vrgovc, Carlos Rojas, Renzo Angles, Marcelo Arenas, Diego Arroyuelo, Carlos Buil-Aranda, Aidan Hogan, Gonzalo Navarro, Cristian Riveros, and Juan Romero. 2023. MillenniumDB: A Persistent, Open-Source, Graph Database. Data Intelligence, Vol. 5 (3) (2023), 560--610. https://doi.org/10.1162/dint_a_00229
    [18]
    Jim Webber. 2012. A programmatic introduction to Neo4j. In SPLASH '12, Gary T. Leavens (Ed.). https://doi.org/10.1145/2384716.2384777 io

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGMOD/PODS '24: Companion of the 2024 International Conference on Management of Data
    June 2024
    694 pages
    ISBN:9798400704222
    DOI:10.1145/3626246
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 09 June 2024

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. graph databases
    2. knowledge graphs
    3. property graphs

    Qualifiers

    • Short-paper

    Funding Sources

    • ANID ? Millennium Science Initiative Program
    • ANID Fondecyt Regular

    Conference

    SIGMOD/PODS '24
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 785 of 4,003 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 43
      Total Downloads
    • Downloads (Last 12 months)43
    • Downloads (Last 6 weeks)43

    Other Metrics

    Citations

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media