-
Developments and applications of the OPTIMADE API for materials discovery, design, and data exchange
Authors:
Matthew L. Evans,
Johan Bergsma,
Andrius Merkys,
Casper W. Andersen,
Oskar B. Andersson,
Daniel Beltrán,
Evgeny Blokhin,
Tara M. Boland,
Rubén Castañeda Balderas,
Kamal Choudhary,
Alberto Díaz Díaz,
Rodrigo Domínguez García,
Hagen Eckert,
Kristjan Eimre,
María Elena Fuentes Montero,
Adam M. Krajewski,
Jens Jørgen Mortensen,
José Manuel Nápoles Duarte,
Jacob Pietryga,
Ji Qi,
Felipe de Jesús Trejo Carrillo,
Antanas Vaitkus,
Jusong Yu,
Adam Zettel,
Pedro Baptista de Castro
, et al. (34 additional authors not shown)
Abstract:
The Open Databases Integration for Materials Design (OPTIMADE) application programming interface (API) empowers users with holistic access to a growing federation of databases, enhancing the accessibility and discoverability of materials and chemical data. Since the first release of the OPTIMADE specification (v1.0), the API has undergone significant development, leading to the upcoming v1.2 relea…
▽ More
The Open Databases Integration for Materials Design (OPTIMADE) application programming interface (API) empowers users with holistic access to a growing federation of databases, enhancing the accessibility and discoverability of materials and chemical data. Since the first release of the OPTIMADE specification (v1.0), the API has undergone significant development, leading to the upcoming v1.2 release, and has underpinned multiple scientific studies. In this work, we highlight the latest features of the API format, accompanying software tools, and provide an update on the implementation of OPTIMADE in contributing materials databases. We end by providing several use cases that demonstrate the utility of the OPTIMADE API in materials research that continue to drive its ongoing development.
△ Less
Submitted 5 April, 2024; v1 submitted 1 February, 2024;
originally announced February 2024.
-
OPTIMADE, an API for exchanging materials data
Authors:
Casper W. Andersen,
Rickard Armiento,
Evgeny Blokhin,
Gareth J. Conduit,
Shyam Dwaraknath,
Matthew L. Evans,
Ádám Fekete,
Abhijith Gopakumar,
Saulius Gražulis,
Andrius Merkys,
Fawzi Mohamed,
Corey Oses,
Giovanni Pizzi,
Gian-Marco Rignanese,
Markus Scheidgen,
Leopold Talirz,
Cormac Toher,
Donald Winston,
Rossella Aversa,
Kamal Choudhary,
Pauline Colinet,
Stefano Curtarolo,
Davide Di Stefano,
Claudia Draxl,
Suleyman Er
, et al. (31 additional authors not shown)
Abstract:
The Open Databases Integration for Materials Design (OPTIMADE) consortium has designed a universal application programming interface (API) to make materials databases accessible and interoperable. We outline the first stable release of the specification, v1.0, which is already supported by many leading databases and several software packages. We illustrate the advantages of the OPTIMADE API throug…
▽ More
The Open Databases Integration for Materials Design (OPTIMADE) consortium has designed a universal application programming interface (API) to make materials databases accessible and interoperable. We outline the first stable release of the specification, v1.0, which is already supported by many leading databases and several software packages. We illustrate the advantages of the OPTIMADE API through worked examples on each of the public materials databases that support the full API specification.
△ Less
Submitted 25 August, 2021; v1 submitted 2 March, 2021;
originally announced March 2021.
-
AiiDAlab -- an ecosystem for developing, executing, and sharing scientific workflows
Authors:
Aliaksandr V. Yakutovich,
Kristjan Eimre,
Ole Schütt,
Leopold Talirz,
Carl S. Adorf,
Casper W. Andersen,
Edward Ditler,
Dou Du,
Daniele Passerone,
Berend Smit,
Nicola Marzari,
Giovanni Pizzi,
Carlo A. Pignedoli
Abstract:
Cloud platforms allow users to execute tasks directly from their web browser and are a key enabling technology not only for commerce but also for computational science. Research software is often developed by scientists with limited experience in (and time for) user interface design, which can make research software difficult to install and use for novices. When combined with the increasing comple…
▽ More
Cloud platforms allow users to execute tasks directly from their web browser and are a key enabling technology not only for commerce but also for computational science. Research software is often developed by scientists with limited experience in (and time for) user interface design, which can make research software difficult to install and use for novices. When combined with the increasing complexity of scientific workflows (involving many steps and software packages), setting up a computational research environment becomes a major entry barrier. AiiDAlab is a web platform that enables computational scientists to package scientific workflows and computational environments and share them with their collaborators and peers. By leveraging the AiiDA workflow manager and its plugin ecosystem, developers get access to a growing range of simulation codes through a python API, coupled with automatic provenance tracking of simulations for full reproducibility. Computational workflows can be bundled together with user-friendly graphical interfaces and made available through the AiiDAlab app store. Being fully compatible with open-science principles, AiiDAlab provides a complete infrastructure for automated workflows and provenance tracking, where incorporating new capabilities becomes intuitive, requiring only Python knowledge.
△ Less
Submitted 29 September, 2020;
originally announced October 2020.
-
Materials Cloud, a platform for open computational science
Authors:
Leopold Talirz,
Snehal Kumbhar,
Elsa Passaro,
Aliaksandr V. Yakutovich,
Valeria Granata,
Fernando Gargiulo,
Marco Borelli,
Martin Uhrin,
Sebastiaan P. Huber,
Spyros Zoupanos,
Carl S. Adorf,
Casper W. Andersen,
Ole Schütt,
Carlo A. Pignedoli,
Daniele Passerone,
Joost VandeVondele,
Thomas C. Schulthess,
Berend Smit,
Giovanni Pizzi,
Nicola Marzari
Abstract:
Materials Cloud is a platform designed to enable open and seamless sharing of resources for computational science, driven by applications in materials modelling. It hosts 1) archival and dissemination services for raw and curated data, together with their provenance graph, 2) modelling services and virtual machines, 3) tools for data analytics, and pre-/post-processing, and 4) educational material…
▽ More
Materials Cloud is a platform designed to enable open and seamless sharing of resources for computational science, driven by applications in materials modelling. It hosts 1) archival and dissemination services for raw and curated data, together with their provenance graph, 2) modelling services and virtual machines, 3) tools for data analytics, and pre-/post-processing, and 4) educational materials. Data is citable and archived persistently, providing a comprehensive embodiment of the FAIR principles that extends to computational workflows. Materials Cloud leverages the AiiDA framework to record the provenance of entire simulation pipelines (calculations performed, codes used, data generated) in the form of graphs that allow to retrace and reproduce any computed result. When an AiiDA database is shared on Materials Cloud, peers can browse the interconnected record of simulations, download individual files or the full database, and start their research from the results of the original authors. The infrastructure is agnostic to the specific simulation codes used and can support diverse applications in computational science that transcend its initial materials domain.
△ Less
Submitted 27 March, 2020;
originally announced March 2020.
-
AiiDA 1.0, a scalable computational infrastructure for automated reproducible workflows and data provenance
Authors:
Sebastiaan. P. Huber,
Spyros Zoupanos,
Martin Uhrin,
Leopold Talirz,
Leonid Kahle,
Rico Häuselmann,
Dominik Gresch,
Tiziano Müller,
Aliaksandr V. Yakutovich,
Casper W. Andersen,
Francisco F. Ramirez,
Carl S. Adorf,
Fernando Gargiulo,
Snehal Kumbhar,
Elsa Passaro,
Conrad Johnston,
Andrius Merkys,
Andrea Cepellotti,
Nicolas Mounet,
Nicola Marzari,
Boris Kozinsky,
Giovanni Pizzi
Abstract:
The ever-growing availability of computing power and the sustained development of advanced computational methods have contributed much to recent scientific progress. These developments present new challenges driven by the sheer amount of calculations and data to manage. Next-generation exascale supercomputers will harden these challenges, such that automated and scalable solutions become crucial.…
▽ More
The ever-growing availability of computing power and the sustained development of advanced computational methods have contributed much to recent scientific progress. These developments present new challenges driven by the sheer amount of calculations and data to manage. Next-generation exascale supercomputers will harden these challenges, such that automated and scalable solutions become crucial. In recent years, we have been developing AiiDA (http://www.aiida.net), a robust open-source high-throughput infrastructure addressing the challenges arising from the needs of automated workflow management and data provenance recording. Here, we introduce developments and capabilities required to reach sustained performance, with AiiDA supporting throughputs of tens of thousands processes/hour, while automatically preserving and storing the full data provenance in a relational database making it queryable and traversable, thus enabling high-performance data analytics. AiiDA's workflow language provides advanced automation, error handling features and a flexible plugin model to allow interfacing with any simulation software. The associated plugin registry enables seamless sharing of extensions, empowering a vibrant user community dedicated to making simulations more robust, user-friendly and reproducible.
△ Less
Submitted 24 March, 2020;
originally announced March 2020.