-
Thermal Conductivity Predictions with Foundation Atomistic Models
Authors:
Balázs Póta,
Paramvir Ahlawat,
Gábor Csányi,
Michele Simoncelli
Abstract:
Recent advances in machine learning have led to the development of foundation models for atomistic materials chemistry, enabling quantum-accurate descriptions of interatomic forces across diverse compounds at reduced computational cost. Hitherto, these models have been benchmarked mostly on harmonic phonons, and their accuracy and efficiency in predicting complex, technologically relevant anharmon…
▽ More
Recent advances in machine learning have led to the development of foundation models for atomistic materials chemistry, enabling quantum-accurate descriptions of interatomic forces across diverse compounds at reduced computational cost. Hitherto, these models have been benchmarked mostly on harmonic phonons, and their accuracy and efficiency in predicting complex, technologically relevant anharmonic heat-conduction properties remains unknown. Here, we introduce a framework that leverages foundation models and the Wigner formulation of heat transport to overcome the major bottlenecks of current techniques for designing heat-management materials, such as high cost, limited transferability, or lack of physics awareness. We present the standards needed to achieve first-principles accuracy in conductivity predictions through foundational model fine-tuning, introducing suitable benchmark metrics and discussing the precision/cost trade-off. We apply our framework to a database of solids with diverse compositions and structures, demonstrating its potential to discover materials for next-gen technologies ranging from thermal insulation to neuromorphic computing.
△ Less
Submitted 29 August, 2024; v1 submitted 1 August, 2024;
originally announced August 2024.
-
Self-consistent Coulomb interactions for machine learning interatomic potentials
Authors:
Jack Thomas,
William J. Baldwin,
Gábor Csányi,
Christoph Ortner
Abstract:
A ubiquitous approach to obtain transferable machine learning-based models of potential energy surfaces for atomistic systems is to decompose the total energy into a sum of local atom-centred contributions. However, in many systems non-negligible long-range electrostatic effects must be taken into account as well. We introduce a general mathematical framework to study how such long-range effects c…
▽ More
A ubiquitous approach to obtain transferable machine learning-based models of potential energy surfaces for atomistic systems is to decompose the total energy into a sum of local atom-centred contributions. However, in many systems non-negligible long-range electrostatic effects must be taken into account as well. We introduce a general mathematical framework to study how such long-range effects can be included in a way that (i) allows charge equilibration and (ii) retains the locality of the learnable atom-centred contributions to ensure transferability. Our results give partial explanations for the success of existing machine learned potentials that include equilibriation and provide perspectives how to design such schemes in a systematic way. To complement the rigorous theoretical results, we describe a practical scheme for fitting the energy and electron density of water clusters.
△ Less
Submitted 16 June, 2024;
originally announced June 2024.
-
Data-efficient fine-tuning of foundational models for first-principles quality sublimation enthalpies
Authors:
Harveen Kaur,
Flaviano Della Pia,
Ilyes Batatia,
Xavier R. Advincula,
Benjamin X. Shi,
Jinggang Lan,
Gábor Csányi,
Angelos Michaelides,
Venkat Kapil
Abstract:
Calculating sublimation enthalpies of molecular crystal polymorphs is relevant to a wide range of technological applications. However, predicting these quantities at first-principles accuracy -- even with the aid of machine learning potentials -- is a challenge that requires sub-kJ/mol accuracy in the potential energy surface and finite-temperature sampling. We present an accurate and data-efficie…
▽ More
Calculating sublimation enthalpies of molecular crystal polymorphs is relevant to a wide range of technological applications. However, predicting these quantities at first-principles accuracy -- even with the aid of machine learning potentials -- is a challenge that requires sub-kJ/mol accuracy in the potential energy surface and finite-temperature sampling. We present an accurate and data-efficient protocol based on fine-tuning of the foundational MACE-MP-0 model and showcase its capabilities on sublimation enthalpies and physical properties of ice polymorphs. Our approach requires only a few tens of training structures to achieve sub-kJ/mol accuracy in the sublimation enthalpies and sub 1 % error in densities for polymorphs at finite temperature and pressure. Exploiting this data efficiency, we explore simulations of hexagonal ice at the random phase approximation level of theory at experimental temperatures and pressures, calculating its physical properties, like pair correlation function and density, with good agreement with experiments. Our approach provides a way forward for predicting the stability of molecular crystals at finite thermodynamic conditions with the accuracy of correlated electronic structure theory.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Accurate Crystal Structure Prediction of New 2D Hybrid Organic Inorganic Perovskites
Authors:
Nima Karimitari,
William J. Baldwin,
Evan W. Muller,
Zachary J. L. Bare,
W. Joshua Kennedy,
Gábor Csányi,
Christopher Sutton
Abstract:
Low dimensional hybrid organic-inorganic perovskites (HOIPs) represent a promising class of electronically active materials for both light absorption and emission. The design space of HOIPs is extremely large, since a diverse space of organic cations can be combined with different inorganic frameworks. This immense design space allows for tunable electronic and mechanical properties, but also nece…
▽ More
Low dimensional hybrid organic-inorganic perovskites (HOIPs) represent a promising class of electronically active materials for both light absorption and emission. The design space of HOIPs is extremely large, since a diverse space of organic cations can be combined with different inorganic frameworks. This immense design space allows for tunable electronic and mechanical properties, but also necessitates the development of new tools for in silico high throughput analysis of candidate structures. In this work, we present an accurate, efficient, transferable and widely applicable machine learning interatomic potential (MLIP) for predicting the structure of new 2D HOIPs. Using the MACE architecture, an MLIP is trained on 86 diverse experimentally reported HOIP structures. The model is tested on 73 unseen perovskite compositions, and achieves chemical accuracy with respect to the reference electronic structure method. Our model is then combined with a simple random structure search algorithm to predict the structure of hypothetical HOIPs given only the proposed composition. Success is demonstrated by correctly and reliably recovering the crystal structure of a set of experimentally known 2D perovskites. Such a random structure search is impossible with ab initio methods due to the associated computational cost, but is relatively inexpensive with the MACE potential. Finally, the procedure is used to predict the structure formed by a new organic cation with no previously known corresponding perovskite. Laboratory synthesis of the new hybrid perovskite confirms the accuracy of our prediction. This capability, applied at scale, enables efficient screening of thousands of combinations of organic cations and inorganic layers.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
Energy-conserving equivariant GNN for elasticity of lattice architected metamaterials
Authors:
Ivan Grega,
Ilyes Batatia,
Gábor Csányi,
Sri Karlapati,
Vikram S. Deshpande
Abstract:
Lattices are architected metamaterials whose properties strongly depend on their geometrical design. The analogy between lattices and graphs enables the use of graph neural networks (GNNs) as a faster surrogate model compared to traditional methods such as finite element modelling. In this work, we generate a big dataset of structure-property relationships for strut-based lattices. The dataset is…
▽ More
Lattices are architected metamaterials whose properties strongly depend on their geometrical design. The analogy between lattices and graphs enables the use of graph neural networks (GNNs) as a faster surrogate model compared to traditional methods such as finite element modelling. In this work, we generate a big dataset of structure-property relationships for strut-based lattices. The dataset is made available to the community which can fuel the development of methods anchored in physical principles for the fitting of fourth-order tensors. In addition, we present a higher-order GNN model trained on this dataset. The key features of the model are (i) SE(3) equivariance, and (ii) consistency with the thermodynamic law of conservation of energy. We compare the model to non-equivariant models based on a number of error metrics and demonstrate its benefits in terms of predictive performance and reduced training requirements. Finally, we demonstrate an example application of the model to an architected material design task. The methods which we developed are applicable to fourth-order tensors beyond elasticity such as piezo-optical tensor etc.
△ Less
Submitted 20 March, 2024; v1 submitted 30 January, 2024;
originally announced January 2024.
-
A foundation model for atomistic materials chemistry
Authors:
Ilyes Batatia,
Philipp Benner,
Yuan Chiang,
Alin M. Elena,
Dávid P. Kovács,
Janosh Riebesell,
Xavier R. Advincula,
Mark Asta,
Matthew Avaylon,
William J. Baldwin,
Fabian Berger,
Noam Bernstein,
Arghya Bhowmik,
Samuel M. Blau,
Vlad Cărare,
James P. Darby,
Sandip De,
Flaviano Della Pia,
Volker L. Deringer,
Rokas Elijošius,
Zakariya El-Machachi,
Fabio Falcioni,
Edvin Fako,
Andrea C. Ferrari,
Annalena Genreith-Schriever
, et al. (51 additional authors not shown)
Abstract:
Machine-learned force fields have transformed the atomistic modelling of materials by enabling simulations of ab initio quality on unprecedented time and length scales. However, they are currently limited by: (i) the significant computational and human effort that must go into development and validation of potentials for each particular system of interest; and (ii) a general lack of transferabilit…
▽ More
Machine-learned force fields have transformed the atomistic modelling of materials by enabling simulations of ab initio quality on unprecedented time and length scales. However, they are currently limited by: (i) the significant computational and human effort that must go into development and validation of potentials for each particular system of interest; and (ii) a general lack of transferability from one chemical system to the next. Here, using the state-of-the-art MACE architecture we introduce a single general-purpose ML model, trained on a public database of 150k inorganic crystals, that is capable of running stable molecular dynamics on molecules and materials. We demonstrate the power of the MACE-MP-0 model - and its qualitative and at times quantitative accuracy - on a diverse set problems in the physical sciences, including the properties of solids, liquids, gases, chemical reactions, interfaces and even the dynamics of a small protein. The model can be applied out of the box and as a starting or "foundation model" for any atomistic system of interest and is thus a step towards democratising the revolution of ML force fields by lowering the barriers to entry.
△ Less
Submitted 1 March, 2024; v1 submitted 29 December, 2023;
originally announced January 2024.
-
Equivariant Matrix Function Neural Networks
Authors:
Ilyes Batatia,
Lars L. Schaaf,
Huajie Chen,
Gábor Csányi,
Christoph Ortner,
Felix A. Faber
Abstract:
Graph Neural Networks (GNNs), especially message-passing neural networks (MPNNs), have emerged as powerful architectures for learning on graphs in diverse applications. However, MPNNs face challenges when modeling non-local interactions in graphs such as large conjugated molecules, and social networks due to oversmoothing and oversquashing. Although Spectral GNNs and traditional neural networks su…
▽ More
Graph Neural Networks (GNNs), especially message-passing neural networks (MPNNs), have emerged as powerful architectures for learning on graphs in diverse applications. However, MPNNs face challenges when modeling non-local interactions in graphs such as large conjugated molecules, and social networks due to oversmoothing and oversquashing. Although Spectral GNNs and traditional neural networks such as recurrent neural networks and transformers mitigate these challenges, they often lack generalizability, or fail to capture detailed structural relationships or symmetries in the data. To address these concerns, we introduce Matrix Function Neural Networks (MFNs), a novel architecture that parameterizes non-local interactions through analytic matrix equivariant functions. Employing resolvent expansions offers a straightforward implementation and the potential for linear scaling with system size. The MFN architecture achieves stateof-the-art performance in standard graph benchmarks, such as the ZINC and TU datasets, and is able to capture intricate non-local interactions in quantum systems, paving the way to new state-of-the-art force fields.
△ Less
Submitted 30 January, 2024; v1 submitted 16 October, 2023;
originally announced October 2023.
-
Gaussian Approximation Potentials: theory, software implementation and application examples
Authors:
Sascha Klawohn,
Gábor Csányi,
James P. Darby,
James R. Kermode,
Miguel A. Caro,
Albert P. Bartók
Abstract:
Gaussian Approximation Potentials are a class of Machine Learned Interatomic Potentials routinely used to model materials and molecular systems on the atomic scale. The software implementation provides the means for both fitting models using ab initio data and using the resulting potentials in atomic simulations. Details of the GAP theory, algorithms and software are presented, together with detai…
▽ More
Gaussian Approximation Potentials are a class of Machine Learned Interatomic Potentials routinely used to model materials and molecular systems on the atomic scale. The software implementation provides the means for both fitting models using ab initio data and using the resulting potentials in atomic simulations. Details of the GAP theory, algorithms and software are presented, together with detailed usage examples to help new and existing users. We review some recent developments to the GAP framework, including MPI parallelisation of the fitting code enabling its use on thousands of CPU cores and compression of descriptors to eliminate the poor scaling with the number of different chemical elements.
△ Less
Submitted 5 October, 2023;
originally announced October 2023.
-
Efficiency, Accuracy, and Transferability of Machine Learning Potentials: Application to Dislocations and Cracks in Iron
Authors:
Lei Zhang,
Gábor Csányi,
Erik van der Giessen,
Francesco Maresca
Abstract:
Machine learning interatomic potentials (ML-IAPs) enable quantum-accurate, classical molecular dynamics simulations of large systems, beyond reach of density functional theory (DFT). Yet, their efficiency and ability to predict systems larger than DFT supercells are not fully explored, posing a question regarding transferability to large-scale simulations with defects (e.g. dislocations, cracks).…
▽ More
Machine learning interatomic potentials (ML-IAPs) enable quantum-accurate, classical molecular dynamics simulations of large systems, beyond reach of density functional theory (DFT). Yet, their efficiency and ability to predict systems larger than DFT supercells are not fully explored, posing a question regarding transferability to large-scale simulations with defects (e.g. dislocations, cracks). Here, we apply a three-step validation approach to body-centered-cubic iron. First, accuracy and efficiency are assessed by optimizing ML-IAPs based on four state-of-the-art ML packages. The Pareto front of computational speed versus testing root-mean-square-error (RMSE) is computed. Second, benchmark properties relevant to plasticity and fracture are evaluated. Their average relative error Q with respect to DFT is found to correlate with RMSE. Third, transferability of ML-IAPs to dislocations and cracks is investigated by using per-atom model uncertainty quantification. The core structures and Peierls barriers of screw, M111 and three edge dislocations are compared with DFT. Traction-separation curve and critical stress intensity factor (K_Ic) are also predicted. Cleavage on the pre-existing crack plane is found to be the zero-temperature atomistic fracture mechanism of pure body-centered-cubic iron under mode-I loading, independent of ML package and training database. Quantitative predictions of dislocation glide paths and KIc can be sensitive to database, ML package, cutoff radius, and are limited by DFT accuracy. Our results highlight the importance of validating ML-IAPs by using indicators beyond RMSE. Moreover, significant computational speed-ups can be achieved by using the most efficient ML-IAP package, yet the assessment of the accuracy and transferability should be performed with care.
△ Less
Submitted 6 November, 2023; v1 submitted 19 July, 2023;
originally announced July 2023.
-
Structural Dynamics Descriptors for Metal Halide Perovskites
Authors:
Xia Liang,
Johan Klarbring,
William Baldwin,
Zhenzhu Li,
Gábor Csányi,
Aron Walsh
Abstract:
Metal halide perovskites have shown extraordinary performance in solar energy conversion technologies. They have been classified as "soft semiconductors" due to their flexible corner-sharing octahedral networks and polymorphous nature. Understanding the local and average structures continues to be challenging for both modelling and experiments. Here, we report the quantitative analysis of structur…
▽ More
Metal halide perovskites have shown extraordinary performance in solar energy conversion technologies. They have been classified as "soft semiconductors" due to their flexible corner-sharing octahedral networks and polymorphous nature. Understanding the local and average structures continues to be challenging for both modelling and experiments. Here, we report the quantitative analysis of structural dynamics in time and space from molecular dynamics simulations of perovskite crystals. The compact descriptors provided cover a wide variety of structural properties, including octahedral tilting and distortion, local lattice parameters, molecular orientations, as well as their spatial correlation. To validate our methods, we have trained a machine learning force field (MLFF) for methylammonium lead bromide (CH$_3$NH$_3$PbBr$_3$) using an on-the-fly training approach with Gaussian process regression. The known stable phases are reproduced and we find an additional symmetry-breaking effect in the cubic and tetragonal phases close to the phase transition temperature. To test the implementation for large trajectories, we also apply it to 69,120 atom simulations for CsPbI$_3$ based on an MLFF developed using the atomic cluster expansion formalism. The structural dynamics descriptors and Python toolkit are general to perovskites and readily transferable to more complex compositions.
△ Less
Submitted 23 July, 2023; v1 submitted 19 May, 2023;
originally announced May 2023.
-
Dynamic Local Structure in Caesium Lead Iodide: Spatial Correlation and Transient Domains
Authors:
William Baldwin,
Xia Liang,
Johan Klarbring,
Milos Dubajic,
David Dell'Angelo,
Christopher Sutton,
Claudia Caddeo,
Samuel D. Stranks,
Alessandro Mattoni,
Aron Walsh,
Gábor Csányi
Abstract:
Metal halide perovskites are multifunctional semiconductors with tunable structures and properties. They are highly dynamic crystals with complex octahedral tilting patterns and strongly anharmonic atomic behaviour. In the higher temperature, higher symmetry phases of these materials, several complex structural features have been observed. The local structure can differ greatly from the average st…
▽ More
Metal halide perovskites are multifunctional semiconductors with tunable structures and properties. They are highly dynamic crystals with complex octahedral tilting patterns and strongly anharmonic atomic behaviour. In the higher temperature, higher symmetry phases of these materials, several complex structural features have been observed. The local structure can differ greatly from the average structure and there is evidence that dynamic two-dimensional structures of correlated octahedral motion form. An understanding of the underlying complex atomistic dynamics is, however, still lacking. In this work, the local structure of the inorganic perovskite CsPbI$_3$ is investigated using a new machine learning force field based on the atomic cluster expansion framework. Through analysis of the temporal and spatial correlation observed during large-scale simulations, we reveal that the low frequency motion of octahedral tilts implies a double-well effective potential landscape, even well into the cubic phase. Moreover, dynamic local regions of lower symmetry are present within both higher symmetry phases. These regions are planar and we report the length and timescales of the motion. Finally, we investigate and visualise the spatial arrangement of these features and their interactions, providing a comprehensive picture of local structure in the higher symmetry phases.
△ Less
Submitted 11 April, 2023; v1 submitted 10 April, 2023;
originally announced April 2023.
-
Tensor-reduced atomic density representations
Authors:
James P. Darby,
Dávid P. Kovács,
Ilyes Batatia,
Miguel A. Caro,
Gus L. W. Hart,
Christoph Ortner,
Gábor Csányi
Abstract:
Density based representations of atomic environments that are invariant under Euclidean symmetries have become a widely used tool in the machine learning of interatomic potentials, broader data-driven atomistic modelling and the visualisation and analysis of materials datasets.The standard mechanism used to incorporate chemical element information is to create separate densities for each element a…
▽ More
Density based representations of atomic environments that are invariant under Euclidean symmetries have become a widely used tool in the machine learning of interatomic potentials, broader data-driven atomistic modelling and the visualisation and analysis of materials datasets.The standard mechanism used to incorporate chemical element information is to create separate densities for each element and form tensor products between them. This leads to a steep scaling in the size of the representation as the number of elements increases. Graph neural networks, which do not explicitly use density representations, escape this scaling by mapping the chemical element information into a fixed dimensional space in a learnable way. We recast this approach as tensor factorisation by exploiting the tensor structure of standard neighbour density based descriptors. In doing so, we form compact tensor-reduced representations whose size does not depend on the number of chemical elements, but remain systematically convergeable and are therefore applicable to a wide range of data analysis and regression tasks.
△ Less
Submitted 6 December, 2022; v1 submitted 1 October, 2022;
originally announced October 2022.
-
Atomistic fracture in bcc iron revealed by active learning of Gaussian approximation potential
Authors:
Lei Zhang,
Gábor Csányi,
Erik van der Giessen,
Francesco Maresca
Abstract:
The prediction of atomistic fracture mechanisms in body-centred cubic (bcc) iron is essential for understanding its semi-brittle nature. Existing atomistic simulations of the crack-tip deformation mechanisms under mode-I loading based on classical interatomic potentials yield contradicting predictions. To enable fracture prediction with quantum accuracy, we develop a Gaussian approximation potenti…
▽ More
The prediction of atomistic fracture mechanisms in body-centred cubic (bcc) iron is essential for understanding its semi-brittle nature. Existing atomistic simulations of the crack-tip deformation mechanisms under mode-I loading based on classical interatomic potentials yield contradicting predictions. To enable fracture prediction with quantum accuracy, we develop a Gaussian approximation potential (GAP) using an active learning strategy by extending a density functional theory (DFT) database of ferromagnetic bcc iron. We apply the active learning algorithm and obtain a Fe GAP model with a maximum predicted error of 8 meV/atom over a broad range of stress intensity factors (SIFs) and for four crack systems. The learning efficiency of the approach is analysed, and the predicted critical SIFs are compared with Griffith and Rice theories. The simulations reveal that cleavage along the original crack plane is the crack tip mechanism for {100} and {110} crack planes at T=0K, thus settling a long-standing dispute. Our work also highlights the need for a multiscale approach to predicting fracture and intrinsic ductility, whereby finite temperature, finite loading rate effects and pre-existing defects (e.g. nanovoids, dislocations) should be taken explicitly into account.
△ Less
Submitted 14 September, 2022; v1 submitted 11 August, 2022;
originally announced August 2022.
-
MACE: Higher Order Equivariant Message Passing Neural Networks for Fast and Accurate Force Fields
Authors:
Ilyes Batatia,
Dávid Péter Kovács,
Gregor N. C. Simm,
Christoph Ortner,
Gábor Csányi
Abstract:
Creating fast and accurate force fields is a long-standing challenge in computational chemistry and materials science. Recently, several equivariant message passing neural networks (MPNNs) have been shown to outperform models built using other approaches in terms of accuracy. However, most MPNNs suffer from high computational cost and poor scalability. We propose that these limitations arise becau…
▽ More
Creating fast and accurate force fields is a long-standing challenge in computational chemistry and materials science. Recently, several equivariant message passing neural networks (MPNNs) have been shown to outperform models built using other approaches in terms of accuracy. However, most MPNNs suffer from high computational cost and poor scalability. We propose that these limitations arise because MPNNs only pass two-body messages leading to a direct relationship between the number of layers and the expressivity of the network. In this work, we introduce MACE, a new equivariant MPNN model that uses higher body order messages. In particular, we show that using four-body messages reduces the required number of message passing iterations to just two, resulting in a fast and highly parallelizable model, reaching or exceeding state-of-the-art accuracy on the rMD17, 3BPA, and AcAc benchmark tasks. We also demonstrate that using higher order messages leads to an improved steepness of the learning curves.
△ Less
Submitted 26 January, 2023; v1 submitted 15 June, 2022;
originally announced June 2022.
-
Nested sampling for physical scientists
Authors:
Greg Ashton,
Noam Bernstein,
Johannes Buchner,
Xi Chen,
Gábor Csányi,
Andrew Fowlie,
Farhan Feroz,
Matthew Griffiths,
Will Handley,
Michael Habeck,
Edward Higson,
Michael Hobson,
Anthony Lasenby,
David Parkinson,
Livia B. Pártay,
Matthew Pitkin,
Doris Schneider,
Joshua S. Speagle,
Leah South,
John Veitch,
Philipp Wacker,
David J. Wales,
David Yallup
Abstract:
We review Skilling's nested sampling (NS) algorithm for Bayesian inference and more broadly multi-dimensional integration. After recapitulating the principles of NS, we survey developments in implementing efficient NS algorithms in practice in high-dimensions, including methods for sampling from the so-called constrained prior. We outline the ways in which NS may be applied and describe the applic…
▽ More
We review Skilling's nested sampling (NS) algorithm for Bayesian inference and more broadly multi-dimensional integration. After recapitulating the principles of NS, we survey developments in implementing efficient NS algorithms in practice in high-dimensions, including methods for sampling from the so-called constrained prior. We outline the ways in which NS may be applied and describe the application of NS in three scientific fields in which the algorithm has proved to be useful: cosmology, gravitational-wave astronomy, and materials science. We close by making recommendations for best practice when using NS and by summarizing potential limitations and optimizations of NS.
△ Less
Submitted 31 May, 2022;
originally announced May 2022.
-
Multilayer atomic cluster expansion for semi-local interactions
Authors:
Anton Bochkarev,
Yury Lysogorskiy,
Christoph Ortner,
Gábor Csányi,
Ralf Drautz
Abstract:
Traditionally, interatomic potentials assume local bond formation supplemented by long-range electrostatic interactions when necessary. This ignores intermediate range multi-atom interactions that arise from the relaxation of the electronic structure. Here, we present the multilayer atomic cluster expansion (ml-ACE) that includes collective, semi-local multi-atom interactions naturally within its…
▽ More
Traditionally, interatomic potentials assume local bond formation supplemented by long-range electrostatic interactions when necessary. This ignores intermediate range multi-atom interactions that arise from the relaxation of the electronic structure. Here, we present the multilayer atomic cluster expansion (ml-ACE) that includes collective, semi-local multi-atom interactions naturally within its remit. We demonstrate that ml-ACE significantly improves fit accuracy compared to a local expansion on selected examples and provide physical intuition to understand this improvement.
△ Less
Submitted 17 May, 2022;
originally announced May 2022.
-
The Design Space of E(3)-Equivariant Atom-Centered Interatomic Potentials
Authors:
Ilyes Batatia,
Simon Batzner,
Dávid Péter Kovács,
Albert Musaelian,
Gregor N. C. Simm,
Ralf Drautz,
Christoph Ortner,
Boris Kozinsky,
Gábor Csányi
Abstract:
The rapid progress of machine learning interatomic potentials over the past couple of years produced a number of new architectures. Particularly notable among these are the Atomic Cluster Expansion (ACE), which unified many of the earlier ideas around atom density-based descriptors, and Neural Equivariant Interatomic Potentials (NequIP), a message passing neural network with equivariant features t…
▽ More
The rapid progress of machine learning interatomic potentials over the past couple of years produced a number of new architectures. Particularly notable among these are the Atomic Cluster Expansion (ACE), which unified many of the earlier ideas around atom density-based descriptors, and Neural Equivariant Interatomic Potentials (NequIP), a message passing neural network with equivariant features that showed state of the art accuracy. In this work, we construct a mathematical framework that unifies these models: ACE is generalised so that it can be recast as one layer of a multi-layer architecture. From another point of view, the linearised version of NequIP is understood as a particular sparsification of a much larger polynomial model. Our framework also provides a practical tool for systematically probing different choices in the unified design space. We demonstrate this by an ablation study of NequIP via a set of experiments looking at in- and out-of-domain accuracy and smooth extrapolation very far from the training data, and shed some light on which design choices are critical for achieving high accuracy. Finally, we present BOTNet (Body-Ordered-Tensor-Network), a much-simplified version of NequIP, which has an interpretable architecture and maintains accuracy on benchmark datasets.
△ Less
Submitted 24 November, 2022; v1 submitted 13 May, 2022;
originally announced May 2022.
-
Compressing local atomic neighbourhood descriptors
Authors:
James P. Darby,
James R. Kermode,
Gábor Csányi
Abstract:
Many atomic descriptors are currently limited by their unfavourable scaling with the number of chemical elements $S$ e.g. the length of body-ordered descriptors, such as the Smooth Overlap of Atomic Positions (SOAP) power spectrum (3-body) and the Atomic Cluster Expansion (ACE) (multiple body-orders), scales as $(NS)^ν$ where $ν+1$ is the body-order and $N$ is the number of radial basis functions…
▽ More
Many atomic descriptors are currently limited by their unfavourable scaling with the number of chemical elements $S$ e.g. the length of body-ordered descriptors, such as the Smooth Overlap of Atomic Positions (SOAP) power spectrum (3-body) and the Atomic Cluster Expansion (ACE) (multiple body-orders), scales as $(NS)^ν$ where $ν+1$ is the body-order and $N$ is the number of radial basis functions used in the density expansion. We introduce two distinct approaches which can be used to overcome this scaling for the SOAP power spectrum. Firstly, we show that the power spectrum is amenable to lossless compression with respect to both $S$ and $N$, so that the descriptor length can be reduced from $\mathcal{O}(N^2S^2)$ to $\mathcal{O}\left(NS\right)$. Secondly, we introduce a generalized SOAP kernel, where compression is achieved through the use of the total, element agnostic density, in combination with radial projection. The ideas used in the generalized kernel are equally applicably to any other body-ordered descriptors and we demonstrate this for the Atom Centered Symmetry Functions (ACSF). Finally, both compression approaches are shown to offer comparable performance to the original descriptor across a variety of numerical tests.
△ Less
Submitted 24 December, 2021;
originally announced December 2021.
-
A Gaussian Approximation Potential for Amorphous Si:H
Authors:
Davis Unruh,
Reza Vatan Meidanshahi,
Stephen M. Goodnick,
Gábor Csányi,
Gergely T. Zimányi
Abstract:
Hydrogenation of amorphous silicon (a-Si:H) is critical for reducing defect densities, passivating mid-gap states and surfaces, and improving photoconductivity in silicon-based electro-optical devices. Modelling the atomic scale structure of this material is critical to understanding these processes, which in turn is needed to describe c-Si/a-Si:H heterjunctions that are at the heart of the modern…
▽ More
Hydrogenation of amorphous silicon (a-Si:H) is critical for reducing defect densities, passivating mid-gap states and surfaces, and improving photoconductivity in silicon-based electro-optical devices. Modelling the atomic scale structure of this material is critical to understanding these processes, which in turn is needed to describe c-Si/a-Si:H heterjunctions that are at the heart of the modern solar cells with world record efficiency. Density functional theory (DFT) studies achieve the required high accuracy but are limited to moderate system sizes a hundred atoms or so by their high computational cost. Simulations of amorphous materials in particular have been hindered by this high cost because large structural models are required to capture the medium range order that is characteristic of such materials. Empirical potential models are much faster, but their accuracy is not sufficient to correctly describe the frustrated local structure. Data driven, "machine learned" interatomic potentials have broken this impasse, and have been highly successful in describing a variety of amorphous materials in their elemental phase. Here we extend the Gaussian approximation potential (GAP) for silicon by incorporating the interaction with hydrogen, thereby significantly improving the degree of realism with which amorphous silicon can be modelled. We show that our Si:H GAP enables the simulation of hydrogenated silicon with an accuracy very close to DFT, but with computational expense and run times reduced by several orders of magnitude for large structures. We demonstrate the capabilities of the Si:H GAP by creating models of hydrogenated liquid and amorphous silicon, and showing that their energies, forces and stresses are in excellent agreement with DFT results, and their structure as captured by bond and angle distributions, with both DFT and experiments.
△ Less
Submitted 5 January, 2022; v1 submitted 5 June, 2021;
originally announced June 2021.
-
Machine learning force fields based on local parametrization of dispersion interactions: Application to the phase diagram of C$_{60}$
Authors:
Heikki Muhli,
Xi Chen,
Albert P. Bartók,
Patricia Hernández-León,
Gábor Csányi,
Tapio Ala-Nissila,
Miguel A. Caro
Abstract:
We present a comprehensive methodology to enable addition of van der Waals (vdW) corrections to machine learning (ML) atomistic force fields. Using a Gaussian approximation potential (GAP) [Bartók et al., Phys. Rev. Lett. 104, 136403 (2010)] as baseline, we accurately machine learn a local model of atomic polarizabilities based on Hirshfeld volume partitioning of the charge density [Tkatchenko and…
▽ More
We present a comprehensive methodology to enable addition of van der Waals (vdW) corrections to machine learning (ML) atomistic force fields. Using a Gaussian approximation potential (GAP) [Bartók et al., Phys. Rev. Lett. 104, 136403 (2010)] as baseline, we accurately machine learn a local model of atomic polarizabilities based on Hirshfeld volume partitioning of the charge density [Tkatchenko and Scheffler, Phys. Rev. Lett. 102, 073005 (2009)]. These environment-dependent polarizabilities are then used to parametrize a screened London-dispersion approximation to the vdW interactions. Our ML vdW model only needs to learn the charge density partitioning implicitly, by learning the reference Hirshfeld volumes from density functional theory (DFT). In practice, we can predict accurate Hirshfeld volumes from the knowledge of the local atomic environment (atomic positions) alone, making the model highly computationally efficient. For additional efficiency, our ML model of atomic polarizabilities reuses the same many-body atomic descriptors used for the underlying GAP learning of bonded interatomic interactions. We also show how the method enables straightforward computation of gradients of the observables, even when these remain challenging for the reference method (e.g., calculating gradients of the Hirshfeld volumes in DFT). Finally, we demonstrate the approach by studying the phase diagram of C$_{60}$, where vdW effects are important. The need for a highly accurate vdW-inclusive reactive force field is highlighted by modeling the decomposition of the C$_{60}$ molecules taking place at high pressures and temperatures.
△ Less
Submitted 10 August, 2021; v1 submitted 6 May, 2021;
originally announced May 2021.
-
Performant implementation of the atomic cluster expansion (PACE): Application to copper and silicon
Authors:
Yury Lysogorskiy,
Cas van der Oord,
Anton Bochkarev,
Sarath Menon,
Matteo Rinaldi,
Thomas Hammerschmidt,
Matous Mrovec,
Aidan Thompson,
Gábor Csányi,
Christoph Ortner,
Ralf Drautz
Abstract:
The atomic cluster expansion is a general polynomial expansion of the atomic energy in multi-atom basis functions. Here we implement the atomic cluster expansion in the performant C++ code \verb+PACE+ that is suitable for use in large scale atomistic simulations. We briefly review the atomic cluster expansion and give detailed expressions for energies and forces as well as efficient algorithms for…
▽ More
The atomic cluster expansion is a general polynomial expansion of the atomic energy in multi-atom basis functions. Here we implement the atomic cluster expansion in the performant C++ code \verb+PACE+ that is suitable for use in large scale atomistic simulations. We briefly review the atomic cluster expansion and give detailed expressions for energies and forces as well as efficient algorithms for their evaluation. We demonstrate that the atomic cluster expansion as implemented in \verb+PACE+ shifts a previously established Pareto front for machine learning interatomic potentials towards faster and more accurate calculations. Moreover, general purpose parameterizations are presented for copper and silicon and evaluated in detail. We show that the new Cu and Si potentials significantly improve on the best available potentials for highly accurate large-scale atomistic simulations.
△ Less
Submitted 1 March, 2021;
originally announced March 2021.
-
Predicting polarizabilities of silicon clusters using local chemical environments
Authors:
Mario G. Zauchner,
Stefano Dal Forno,
Gábor Cśanyi,
Andrew Horsfield,
Johannes Lischner
Abstract:
Calculating polarizabilities of large clusters with first-principles techniques is challenging because of the unfavorable scaling of computational cost with cluster size. To address this challenge, we demonstrate that polarizabilities of large hydrogenated silicon clusters containing thousands of atoms can be efficiently calculated with machine learning methods. Specifically, we construct machine…
▽ More
Calculating polarizabilities of large clusters with first-principles techniques is challenging because of the unfavorable scaling of computational cost with cluster size. To address this challenge, we demonstrate that polarizabilities of large hydrogenated silicon clusters containing thousands of atoms can be efficiently calculated with machine learning methods. Specifically, we construct machine learning models based on the smooth overlap of atomic positions (SOAP) descriptor and train the models using a database of calculated random-phase approximation polarizabilities for clusters containing up to 110 silicon atoms. We first demonstrate the ability of the machine learning models to fit the data and then assess their ability to predict cluster polarizabilities using k-fold cross validation. Finally, we study the machine learning predictions for clusters that are too large for explicit first-principles calculations and find that they accurately describe the dependence of the polarizabilities on the ratio of hydrogen to silicon atoms and also predict a bulk limit that is in good agreement with previous studies.
△ Less
Submitted 25 August, 2021; v1 submitted 11 January, 2021;
originally announced January 2021.
-
An Experimentally Driven Automated Machine Learned lnter-Atomic Potential for a Refractory Oxide
Authors:
Ganesh Sivaraman,
Leighanne Gallington,
Anand Narayanan Krishnamoorthy,
Marius Stan,
Gabor Csanyi,
Alvaro Vazquez-Mayagoitia,
Chris J. Benmore
Abstract:
Understanding the structure and properties of refractory oxides are critical for high temperature applications. In this work, a combined experimental and simulation approach uses an automated closed loop via an active-learner, which is initialized by X-ray and neutron diffraction measurements, and sequentially improves a machine-learning model until the experimentally predetermined phase space is…
▽ More
Understanding the structure and properties of refractory oxides are critical for high temperature applications. In this work, a combined experimental and simulation approach uses an automated closed loop via an active-learner, which is initialized by X-ray and neutron diffraction measurements, and sequentially improves a machine-learning model until the experimentally predetermined phase space is covered. A multi-phase potential is generated for a canonical example of the archetypal refractory oxide, HfO2, by drawing a minimum number of training configurations from room temperature to the liquid state at ~2900oC. The method significantly reduces model development time and human effort.
△ Less
Submitted 8 September, 2020;
originally announced September 2020.
-
An Accurate and Transferable Machine Learning Potential for Carbon
Authors:
Patrick Rowe,
Volker L Deringer,
Piero Gasparotto,
Gábor Csányi,
Angelos Michaelides
Abstract:
We present an accurate machine learning (ML) model for atomistic simulations of carbon, constructed using the Gaussian approximation potential (GAP) methodology. The potential, named GAP-20, describes the properties of the bulk crystalline and amorphous phases, crystal surfaces and defect structures with an accuracy approaching that of direct ab initio simulation, but at a significantly reduced co…
▽ More
We present an accurate machine learning (ML) model for atomistic simulations of carbon, constructed using the Gaussian approximation potential (GAP) methodology. The potential, named GAP-20, describes the properties of the bulk crystalline and amorphous phases, crystal surfaces and defect structures with an accuracy approaching that of direct ab initio simulation, but at a significantly reduced cost. We combine structural databases for amorphous carbon and graphene, which we extend substantially by adding suitable configurations, for example, for defects in graphene and other nanostructures. The final potential is fitted to reference data computed using the optB88-vdW density functional theory (DFT) functional. Dispersion interactions, which are crucial to describe multilayer carbonaceous materials, are therefore implicitly included. We additionally account for long-range dispersion interactions using a semianalytical two-body term and show that an improved model can be obtained through an optimisation of the many-body smooth overlap of atomic positions (SOAP) descriptor. We rigorously test the potential on lattice parameters, bond lengths, formation energies and phonon dispersions of numerous carbon allotropes. We compare the formation energies of an extensive set of defect structures, surfaces and surface reconstructions to DFT reference calculations. The present work demonstrates the ability to combine, in the same ML model, the previously attained flexibility required for amorphous carbon [Phys. Rev. B, 95, 094203, (2017)] with the high numerical accuracy necessary for crystalline graphene [Phys. Rev. B, 97, 054303, (2018)], thereby providing an interatomic potential that will be applicable to a wide range of applications concerning diverse forms of bulk and nanostructured carbon.
△ Less
Submitted 24 June, 2020;
originally announced June 2020.
-
Learning the electronic density of states in condensed matter
Authors:
Chiheb Ben Mahmoud,
Andrea Anelli,
Gábor Csányi,
Michele Ceriotti
Abstract:
The electronic density of states (DOS) quantifies the distribution of the energy levels that can be occupied by electrons in a quasiparticle picture, and is central to modern electronic structure theory. It also underpins the computation and interpretation of experimentally observable material properties such as optical absorption and electrical conductivity. We discuss the challenges inherent in…
▽ More
The electronic density of states (DOS) quantifies the distribution of the energy levels that can be occupied by electrons in a quasiparticle picture, and is central to modern electronic structure theory. It also underpins the computation and interpretation of experimentally observable material properties such as optical absorption and electrical conductivity. We discuss the challenges inherent in the construction of a machine-learning (ML) framework aimed at predicting the DOS as a combination of local contributions that depend in turn on the geometric configuration of neighbours around each atom, using quasiparticle energy levels from density functional theory as training data. We present a challenging case study that includes configurations of silicon spanning a broad set of thermodynamic conditions, ranging from bulk structures to clusters, and from semiconducting to metallic behavior. We compare different approaches to represent the DOS, and the accuracy of predicting quantities such as the Fermi level, the DOS at the Fermi level, or the band energy, either directly or as a side-product of the evaluation of the DOS. The performance of the model depends crucially on the smoothening of the DOS, and there is a tradeoff to be made between the systematic error associated with the smoothening and the error in the ML model for a specific structure. We demonstrate the usefulness of this approach by computing the density of states of a large amorphous silicon sample, for which it would be prohibitively expensive to compute the DOS by direct electronic structure calculations, and show how the atom-centred decomposition of the DOS that is obtained through our model can be used to extract physical insights into the connections between structural and electronic features.
△ Less
Submitted 12 November, 2020; v1 submitted 21 June, 2020;
originally announced June 2020.
-
Machine learning driven simulated deposition of carbon films: from low-density to diamondlike amorphous carbon
Authors:
Miguel A. Caro,
Gábor Csányi,
Tomi Laurila,
Volker L. Deringer
Abstract:
Amorphous carbon (a-C) materials have diverse interesting and useful properties, but the understanding of their atomic-scale structures is still incomplete. Here, we report on extensive atomistic simulations of the deposition and growth of a-C films, describing interatomic interactions using a machine learning (ML) based Gaussian Approximation Potential (GAP) model. We expand widely on our initial…
▽ More
Amorphous carbon (a-C) materials have diverse interesting and useful properties, but the understanding of their atomic-scale structures is still incomplete. Here, we report on extensive atomistic simulations of the deposition and growth of a-C films, describing interatomic interactions using a machine learning (ML) based Gaussian Approximation Potential (GAP) model. We expand widely on our initial work [Phys. Rev. Lett. 120, 166101 (2018)] by now considering a broad range of incident ion energies, thus modeling samples that span the entire range from low-density ($sp^{2}$-rich) to high-density ($sp^{3}$-rich, "diamond-like") amorphous forms of carbon. Two different mechanisms are observed in these simulations, depending on the impact energy: low-energy impacts induce $sp$- and $sp^{2}$-dominated growth directly around the impact site, whereas high-energy impacts induce peening. Furthermore, we propose and apply a scheme for computing the anisotropic elastic properties of the a-C films. Our work provides fundamental insight into this intriguing class of disordered solids, as well as a conceptual and methodological blueprint for simulating the atomic-scale deposition of other materials with ML-driven molecular dynamics.
△ Less
Submitted 4 November, 2020; v1 submitted 17 June, 2020;
originally announced June 2020.
-
Combining phonon accuracy with high transferability in Gaussian approximation potential models
Authors:
Janine George,
Geoffroy Hautier,
Albert P. Bartók,
Gábor Csányi,
Volker L. Deringer
Abstract:
Machine learning driven interatomic potentials, including Gaussian approximation potential (GAP) models, are emerging tools for atomistic simulations. Here, we address the methodological question of how one can fit GAP models that accurately predict vibrational properties in specific regions of configuration space, whilst retaining flexibility and transferability to others. We use an adaptive regu…
▽ More
Machine learning driven interatomic potentials, including Gaussian approximation potential (GAP) models, are emerging tools for atomistic simulations. Here, we address the methodological question of how one can fit GAP models that accurately predict vibrational properties in specific regions of configuration space, whilst retaining flexibility and transferability to others. We use an adaptive regularization of the GAP fit that scales with the absolute force magnitude on any given atom, thereby exploring the Bayesian interpretation of GAP regularization as an "expected error", and its impact on the prediction of physical properties for a material of interest. The approach enables excellent predictions of phonon modes (to within 0.1-0.2 THz) for structurally diverse silicon allotropes, and it can be coupled with existing fitting databases for high transferability. These findings and workflows are expected to be useful for GAP-driven materials modeling more generally.
△ Less
Submitted 14 May, 2020;
originally announced May 2020.
-
Gaussian Process States: A data-driven representation of quantum many-body physics
Authors:
Aldo Glielmo,
Yannic Rath,
Gabor Csanyi,
Alessandro De Vita,
George H. Booth
Abstract:
We present a novel, non-parametric form for compactly representing entangled many-body quantum states, which we call a `Gaussian Process State'. In contrast to other approaches, we define this state explicitly in terms of a configurational data set, with the probability amplitudes statistically inferred from this data according to Bayesian statistics. In this way the non-local physical correlated…
▽ More
We present a novel, non-parametric form for compactly representing entangled many-body quantum states, which we call a `Gaussian Process State'. In contrast to other approaches, we define this state explicitly in terms of a configurational data set, with the probability amplitudes statistically inferred from this data according to Bayesian statistics. In this way the non-local physical correlated features of the state can be analytically resummed, allowing for exponential complexity to underpin the ansatz, but efficiently represented in a small data set. The state is found to be highly compact, systematically improvable and efficient to sample, representing a large number of known variational states within its span. It is also proven to be a `universal approximator' for quantum states, able to capture any entangled many-body state with increasing data set size. We develop two numerical approaches which can learn this form directly: a fragmentation approach, and direct variational optimization, and apply these schemes to the Fermionic Hubbard model. We find competitive or superior descriptions of correlated quantum problems compared to existing state-of-the-art variational ansatzes, as well as other numerical methods.
△ Less
Submitted 17 September, 2020; v1 submitted 27 February, 2020;
originally announced February 2020.
-
On the Completeness of Atomic Structure Representations
Authors:
Sergey N. Pozdnyakov,
Michael J. Willatt,
Albert P. Bartók,
Christoph Ortner,
Gábor Csányi,
Michele Ceriotti
Abstract:
Many-body descriptors are widely used to represent atomic environments in the construction of machine learned interatomic potentials and more broadly for fitting, classification and embedding tasks on atomic structures. It was generally believed that 3-body descriptors uniquely specify the environment of an atom, up to a rotation and permutation of like atoms. We produce several counterexamples to…
▽ More
Many-body descriptors are widely used to represent atomic environments in the construction of machine learned interatomic potentials and more broadly for fitting, classification and embedding tasks on atomic structures. It was generally believed that 3-body descriptors uniquely specify the environment of an atom, up to a rotation and permutation of like atoms. We produce several counterexamples to this belief, with the consequence that any classifier, regression or embedding model for atom-centred properties that uses 3 (or 4)-body features will incorrectly give identical results for different configurations. Writing global properties (such as total energies) as a sum of many atom-centred contributions mitigates, but does not eliminate, the impact of this fundamental deficiency -- explaining the success of current "machine-learning" force fields. We anticipate the issues that will arise as the desired accuracy increases, and suggest potential solutions.
△ Less
Submitted 5 June, 2020; v1 submitted 31 January, 2020;
originally announced January 2020.
-
Structural transitions in dense disordered silicon from quantum-accurate ultra-large-scale simulations
Authors:
Volker L. Deringer,
Noam Bernstein,
Gábor Csányi,
Mark Wilson,
David A. Drabold,
Stephen R. Elliott
Abstract:
Structurally disordered materials continue to pose fundamental questions, including that of how different disordered phases ("polyamorphs") can coexist and transform from one to another. As a widely studied case, amorphous silicon (a-Si) forms a fourfold-coordinated, covalent random network at ambient conditions, but much higher-coordinated, metallic-like phases under pressure. However, a detailed…
▽ More
Structurally disordered materials continue to pose fundamental questions, including that of how different disordered phases ("polyamorphs") can coexist and transform from one to another. As a widely studied case, amorphous silicon (a-Si) forms a fourfold-coordinated, covalent random network at ambient conditions, but much higher-coordinated, metallic-like phases under pressure. However, a detailed mechanistic understanding of the liquid-amorphous and amorphous-amorphous transitions in silicon has been lacking, due to intrinsic limitations of even the most advanced experimental and computational techniques. Here, we show how machine-learning (ML)-driven simulations can break through this long-standing barrier, affording a comprehensive, quantum-accurate, and fully atomistic description of all relevant liquid and amorphous phases of silicon. Combining a model system size of 100,000 atoms (ten-nanometre length scale) with a prediction accuracy of a few meV per atom, our simulations reveal a remarkable, three-step transformation sequence for a-Si under increasing external pressure. First, up to 10-11 GPa, polyamorphic low- and high-density amorphous (LDA and HDA) regions are found to coexist, rather than appearing sequentially. Then, we observe a structural collapse into a distinct, very-high-density amorphous (VHDA) phase at 12-13 GPa, reminiscent of the dense liquid but being formed at a much lower temperature. Finally, our simulations indicate the transient nature of this VHDA phase: it rapidly nucleates crystallites at 13-16 GPa, ultimately leading to the formation of a poly-crystalline, simple-hexagonal structure, consistent with experiments but not seen in earlier simulations.
△ Less
Submitted 16 December, 2019;
originally announced December 2019.
-
Machine Learning Inter-Atomic Potentials Generation Driven by Active Learning: A Case Study for Amorphous and Liquid Hafnium dioxide
Authors:
Ganesh Sivaraman,
Anand Narayanan Krishnamoorthy,
Matthias Baur,
Christian Holm,
Marius Stan,
Gabor Csányi,
Chris Benmore,
Álvaro Vázquez-Mayagoitia
Abstract:
We propose a novel active learning scheme for automatically sampling a minimum number of uncorrelated configurations for fitting the Gaussian Approximation Potential (GAP). Our active learning scheme consists of an unsupervised machine learning (ML) scheme coupled to Bayesian optimization technique that evaluates the GAP model. We apply this scheme to a Hafnium dioxide (HfO2) dataset generated fro…
▽ More
We propose a novel active learning scheme for automatically sampling a minimum number of uncorrelated configurations for fitting the Gaussian Approximation Potential (GAP). Our active learning scheme consists of an unsupervised machine learning (ML) scheme coupled to Bayesian optimization technique that evaluates the GAP model. We apply this scheme to a Hafnium dioxide (HfO2) dataset generated from a melt-quench ab initio molecular dynamics (AIMD) protocol. Our results show that the active learning scheme, with no prior knowledge of the dataset is able to extract a configuration that reaches the required energy fit tolerance. Further, molecular dynamics (MD) simulations performed using this active learned GAP model on 6144-atom systems of amorphous and liquid state elucidate the structural properties of HfO2 with near ab initio precision and quench rates (i.e. 1.0 K/ps) not accessible via AIMD. The melt and amorphous x-ray structural factors generated from our simulation are in good agreement with experiment. Additionally, the calculated diffusion constants are in good agreement with previous ab initio studies.
△ Less
Submitted 22 October, 2019;
originally announced October 2019.
-
Regularised Atomic Body-Ordered Permutation-Invariant Polynomials for the Construction of Interatomic Potentials
Authors:
Cas van der Oord,
Geneviève Dusson,
Gabor Csanyi,
Christoph Ortner
Abstract:
We investigate the use of invariant polynomials in the construction of data-driven interatomic potentials for material systems. The "atomic body-ordered permutation-invariant polynomials" (aPIPs) comprise a systematic basis and are constructed to preserve the symmetry of the potential energy function with respect to rotations and permutations. In contrast to kernel based and artificial neural netw…
▽ More
We investigate the use of invariant polynomials in the construction of data-driven interatomic potentials for material systems. The "atomic body-ordered permutation-invariant polynomials" (aPIPs) comprise a systematic basis and are constructed to preserve the symmetry of the potential energy function with respect to rotations and permutations. In contrast to kernel based and artificial neural network models, the explicit decomposition of the total energy as a sum of atomic body-ordered terms allows to keep the dimensionality of the fit reasonably low, up to just 10 for the 5-body terms. The explainability of the potential is aided by this decomposition, as the low body-order components can be studied and interpreted independently. Moreover, although polynomial basis functions are thought to extrapolate poorly, we show that the low dimensionality combined with careful regularisation actually leads to better transferability than the high dimensional, kernel based Gaussian Approximation Potential.
△ Less
Submitted 14 October, 2019;
originally announced October 2019.
-
A Performance and Cost Assessment of Machine Learning Interatomic Potentials
Authors:
Yunxing Zuo,
Chi Chen,
Xiangguo Li,
Zhi Deng,
Yiming Chen,
Jörg Behler,
Gábor Csányi,
Alexander V. Shapeev,
Aidan P. Thompson,
Mitchell A. Wood,
Shyue Ping Ong
Abstract:
Machine learning of the quantitative relationship between local environment descriptors and the potential energy surface of a system of atoms has emerged as a new frontier in the development of interatomic potentials (IAPs). Here, we present a comprehensive evaluation of ML-IAPs based on four local environment descriptors --- Behler-Parrinello symmetry functions, smooth overlap of atomic positions…
▽ More
Machine learning of the quantitative relationship between local environment descriptors and the potential energy surface of a system of atoms has emerged as a new frontier in the development of interatomic potentials (IAPs). Here, we present a comprehensive evaluation of ML-IAPs based on four local environment descriptors --- Behler-Parrinello symmetry functions, smooth overlap of atomic positions (SOAP), the Spectral Neighbor Analysis Potential (SNAP) bispectrum components, and moment tensors --- using a diverse data set generated using high-throughput density functional theory (DFT) calculations. The data set comprising bcc (Li, Mo) and fcc (Cu, Ni) metals and diamond group IV semiconductors (Si, Ge) is chosen to span a range of crystal structures and bonding. All descriptors studied show excellent performance in predicting energies and forces far surpassing that of classical IAPs, as well as predicting properties such as elastic constants and phonon dispersion curves. We observe a general trade-off between accuracy and the degrees of freedom of each model, and consequently computational cost. We will discuss these trade-offs in the context of model selection for molecular dynamics and other applications.
△ Less
Submitted 24 July, 2019; v1 submitted 20 June, 2019;
originally announced June 2019.
-
Machine-learned Interatomic Potentials for Alloys and Alloy Phase Diagrams
Authors:
Conrad W. Rosenbrock,
Konstantin Gubaev,
Alexander V. Shapeev,
Livia B. Pártay,
Noam Bernstein,
Gábor Csányi,
Gus L. W. Hart
Abstract:
We introduce machine-learned potentials for Ag-Pd to describe the energy of alloy configurations over a wide range of compositions. We compare two different approaches. Moment tensor potentials (MTP) are polynomial-like functions of interatomic distances and angles. The Gaussian Approximation Potential (GAP) framework uses kernel regression, and we use the Smooth Overlap of Atomic Positions (SOAP)…
▽ More
We introduce machine-learned potentials for Ag-Pd to describe the energy of alloy configurations over a wide range of compositions. We compare two different approaches. Moment tensor potentials (MTP) are polynomial-like functions of interatomic distances and angles. The Gaussian Approximation Potential (GAP) framework uses kernel regression, and we use the Smooth Overlap of Atomic Positions (SOAP) representation of atomic neighbourhoods that consists of a complete set of rotational and permutational invariants provided by the power spectrum of the spherical Fourier transform of the neighbour density. Both types of potentials give excellent accuracy for a wide range of compositions and rival the accuracy of cluster expansion, a benchmark for this system. While both models are able to describe small deformations away from the lattice positions, SOAP-GAP excels at transferability as shown by sensible transformation paths between configurations, and MTP allows, due to its lower computational cost, the calculation of compositional phase diagrams. Given the fact that both methods perform as well as cluster expansion would but yield off-lattice models, we expect them to open new avenues in computational materials modeling for alloys.
△ Less
Submitted 9 July, 2019; v1 submitted 18 June, 2019;
originally announced June 2019.
-
De novo exploration and self-guided learning of potential-energy surfaces
Authors:
Noam Bernstein,
Gábor Csányi,
Volker L. Deringer
Abstract:
Interatomic potential models based on machine learning (ML) are rapidly developing as tools for materials simulations. However, because of their flexibility, they require large fitting databases that are normally created with substantial manual selection and tuning of reference configurations. Here, we show that ML potentials can be built in a largely automated fashion, exploring and fitting poten…
▽ More
Interatomic potential models based on machine learning (ML) are rapidly developing as tools for materials simulations. However, because of their flexibility, they require large fitting databases that are normally created with substantial manual selection and tuning of reference configurations. Here, we show that ML potentials can be built in a largely automated fashion, exploring and fitting potential-energy surfaces from the beginning (de novo) within one and the same protocol. The key enabling step is the use of a configuration-averaged kernel metric that allows one to select the few most relevant structures at each step. The resulting potentials are accurate and robust for the wide range of configurations that occur during structure searching, despite only requiring a relatively small number of single-point DFT calculations on small unit cells. We apply the method to materials with diverse chemical nature and coordination environments, marking a milestone toward the more routine application of ML potentials in physics, chemistry, and materials science.
△ Less
Submitted 24 May, 2019;
originally announced May 2019.
-
Machine-learning of atomic-scale properties based on physical principles
Authors:
Michele Ceriotti,
Michael J. Willatt,
Gábor Csányi
Abstract:
We briefly summarize the kernel regression approach, as used recently in materials modelling, to fitting functions, particularly potential energy surfaces, and highlight how the linear algebra framework can be used to both predict and train from linear functionals of the potential energy, such as the total energy and atomic forces. We then give a detailed account of the Smooth Overlap of Atomic Po…
▽ More
We briefly summarize the kernel regression approach, as used recently in materials modelling, to fitting functions, particularly potential energy surfaces, and highlight how the linear algebra framework can be used to both predict and train from linear functionals of the potential energy, such as the total energy and atomic forces. We then give a detailed account of the Smooth Overlap of Atomic Positions (SOAP) representation and kernel, showing how it arises from an abstract representation of smooth atomic densities, and how it is related to several popular density-based representations of atomic structure. We also discuss recent generalisations that allow fine control of correlations between different atomic species, prediction and fitting of tensorial properties, and also how to construct structural kernels---applicable to comparing entire molecules or periodic systems---that go beyond an additive combination of local environments.
△ Less
Submitted 30 January, 2019;
originally announced January 2019.
-
Quantifying Chemical Structure and Atomic Energies in Amorphous Silicon Networks
Authors:
Noam Bernstein,
Bishal Bhattarai,
Gábor Csányi,
David A. Drabold,
Stephen R. Elliott,
Volker L. Deringer
Abstract:
Amorphous materials are coming within reach of realistic computer simulations, but new approaches are needed to fully understand their intricate atomic structures. Here, we show how machine-learning (ML)-based techniques can give new, quantitative chemical insight into the atomic-scale structure of amorphous silicon (a-Si). Based on a similarity function ("kernel"), we define a structural metric t…
▽ More
Amorphous materials are coming within reach of realistic computer simulations, but new approaches are needed to fully understand their intricate atomic structures. Here, we show how machine-learning (ML)-based techniques can give new, quantitative chemical insight into the atomic-scale structure of amorphous silicon (a-Si). Based on a similarity function ("kernel"), we define a structural metric that unifies the description of nearest- and next-nearest-neighbor environments in the amorphous state. We apply this to an ensemble of a-Si networks, generated in melt-quench simulations with an ML-based interatomic potential, in which we tailor the degree of ordering by varying the quench rates down to $10^{10}$ K/s (leading to a structural model that is lower in energy than the established WWW network). We then show how "machine-learned" atomic energies permit a chemical interpretation, associating coordination defects in a-Si with distinct energetic stability regions. The approach is straightforward and inexpensive to apply to arbitrary structural models, and it is therefore expected to have more general significance for developing a quantitative understanding of the amorphous state.
△ Less
Submitted 27 November, 2018;
originally announced November 2018.
-
Equation of state of fluid methane from first principles with machine learning potentials
Authors:
Max Veit,
Sandeep Kumar Jain,
Satyanarayana Bonakala,
Indranil Rudra,
Detlef Hohl,
Gábor Csányi
Abstract:
The predictive simulation of molecular liquids requires models that are not only accurate, but computationally efficient enough to handle the large systems and long time scales required for reliable prediction of macroscopic properties. We present a new approach to the systematic approximation of the first-principles potential energy surface (PES) of molecular liquids using the GAP (Gaussian Appro…
▽ More
The predictive simulation of molecular liquids requires models that are not only accurate, but computationally efficient enough to handle the large systems and long time scales required for reliable prediction of macroscopic properties. We present a new approach to the systematic approximation of the first-principles potential energy surface (PES) of molecular liquids using the GAP (Gaussian Approximation Potential) framework. The approach allows us to create potentials at several different levels of accuracy in reproducing the true PES, which allows us to test the level of quantum chemistry that is necessary to accurately predict its macroscopic properties. We test the approach by building potentials for liquid methane (CH$_4$), which is difficult to model from first principles because its behavior is dominated by weak dispersion interactions with a significant many-body component. We find that an accurate, consistent prediction of its bulk density across a wide range of temperature and pressure requires not only many-body dispersion, but also quantum nuclear effects to be modeled accurately.
△ Less
Submitted 24 October, 2018;
originally announced October 2018.
-
Machine-learned multi-system surrogate models for materials prediction
Authors:
Chandramouli Nyshadham,
Matthias Rupp,
Brayden Bekker,
Alexander V. Shapeev,
Tim Mueller,
Conrad W. Rosenbrock,
Gábor Csányi,
David W. Wingate,
Gus L. W. Hart
Abstract:
Surrogate machine-learning models are transforming computational materials science by predicting properties of materials with the accuracy of ab initio methods at a fraction of the computational cost. We demonstrate surrogate models that simultaneously interpolate energies of different materials on a dataset of 10 binary alloys (AgCu, AlFe, AlMg, AlNi, AlTi, CoNi, CuFe, CuNi, FeV, NbNi) with 10 di…
▽ More
Surrogate machine-learning models are transforming computational materials science by predicting properties of materials with the accuracy of ab initio methods at a fraction of the computational cost. We demonstrate surrogate models that simultaneously interpolate energies of different materials on a dataset of 10 binary alloys (AgCu, AlFe, AlMg, AlNi, AlTi, CoNi, CuFe, CuNi, FeV, NbNi) with 10 different species and all possible fcc, bcc and hcp structures up to 8 atoms in the unit cell, 15\,950 structures in total. We find that the deviation of prediction errors when increasing the number of simultaneously modeled alloys is less than 1\,meV/atom. Several state-of-the-art materials representations and learning algorithms were found to qualitatively agree on the prediction errors of formation enthalpy with relative errors of $<$2.5\% for all systems.
△ Less
Submitted 20 May, 2019; v1 submitted 24 September, 2018;
originally announced September 2018.
-
Machine learning a general purpose interatomic potential for silicon
Authors:
Albert P. Bartok,
James Kermode,
Noam Bernstein,
Gabor Csanyi
Abstract:
The success of first principles electronic structure calculation for predictive modeling in chemistry, solid state physics, and materials science is constrained by the limitations on simulated length and time scales due to computational cost and its scaling. Techniques based on machine learning ideas for interpolating the Born-Oppenheimer potential energy surface without explicitly describing elec…
▽ More
The success of first principles electronic structure calculation for predictive modeling in chemistry, solid state physics, and materials science is constrained by the limitations on simulated length and time scales due to computational cost and its scaling. Techniques based on machine learning ideas for interpolating the Born-Oppenheimer potential energy surface without explicitly describing electrons have recently shown great promise, but accurately and efficiently fitting the physically relevant space of configurations has remained a challenging goal. Here we present a Gaussian Approximation Potential for silicon that achieves this milestone, accurately reproducing density functional theory reference results for a wide range of observable properties, including crystal, liquid, and amorphous bulk phases, as well as point, line, and plane defects. We demonstrate that this new potential enables calculations that would be extremely expensive with a first principles electronic structure method, such as finite temperature phase boundary lines, self-diffusivity in the liquid, formation of the amorphous by slow quench, and dynamic brittle fracture. We show that the uncertainty quantification inherent to the Gaussian process regression framework gives a qualitative estimate of the potential's accuracy for a given atomic configuration. The success of this model shows that it is indeed possible to create a useful machine-learning-based interatomic potential that comprehensively describes a material, and serves as a template for the development of such models in the future.
△ Less
Submitted 3 May, 2018;
originally announced May 2018.
-
Growth Mechanism and Origin of High $sp^3$ Content in Tetrahedral Amorphous Carbon
Authors:
Miguel A. Caro,
Volker L. Deringer,
Jari Koskinen,
Tomi Laurila,
Gábor Csányi
Abstract:
We study the deposition of tetrahedral amorphous carbon (ta-C) films from molecular dynamics simulations based on a machine-learned interatomic potential trained from density-functional theory data. For the first time, the high $sp^3$ fractions in excess of 85% observed experimentally are reproduced by means of computational simulation, and the deposition energy dependence of the film's characteri…
▽ More
We study the deposition of tetrahedral amorphous carbon (ta-C) films from molecular dynamics simulations based on a machine-learned interatomic potential trained from density-functional theory data. For the first time, the high $sp^3$ fractions in excess of 85% observed experimentally are reproduced by means of computational simulation, and the deposition energy dependence of the film's characteristics is also accurately described. High confidence in the potential and direct access to the atomic interactions allow us to infer the microscopic growth mechanism in this material. While the widespread view is that ta-C grows by "subplantation," we show that the so-called "peening" model is actually the dominant mechanism responsible for the high $sp^3$ content. We show that pressure waves lead to bond rearrangement away from the impact site of the incident ion, and high $sp^3$ fractions arise from a delicate balance of transitions between three- and fourfold coordinated carbon atoms. These results open the door for a microscopic understanding of carbon nanostructure formation with an unprecedented level of predictive power.
△ Less
Submitted 20 April, 2018;
originally announced April 2018.
-
Realistic atomistic structure of amorphous silicon from machine-learning-driven molecular dynamics
Authors:
Volker L. Deringer,
Noam Bernstein,
Albert P. Bartók,
Matthew J. Cliffe,
Rachel N. Kerber,
Lauren E. Marbella,
Clare P. Grey,
Stephen R. Elliott,
Gábor Csányi
Abstract:
Amorphous silicon (a-Si) is a widely studied non-crystalline material, and yet the subtle details of its atomistic structure are still unclear. Here, we show that accurate structural models of a-Si can be obtained by harnessing the power of machine-learning algorithms to create interatomic potentials. Our best a-Si network is obtained by cooling from the melt in molecular-dynamics simulations, at…
▽ More
Amorphous silicon (a-Si) is a widely studied non-crystalline material, and yet the subtle details of its atomistic structure are still unclear. Here, we show that accurate structural models of a-Si can be obtained by harnessing the power of machine-learning algorithms to create interatomic potentials. Our best a-Si network is obtained by cooling from the melt in molecular-dynamics simulations, at a rate of 10$^{11}$ K/s (that is, on the 10 ns timescale). This structure shows a defect concentration of below 2% and agrees with experiments regarding excess energies, diffraction data, as well as $^{29}$Si solid-state NMR chemical shifts. We show that this level of quality is impossible to achieve with faster quench simulations. We then generate a 4,096-atom system which correctly reproduces the magnitude of the first sharp diffraction peak (FSDP) in the structure factor, achieving the closest agreement with experiments to date. Our study demonstrates the broader impact of machine-learning interatomic potentials for elucidating accurate structures and properties of amorphous functional materials.
△ Less
Submitted 7 March, 2018;
originally announced March 2018.
-
Gaussian approximation potential modeling of lithium intercalation in carbon nanostructures
Authors:
So Fujikake,
Volker L. Deringer,
Tae Hoon Lee,
Marcin Krynski,
Stephen R. Elliott,
Gábor Csányi
Abstract:
We demonstrate how machine-learning based interatomic potentials can be used to model guest atoms in host structures. Specifically, we generate Gaussian approximation potential (GAP) models for the interaction of lithium atoms with graphene, graphite, and disordered carbon nanostructures, based on reference density-functional theory (DFT) data. Rather than treating the full Li--C system, we demons…
▽ More
We demonstrate how machine-learning based interatomic potentials can be used to model guest atoms in host structures. Specifically, we generate Gaussian approximation potential (GAP) models for the interaction of lithium atoms with graphene, graphite, and disordered carbon nanostructures, based on reference density-functional theory (DFT) data. Rather than treating the full Li--C system, we demonstrate how the energy and force differences arising from Li intercalation can be modeled and then added to a (prexisting and unmodified) GAP model of pure elemental carbon. Furthermore, we show the benefit of using an explicit pair potential fit to capture "effective" Li--Li interactions, to improve the performance of the GAP model. This provides proof-of-concept for modeling guest atoms in host frameworks with machine-learning based potentials, and in the longer run is promising for carrying out detailed atomistic studies of battery materials.
△ Less
Submitted 13 February, 2018; v1 submitted 12 December, 2017;
originally announced December 2017.
-
Constant-pressure nested sampling with atomistic dynamics
Authors:
Robert J. N. Baldock,
Noam Bernstein,
K. Michael Salerno,
Lívia B. Pártay,
Gábor Csányi
Abstract:
The nested sampling algorithm has been shown to be a general method for calculating the pressure-temperature-composition phase diagrams of materials. While the previous implementation used single-particle Monte Carlo moves, these are inefficient for condensed systems with general interactions where single-particle moves cannot be evaluated faster than the energy of the whole system. Here we enhanc…
▽ More
The nested sampling algorithm has been shown to be a general method for calculating the pressure-temperature-composition phase diagrams of materials. While the previous implementation used single-particle Monte Carlo moves, these are inefficient for condensed systems with general interactions where single-particle moves cannot be evaluated faster than the energy of the whole system. Here we enhance the method by using all-particle moves: either Galilean Monte Carlo or a total enthalpy Hamiltonian Monte Carlo algorithm, introduced in this paper. We show that these algorithms enable the determination of phase transition temperatures with equivalent accuracy to the previous method at $1/N$ of the cost for an $N$-particle system with general interactions, or at equal cost when single particle moves can be done in $1/N$ of the cost of a full $N$-particle energy evaluation.
△ Less
Submitted 16 November, 2017; v1 submitted 30 October, 2017;
originally announced October 2017.
-
Data-driven learning of total and local energies in elemental boron
Authors:
Volker L. Deringer,
Chris J. Pickard,
Gábor Csányi
Abstract:
The allotropes of boron continue to challenge structural elucidation and solid-state theory. Here we use machine learning combined with random structure searching (RSS) algorithms to systematically construct an interatomic potential for boron. Starting from ensembles of randomized atomic configurations, we use alternating single-point quantum-mechanical energy and force computations, Gaussian appr…
▽ More
The allotropes of boron continue to challenge structural elucidation and solid-state theory. Here we use machine learning combined with random structure searching (RSS) algorithms to systematically construct an interatomic potential for boron. Starting from ensembles of randomized atomic configurations, we use alternating single-point quantum-mechanical energy and force computations, Gaussian approximation potential (GAP) fitting, and GAP-driven RSS to iteratively generate a representation of the element's potential-energy surface. Beyond the total energies of the very different boron allotropes, our model readily provides atom-resolved, local energies and thus deepened insight into the frustrated $β$-rhombohedral boron structure. Our results open the door for the efficient and automated generation of GAPs and other machine-learning-based interatomic potentials, and suggest their usefulness as a tool for materials discovery.
△ Less
Submitted 28 October, 2017;
originally announced October 2017.
-
A Machine Learning Potential for Graphene
Authors:
Patrick Rowe,
Gábor Csányi,
Dario Alfè,
Angelos Michaelides
Abstract:
We present an accurate interatomic potential for graphene, constructed using the Gaussian Approximation Potential (GAP) machine learning methodology. This GAP model obtains a faithful representation of a density functional theory (DFT) potential energy surface, facilitating highly accurate (approaching the accuracy of ab initio methods) molecular dynamics simulations. This is achieved at a computa…
▽ More
We present an accurate interatomic potential for graphene, constructed using the Gaussian Approximation Potential (GAP) machine learning methodology. This GAP model obtains a faithful representation of a density functional theory (DFT) potential energy surface, facilitating highly accurate (approaching the accuracy of ab initio methods) molecular dynamics simulations. This is achieved at a computational cost which is orders of magnitude lower than that of comparable calculations which directly invoke electronic structure methods. We evaluate the accuracy of our machine learning model alongside that of a number of popular empirical and bond-order potentials, using both experimental and ab initio data as references. We find that whilst significant discrepancies exist between the empirical interatomic potentials and the reference data - and amongst the empirical potentials themselves - the machine learning model introduced here provides exemplary performance in all of the tested areas. The calculated properties include: graphene phonon dispersion curves at 0 K (which we predict with sub-meV accuracy), phonon spectra at finite temperature, in-plane thermal expansion up to 2500 K as compared to NPT ab initio molecular dynamics simulations and a comparison of the thermally induced dispersion of graphene Raman bands to experimental observations. We have made our potential freely available online at [http://www.libatoms.org].
△ Less
Submitted 16 October, 2017; v1 submitted 11 October, 2017;
originally announced October 2017.
-
Symmetry-Adapted Machine-Learning for Tensorial Properties of Atomistic Systems
Authors:
Andrea Grisafi,
David M. Wilkins,
Gábor Csányi,
Michele Ceriotti
Abstract:
Statistical learning methods show great promise in providing an accurate prediction of materials and molecular properties, while minimizing the need for computationally demanding electronic structure calculations. The accuracy and transferability of these models are increased significantly by encoding into the learning procedure the fundamental symmetries of rotational and permutational invariance…
▽ More
Statistical learning methods show great promise in providing an accurate prediction of materials and molecular properties, while minimizing the need for computationally demanding electronic structure calculations. The accuracy and transferability of these models are increased significantly by encoding into the learning procedure the fundamental symmetries of rotational and permutational invariance of scalar properties. However, the prediction of tensorial properties requires that the model respects the appropriate geometric transformations, rather than invariance, when the reference frame is rotated. We introduce a formalism that can be used to perform machine-learning of tensorial properties of arbitrary rank for general molecular geometries. To demonstrate it, we derive a tensor kernel adapted to rotational symmetry, which is the natural generalization of the smooth overlap of atomic positions (SOAP) kernel commonly used for the prediction of scalar properties at the atomic scale. The performance and generality of the approach is demonstrated by learning the instantaneous electrical response of water oligomers of increasing complexity, from the isolated molecule to the condensed phase.
△ Less
Submitted 20 September, 2017;
originally announced September 2017.
-
Achieving DFT accuracy with a machine-learning interatomic potential: thermomechanics and defects in bcc ferromagnetic iron
Authors:
Daniele Dragoni,
Thomas D. Daff,
Gabor Csanyi,
Nicola Marzari
Abstract:
We show that the Gaussian Approximation Potential machine learning framework can describe complex magnetic potential energy surfaces, taking ferromagnetic iron as a paradigmatic challenging case. The training database includes total energies, forces, and stresses obtained from density-functional theory in the generalized-gradient approximation, and comprises approximately 150,000 local atomic envi…
▽ More
We show that the Gaussian Approximation Potential machine learning framework can describe complex magnetic potential energy surfaces, taking ferromagnetic iron as a paradigmatic challenging case. The training database includes total energies, forces, and stresses obtained from density-functional theory in the generalized-gradient approximation, and comprises approximately 150,000 local atomic environments, ranging from pristine and defected bulk configurations to surfaces and generalized stacking faults with different crystallographic orientations. We find the structural, vibrational and thermodynamic properties of the GAP model to be in excellent agreement with those obtained directly from first-principles electronic-structure calculations. There is good transferability to quantities, such as Peierls energy barriers, which are determined to a large extent by atomic configurations that were not part of the training set. We observe the benefit and the need of using highly converged electronic-structure calculations to sample a target potential energy surface. The end result is a systematically improvable potential that can achieve the same accuracy of density-functional theory calculations, but at a fraction of the computational cost.
△ Less
Submitted 30 June, 2017;
originally announced June 2017.
-
Machine Learning Unifies the Modelling of Materials and Molecules
Authors:
Albert P. Bartok,
Sandip De,
Carl Poelking,
Noam Bernstein,
James Kermode,
Gabor Csanyi,
Michele Ceriotti
Abstract:
Determining the stability of molecules and condensed phases is the cornerstone of atomistic modelling, underpinning our understanding of chemical and materials properties and transformations. Here we show that a machine learning model, based on a local description of chemical environments and Bayesian statistical learning, provides a unified framework to predict atomic-scale properties. It capture…
▽ More
Determining the stability of molecules and condensed phases is the cornerstone of atomistic modelling, underpinning our understanding of chemical and materials properties and transformations. Here we show that a machine learning model, based on a local description of chemical environments and Bayesian statistical learning, provides a unified framework to predict atomic-scale properties. It captures the quantum mechanical effects governing the complex surface reconstructions of silicon, predicts the stability of different classes of molecules with chemical accuracy, and distinguishes active and inactive protein ligands with more than 99% reliability. The universality and the systematic nature of our framework provides new insight into the potential energy surface of materials and molecules.
△ Less
Submitted 15 December, 2017; v1 submitted 1 June, 2017;
originally announced June 2017.
-
Polytypism in the ground state structure of the Lennard-Jonesium
Authors:
Lívia B. Pártay,
Christoph Ortner,
Albert P. Bartók,
Chris J. Pickard,
Gábor Csányi
Abstract:
We present a systematic study of the stability of nineteen different periodic structures using the finite range Lennard-Jones potential model discussing the effects of pressure, potential truncation, cutoff distance and Lennard-Jones exponents. The structures considered are the hexagonal close packed (hcp), face centred cubic (fcc) and seventeen other polytype stacking sequences, such as dhcp and…
▽ More
We present a systematic study of the stability of nineteen different periodic structures using the finite range Lennard-Jones potential model discussing the effects of pressure, potential truncation, cutoff distance and Lennard-Jones exponents. The structures considered are the hexagonal close packed (hcp), face centred cubic (fcc) and seventeen other polytype stacking sequences, such as dhcp and $9R$. We found that at certain pressure and cutoff distance values, neither fcc nor hcp is the ground state structure as previously documented, but different polytypic sequences. This behaviour shows a strong dependence on the way the tail of the potential is truncated.
△ Less
Submitted 4 May, 2017;
originally announced May 2017.