Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

An 02-norm regularized regression model for construction of robust cluster expansions in multicomponent systems

Peichen Zhong, Tina Chen, Luis Barroso-Luque, Fengyu Xie, and Gerbrand Ceder
Phys. Rev. B 106, 024203 – Published 25 July 2022

Abstract

We introduce 02-norm regularization and hierarchy constraints into linear regression for the construction of cluster expansions to describe configurational disorder in materials. The approach is implemented through mixed integer quadratic programming (MIQP). The 2-norm regularization is used to suppress intrinsic data noise, while the 0-norm is used to penalize the number of nonzero elements in the solution. The hierarchy relation between clusters imposes relevant physics and is naturally included by the MIQP paradigm. As such, sparseness and cluster hierarchy can be well optimized to obtain a robust, converged set of effective cluster interactions with improved physical meaning. We demonstrate the effectiveness of 02-norm regularization in two high-component disordered rocksalt cathode material systems, where we compare the cross-validation, convergence speed, and the reproduction of phase diagrams, voltage profiles, and Li-occupancy energies with those of the conventional 1-norm regularized cluster expansion models.

  • Figure
  • Figure
  • Figure
  • Figure
  • Figure
  • Received 28 April 2022
  • Revised 27 June 2022
  • Accepted 13 July 2022

DOI:https://doi.org/10.1103/PhysRevB.106.024203

©2022 American Physical Society

Physics Subject Headings (PhySH)

Condensed Matter, Materials & Applied Physics

Authors & Affiliations

Peichen Zhong1,2, Tina Chen1,2, Luis Barroso-Luque1,2, Fengyu Xie1,2, and Gerbrand Ceder1,2,*

  • 1Department of Materials Science and Engineering, University of California, Berkeley, California 94720, United States
  • 2Materials Sciences Division, Lawrence Berkeley National Laboratory, California 94720, United States

  • *gceder@berkeley.edu

Article Text (Subscription Required)

Click to Expand

References (Subscription Required)

Click to Expand
Issue

Vol. 106, Iss. 2 — 1 July 2022

Reuse & Permissions
Access Options
CHORUS

Article Available via CHORUS

Download Accepted Manuscript
Author publication services for translation and copyediting assistance advertisement

Authorization Required


×

Images

  • Figure 1
    Figure 1

    (a) The general flowchart of constructing a CE model, including initialization of input structures, DFT calculations, fitting and convergence check, and cluster expansion Monte Carlo (CEMC) for sampling. (b) An illustration of 2, 1, and 0-norm regularization in a two-parameter space J = (J1, J2). The blue circles represent the contours of the data term ||EDFT,SΠSJ||22 in cost function. The red regions represent the constraints of parameters (e.g., J12+J22s for 2-norm, |J1|+|J2|s for 1-norm.) The dark red point is the intersection of data term and regularization of parameters, which jointly determines the estimation of J.

    Reuse & Permissions
  • Figure 2
    Figure 2

    Illustration of hierarchy relations (αβγ) between pair, triplet, and quadruplet orbit. The different colors on the cluster sites represent the decorating species for a given site-basis function. The equation in red shows a pseudoactive hierarchy constraint that may appear in 1-norm and its derivative methods.

    Reuse & Permissions
  • Figure 3
    Figure 3

    (a) An illustration of the rocksalt lattice structure. The cation sites are labeled in red and can be occupied by Li+ and transition metals (TM, including Mn2+, V3+, and Ti4+ in our example) in DRX. The anion sites are labeled in gray and can be occupied by O2 and F. The lower panel gives some examples of n-body (n=2,3,and4) clusters used in the CE model, including intra and inter-sublattice interactions. (b) The procedure to obtain an 02-norm regularized solution, including finding the μ2 by minimizing the CV error in ridge regression, sparseness engineering with 0-norm using MIQP, and terminating if the solution is converged with good sparseness, as well as good-reproduction-relevant physical properties.

    Reuse & Permissions
  • Figure 4
    Figure 4

    (a) Cross-validation error (meV/atom) of the 1-CE and the 02-CE. The sparseness is the number of nonzero ECIs in the fit (||J||0). The curves are generated by varying hyperparameters μ0,μ1, and μ2 in regularization. (b) ECIs convergence test vs training set size. J is the ECIs fitted with full training data, and Jsub is the ECIs fitted with a subset of corresponding size.

    Reuse & Permissions
  • Figure 5
    Figure 5

    (a) Phase diagram generated with DFT, 02-CE, and 1-CE. The DFT ground states are labeled in blue text. The incorrectly predicted ground states are labeled with red circles and text. (b) The simplified spinel voltage profile (blue line) generated by 1-CE and 02-CE for spinel orderings in LixMn2O4 is compared with the DFT ground-truths (orange line). (c) Energy difference of Li occupation in octahedral and tetrahedral sites in layered MnO2 (top) and spinel MnO2 framework (bottom).

    Reuse & Permissions
×

Sign up to receive regular email alerts from Physical Review B

Log In

Cancel
×

Search


Article Lookup

Paste a citation or DOI

Enter a citation
×