Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
\WarningFilter

revtex4-2Repair the float

Thermal Conductivity Predictions with Foundation Atomistic Models

Balázs Póta Theory of Condensed Matter Group of the Cavendish Laboratory, University of Cambridge, Cambridge, UK    Paramvir Ahlawat Theory of Condensed Matter Group of the Cavendish Laboratory, University of Cambridge, Cambridge, UK    Gábor Csányi Engineering Laboratory, University of Cambridge, Cambridge, UK    Michele Simoncelli ms2855@cam.ac.uk Theory of Condensed Matter Group of the Cavendish Laboratory, University of Cambridge, Cambridge, UK
Abstract

Recent advances in machine learning have led to foundation models for atomistic materials chemistry, potentially enabling quantum-accurate descriptions of interatomic forces at reduced computational cost. These models are benchmarked by predicting materials’ properties over large databases; however, these computationally intensive tests have been limited to basic quantities related to harmonic phonons, leaving uncertainty about the reliability for complex, technologically and experimentally relevant anharmonic heat-conduction properties. Here we present an automated framework that relies on foundation models to compute microscopic vibrational properties, and employs them within the Wigner formulation of heat transport to predict the macroscopic thermal conductivity in solids with arbitrary composition and structure. We apply this framework with the foundation models M3GNet, CHGNet, MACE-MP-0, and SevenNET to 103 diverse compounds, comparing predictions against first-principles references and introducing a benchmark metric based on conductivity. This framework paves the way for physics-aware, accurate predictions of vibrational and thermal properties, and for uncovering materials that violate semiclassical Boltzmann transport and feature exceptional heat-shielding or thermoelectric performance.

Over the past decades, several research groups have tackled the challenging computational task of fitting the Born-Oppenheimer potential energy surface as a function of atomic coordinates [1, 2, 3, 4, 5, 6, 7]. These efforts resulted in the development of so-called machine-learning potentials (MLPs), which allow us to reproduce first-principles energies and microscopic interatomic forces with nearly the same accuracy and orders-of-magnitude lower computational cost. These developments enable the prediction of macroscopic observables from the integration of atomistic properties at a computational cost much reduced compared to first-principles methods, effectively opening avenues to design materials for target applications from theory. For example, engineering the magnitude of the macroscopic thermal conductivity of a material through changes in its atomistic composition and structure is crucial for neuromorphic computing [8], thermoelectric energy harvesting [9], and aerospace [10] technologies. Major drawbacks of MLPs-based methods are the significant work required to generate a first-principles database used to train and validate MPLs [11], as well as the applicability limited to specific materials’ compositions or structural phases. Past works attempted to bypass these limitations by employing end-to-end machine-learning methods, which predict microscopic [12] or macroscopic [13, 14] materials’ properties very efficiently, but with the compromise of not rigorously resolving the fundamental physics that underlies them. Fittingly, recent work [15] has formally demonstrated the possibility to obtain a complete description (in the mathematical sense) of atomic environments (and thus of forces) by employing Message Passing Neural Networks (MPNNs) models [16] with many-body messages [17] (to be precise, with the MACE architecture [17] this can be achieved using 4-body terms and one message pass). This breakthrough has enabled the development of foundation machine-learning potentials (fMLPs) [18, 19, 20, 21, 22, 23], which are trained across nearly all chemical elements and can be directly combined to describe materials with diverse structures and compositions. Therefore, fMLPs could potentially overcome the problems related to complex training and the very limited transferability of conventional MLPs, ultimately enabling physics-aware predictions of macroscopic observables from the integration of atomistic quantities. Recent research efforts have assessed the accuracy of fMLPs in predicting, e.g., the structural stability of solids [24] or the harmonic vibrational frequencies [25, 26, 27, 28]. However, fMLPs have not yet been tested for predicting vibrational-anharmonic and heat-conduction properties, due to the complexity of the computational framework that relates the interatomic forces to the macroscopic conductivity [29, 30, 31, 32, 33, 34, 35, 36, 37].

Here we present an automated framework that uses fMLPs to compute anharmonic vibrational and thermal properties. We employ this to benchmark against first-principles reference data [30, 31] the predictions obtained from state-of-the-art (SOTA) fMLPs M3GNet [18], CHGNet [19], MACE-MP-0 [20], and SevenNET [21] (all trained on the Materials Project DFT-PBE database [38] and hereafter collectively referred to as ‘mp-fMLPs’). In particular, after detailing the methods to calculate harmonic and anharmonic vibrational properties of solids using fMLPs, we discuss how these quantities determine the thermal conductivity within the recently developed Wigner heat-Transport Equation (WTE) [34, 35], which generalizes the Boltzmann Transport Equation (BTE) [39] accounting not only for heat carried by particle-like propagation of phonons, but also for conduction arising from phonons’ wave-like tunneling. Thus, this framework is employed to predict from DFT-PBE or mp-fMLPs the thermal conductivity of 103 compounds made up of 34 different chemical species and having wurtzite, zincblende, or rocksalt structure. We introduce descriptors to quantify the accuracy of fMLPs in predicting anharmonic and thermal properties, and show that they could potentially be used as a metric to benchmark or fine-tune fMLPs. Finally, we show how the framework introduced paves the way for rigorous, physics-aware predictions of vibrational and thermal properties in materials with arbitrary composition and structure, highlighting its potential to find materials that violate semiclassical Boltzmann conduction relevant for thermal-insulation or thermoelectric applications.

Results
From vibrational energy to thermal conductivity
The temperature-dependent thermal conductivity (κ(T)𝜅𝑇\kappa(T)italic_κ ( italic_T )) of a solid is a macroscopic, experimentally and technologically relevant quantity that describes the capability to conduct heat. To predict such quantity, we start from the Born-Oppenheimer Hamiltonian for atomic vibrations [35] expanded up to anharmonic third order in displacements from equilibrium, and we account for kinetic-energy perturbations due to isotopes [40],

H^=𝑹,b,αp^𝑹bα22Mb+12𝑹,b,α𝑹,b,α2Vu𝑹bαu𝑹bα|equ𝑹bαu𝑹bα+13!𝑹,b,α𝑹,b,α3Vu𝑹bαu𝑹bαu𝑹′′b′′α′′|equ𝑹bαu𝑹bαu𝑹′′b′′α′′+𝑹,b,α(mbMb1)p^𝑹bα22Mb,\begin{split}&\hat{H}=\sum_{\bm{R},b,\alpha}{\frac{\hat{p}^{2}_{\bm{R}b\alpha}% }{2M_{b}}}+\frac{1}{2}\!\!\!\sum_{\begin{subarray}{c}\scriptscriptstyle{\bm{R}% ,b,\alpha}\\ \scriptscriptstyle{\bm{R^{\prime}}\!,b^{\prime}\!,\alpha^{\prime}}\end{% subarray}}\!\!\frac{\partial^{2}{V}}{\partial{u}_{\bm{R}b\alpha}\partial{u}_{% \bm{R^{\prime}}b^{\prime}\alpha^{\prime}}\!}\bigg{\lvert}_{\!\rm eq}\!\!{u}_{% \bm{R}b\alpha}{u}_{\bm{R^{\prime}}b^{\prime}\alpha^{\prime}}\\ &+\frac{1}{3!}\!\!\!\sum_{\begin{subarray}{c}\scriptscriptstyle{\bm{R},b,% \alpha}\\ \scriptscriptstyle{\bm{R^{\prime}}\!,b^{\prime}\!,\alpha^{\prime}}\end{% subarray}}\!\!\frac{\partial^{3}{V}}{\partial{u}_{\bm{R}b\alpha}\partial{u}_{% \bm{R^{\prime}}b^{\prime}\alpha^{\prime}}\!\partial{u}_{\bm{R}^{\prime\prime}b% ^{\prime\prime}\alpha^{\prime\prime}}\!}\bigg{\lvert}_{\!\rm eq}\!\!{u}_{\bm{R% }b\alpha}{u}_{\bm{R^{\prime}}b^{\prime}\alpha^{\prime}}{u}_{\bm{R}^{\prime% \prime}b^{\prime\prime}\alpha^{\prime\prime}}\,\\ &+\sum_{\scriptscriptstyle{\bm{R},b,\alpha}}\left(\frac{m_{b}}{M_{b}}-1\right)% {\frac{\hat{p}^{2}_{\bm{R}b\alpha}}{2M_{b}}},\end{split}start_ROW start_CELL end_CELL start_CELL over^ start_ARG italic_H end_ARG = ∑ start_POSTSUBSCRIPT bold_italic_R , italic_b , italic_α end_POSTSUBSCRIPT divide start_ARG over^ start_ARG italic_p end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_R italic_b italic_α end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_M start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT end_ARG + divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL bold_italic_R , italic_b , italic_α end_CELL end_ROW start_ROW start_CELL bold_italic_R start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT , italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_CELL end_ROW end_ARG end_POSTSUBSCRIPT divide start_ARG ∂ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_V end_ARG start_ARG ∂ italic_u start_POSTSUBSCRIPT bold_italic_R italic_b italic_α end_POSTSUBSCRIPT ∂ italic_u start_POSTSUBSCRIPT bold_italic_R start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_ARG | start_POSTSUBSCRIPT roman_eq end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT bold_italic_R italic_b italic_α end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT bold_italic_R start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + divide start_ARG 1 end_ARG start_ARG 3 ! end_ARG ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL bold_italic_R , italic_b , italic_α end_CELL end_ROW start_ROW start_CELL bold_italic_R start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT , italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_CELL end_ROW end_ARG end_POSTSUBSCRIPT divide start_ARG ∂ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_V end_ARG start_ARG ∂ italic_u start_POSTSUBSCRIPT bold_italic_R italic_b italic_α end_POSTSUBSCRIPT ∂ italic_u start_POSTSUBSCRIPT bold_italic_R start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∂ italic_u start_POSTSUBSCRIPT bold_italic_R start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT italic_b start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT italic_α start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_ARG | start_POSTSUBSCRIPT roman_eq end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT bold_italic_R italic_b italic_α end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT bold_italic_R start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT bold_italic_R start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT italic_b start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT italic_α start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + ∑ start_POSTSUBSCRIPT bold_italic_R , italic_b , italic_α end_POSTSUBSCRIPT ( divide start_ARG italic_m start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT end_ARG start_ARG italic_M start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT end_ARG - 1 ) divide start_ARG over^ start_ARG italic_p end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_R italic_b italic_α end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_M start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT end_ARG , end_CELL end_ROW (1)

where p^𝑹bαsubscript^𝑝𝑹𝑏𝛼\hat{p}_{\bm{R}b\alpha}over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT bold_italic_R italic_b italic_α end_POSTSUBSCRIPT and u^𝑹bαsubscript^𝑢𝑹𝑏𝛼\hat{u}_{\bm{R}b\alpha}over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT bold_italic_R italic_b italic_α end_POSTSUBSCRIPT are the momentum and positions-displacement operators along the Cartesian direction α𝛼\alphaitalic_α for the atom b𝑏bitalic_b having isotope-averaged mass Mbsubscript𝑀𝑏M_{b}italic_M start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT and position 𝑹+𝝉b𝑹subscript𝝉𝑏\bm{R}{+}\bm{\tau}_{b}bold_italic_R + bold_italic_τ start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT (here, 𝑹𝑹\bm{R}bold_italic_R is the Bravais-lattice vector and 𝝉bsubscript𝝉𝑏\bm{\tau}_{b}bold_italic_τ start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT the position in the crystal’s primitive cell); the last term in Eq. (1) describes kinetic-energy perturbations induced by isotopes [40, 29] (mbsubscript𝑚𝑏m_{b}italic_m start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT is the exact mass of the atom at position 𝝉bsubscript𝝉𝑏\bm{\tau}_{b}bold_italic_τ start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT, which can deviate from the isotopically-averaged mass Mbsubscript𝑀𝑏M_{b}italic_M start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT). The leading (harmonic) term in such an equation determines the vibrational frequencies; in particular, the Fourier transform of the mass-rescaled second-order derivative of the interatomic potential yields the dynamical matrix at wavevector 𝒒𝒒\bm{q}bold_italic_q, D(𝒒)bα,bα=𝑹2Vu𝑹bαu𝑹bα|eqei𝒒(𝑹+𝝉b𝝉b)MbMb\mathsfit{D}(\bm{q})_{b\alpha,b^{\prime}\!\alpha^{\prime}}{=}\sum_{\bm{R}}% \frac{\partial^{2}{V}}{\partial{u}_{\bm{R}b\alpha}\partial{u}_{\bm{R}^{\prime}% b^{\prime}\alpha^{\prime}}}\Big{\lvert}_{\rm eq}\frac{e^{-i\bm{q}\cdot(\bm{R}{% +}\bm{\tau}_{b}{-}\bm{\tau}_{b^{\prime}})}}{\sqrt{M_{b}M_{b^{\prime}}}}slanted_D ( bold_slanted_q ) start_POSTSUBSCRIPT slanted_b italic_α , slanted_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT bold_slanted_R end_POSTSUBSCRIPT divide start_ARG ∂ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT slanted_V end_ARG start_ARG ∂ slanted_u start_POSTSUBSCRIPT bold_slanted_R slanted_b italic_α end_POSTSUBSCRIPT ∂ slanted_u start_POSTSUBSCRIPT bold_slanted_R start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT slanted_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_ARG | start_POSTSUBSCRIPT roman_eq end_POSTSUBSCRIPT divide start_ARG slanted_e start_POSTSUPERSCRIPT - slanted_i bold_slanted_q ⋅ ( bold_slanted_R + bold_italic_τ start_POSTSUBSCRIPT slanted_b end_POSTSUBSCRIPT - bold_italic_τ start_POSTSUBSCRIPT slanted_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG start_ARG square-root start_ARG slanted_M start_POSTSUBSCRIPT slanted_b end_POSTSUBSCRIPT slanted_M start_POSTSUBSCRIPT slanted_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_ARG end_ARG. By diagonalizing the dynamical matrix,

bαD(𝒒)bα,bα(𝒒)s,bα=ω2(𝒒)s(𝒒)s,bα,subscriptsuperscript𝑏superscript𝛼𝐷subscript𝒒𝑏𝛼superscript𝑏superscript𝛼subscript𝒒𝑠superscript𝑏superscript𝛼superscript𝜔2subscript𝒒𝑠subscript𝒒𝑠𝑏𝛼\textstyle\sum_{b^{\prime}\alpha^{\prime}}\mathsfit{D}(\bm{q})_{b\alpha,b^{% \prime}\!\alpha^{\prime}}\mathcal{E}(\bm{q})_{s,b^{\prime}\!\alpha^{\prime}}=% \omega^{2}(\bm{q})_{s}\mathcal{E}(\bm{q})_{s,b\alpha},∑ start_POSTSUBSCRIPT italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT slanted_D ( bold_slanted_q ) start_POSTSUBSCRIPT slanted_b italic_α , slanted_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT caligraphic_E ( bold_slanted_q ) start_POSTSUBSCRIPT slanted_s , slanted_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = italic_ω start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( bold_slanted_q ) start_POSTSUBSCRIPT slanted_s end_POSTSUBSCRIPT caligraphic_E ( bold_slanted_q ) start_POSTSUBSCRIPT slanted_s , slanted_b italic_α end_POSTSUBSCRIPT , (2)

one obtains from the eigenvalues the phonon energies ω(𝒒)sPlanck-constant-over-2-pi𝜔subscript𝒒𝑠\hbar\omega(\bm{q})_{s}roman_ℏ italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT of the solid (s𝑠sitalic_s is a band index ranging from 1 to 3Natsubscript𝑁atN_{\rm at}italic_N start_POSTSUBSCRIPT roman_at end_POSTSUBSCRIPT, where Natsubscript𝑁atN_{\rm at}italic_N start_POSTSUBSCRIPT roman_at end_POSTSUBSCRIPT is the number of atoms in the primitive cell), and from the eigenvectors (𝒒)s,bαsubscript𝒒𝑠𝑏𝛼\mathcal{E}(\bm{q})_{s,b\alpha}caligraphic_E ( bold_italic_q ) start_POSTSUBSCRIPT italic_s , italic_b italic_α end_POSTSUBSCRIPT the displacement patterns of atom b𝑏bitalic_b in direction α𝛼\alphaitalic_α for the normal mode s𝑠sitalic_s.

The third derivative in Eq. (1), instead, determines the anharmonic linewidth Γa(𝒒)sPlanck-constant-over-2-pisubscriptΓasubscript𝒒𝑠\hbar\Gamma_{\rm a}(\bm{q})_{s}roman_ℏ roman_Γ start_POSTSUBSCRIPT roman_a end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT (energy broadening due to three-phonon interactions [29]) of the phonon (𝒒)ssubscript𝒒𝑠(\bm{q})_{s}( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT:

Γa(𝒒)s=πNc2𝒒𝒒s,s′′{2[N¯(𝒒)sN¯(𝒒)s]δ(ω(𝒒)s+ω(𝒒)sω(𝒒)s)+[1+N¯(𝒒)s+N¯(𝒒)s]δ[ω(𝒒)sω(𝒒)sω(𝒒)s]}×|α,α,αb,b,b,𝑹𝑹3Vu𝟎bαu𝑹bαu𝑹bα(𝒒)s,bα(𝒒)s,bα(𝒒)s,bα38Δ(𝒒+𝒒+𝒒)ei[𝒒𝝉b+𝒒(𝑹+𝝉b)+𝒒(𝑹+𝝉b)]MbMbMbω(𝒒)sω(𝒒)sω(𝒒)s|2,\begin{split}&\hbar\Gamma_{\!\!\rm a}\!(\bm{q})_{\!s}{=}\!\frac{\pi}{N_{c}^{2}% }\!{\sum_{\begin{subarray}{c}\bm{q^{\prime}}\!\bm{q^{\prime}\!{}^{\prime}\!}\\ s^{\prime}\!,s^{\prime\prime}\!\end{subarray}}}\!\Big{\{}\!2\!\big{[}\bar{% \mathsfit{N}}(\bm{q^{\prime}\!})_{s^{\prime}\!}{-}\bar{\mathsfit{N}}(\bm{q^{% \prime}\!{}^{\prime}\!})_{s^{\prime}\!{}^{\prime}\!}\big{]}\delta\big{(}\omega% (\bm{q})_{s}\!{+}\omega(\bm{q^{\prime}\!})_{s^{\prime}\!}{-}\omega(\bm{q^{% \prime}\!{}^{\prime}\!})_{s^{\prime}\!{}^{\prime}\!}\big{)}\\ &+\big{[}1{+}\bar{\mathsfit{N}}(\bm{q^{\prime}\!})_{s^{\prime}\!}{+}\bar{% \mathsfit{N}}(\bm{q^{\prime}\!{}^{\prime}\!})_{s^{\prime}\!{}^{\prime}\!}\big{% ]}\delta\big{[}\omega(\bm{q})_{s}{-}\omega(\bm{q^{\prime}\!})_{s^{\prime}\!}{-% }\omega(\bm{q^{\prime}\!{}^{\prime}\!})_{s^{\prime}\!{}^{\prime}\!}\big{]}\Big% {\}}\times\\ &\Bigg{|}\sum_{\begin{subarray}{c}\alpha,\alpha^{\prime}\!,\alpha^{\prime}\!{}% ^{\prime}\\ b,b^{\prime}\!,b^{\prime}\!{}^{\prime}\!,\bm{R^{\prime}}\!\bm{R^{\prime}\!{}^{% \prime}}\end{subarray}}\frac{\partial^{3}V}{\partial u_{\bm{0}b\alpha}\partial u% _{\bm{R^{\prime}\!}b^{\prime}\!\alpha^{\prime}\!}\partial u_{\bm{R^{\prime}\!{% }^{\prime}}\!b^{\prime}\!{}^{\prime}\!\alpha^{\prime}\!{}^{\prime}}}\mathcal{E% }(\bm{q})_{s,b\alpha}\mathcal{E}(\bm{q^{\prime}})_{s^{\prime}\!,b^{\prime}\!% \alpha^{\prime}}\mathcal{E}(\bm{q^{\prime}\!{}^{\prime}})_{s^{\prime}\!{}^{% \prime}\!,b^{\prime}\!{}^{\prime}\!\alpha^{\prime}\!{}^{\prime}\!}\\ &\sqrt{\frac{\hbar^{3}}{8}}\frac{\Delta(\bm{q}{+}\bm{q^{\prime}\!}{+}\bm{q^{% \prime}\!{}^{\prime}\!})e^{-i[\bm{q}\cdot\bm{\tau}_{b}{+}\bm{q^{\prime}\!}% \cdot(\bm{R^{\prime}\!}{+}\bm{\tau}_{b^{\prime}\!}){+}\bm{q^{\prime}\!{}^{% \prime}\!}\cdot(\bm{R^{\prime}\!{}^{\prime}\!}{+}\bm{\tau}_{b^{\prime}\!{}^{% \prime}\!})]}}{\sqrt{M_{b}M_{b^{\prime}\!}M_{b^{\prime}\!{}^{\prime}\!}\omega(% \bm{q})_{s}\omega(\bm{q^{\prime}\!})_{s^{\prime}}\omega(\bm{q^{\prime}\!{}^{% \prime}\!})_{s^{\prime}\!{}^{\prime}\!}}}\Bigg{|}^{2},\end{split}start_ROW start_CELL end_CELL start_CELL roman_ℏ roman_Γ start_POSTSUBSCRIPT roman_a end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = divide start_ARG italic_π end_ARG start_ARG italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT bold_′ end_FLOATSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_s start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT end_CELL end_ROW end_ARG end_POSTSUBSCRIPT { 2 [ over¯ start_ARG slanted_N end_ARG ( bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - over¯ start_ARG slanted_N end_ARG ( bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT bold_′ end_FLOATSUPERSCRIPT ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUBSCRIPT ] italic_δ ( italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT + italic_ω ( bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_ω ( bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT bold_′ end_FLOATSUPERSCRIPT ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUBSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + [ 1 + over¯ start_ARG slanted_N end_ARG ( bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT + over¯ start_ARG slanted_N end_ARG ( bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT bold_′ end_FLOATSUPERSCRIPT ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUBSCRIPT ] italic_δ [ italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT - italic_ω ( bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_ω ( bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT bold_′ end_FLOATSUPERSCRIPT ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUBSCRIPT ] } × end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL | ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_α , italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL italic_b , italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT , bold_italic_R start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT bold_italic_R start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT bold_′ end_FLOATSUPERSCRIPT end_CELL end_ROW end_ARG end_POSTSUBSCRIPT divide start_ARG ∂ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_V end_ARG start_ARG ∂ italic_u start_POSTSUBSCRIPT bold_0 italic_b italic_α end_POSTSUBSCRIPT ∂ italic_u start_POSTSUBSCRIPT bold_italic_R start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∂ italic_u start_POSTSUBSCRIPT bold_italic_R start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT bold_′ end_FLOATSUPERSCRIPT italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUBSCRIPT end_ARG caligraphic_E ( bold_italic_q ) start_POSTSUBSCRIPT italic_s , italic_b italic_α end_POSTSUBSCRIPT caligraphic_E ( bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT caligraphic_E ( bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT bold_′ end_FLOATSUPERSCRIPT ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT , italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL square-root start_ARG divide start_ARG roman_ℏ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG start_ARG 8 end_ARG end_ARG divide start_ARG roman_Δ ( bold_italic_q + bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT + bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT bold_′ end_FLOATSUPERSCRIPT ) italic_e start_POSTSUPERSCRIPT - italic_i [ bold_italic_q ⋅ bold_italic_τ start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT + bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT ⋅ ( bold_italic_R start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT + bold_italic_τ start_POSTSUBSCRIPT italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) + bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT bold_′ end_FLOATSUPERSCRIPT ⋅ ( bold_italic_R start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT bold_′ end_FLOATSUPERSCRIPT + bold_italic_τ start_POSTSUBSCRIPT italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUBSCRIPT ) ] end_POSTSUPERSCRIPT end_ARG start_ARG square-root start_ARG italic_M start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUBSCRIPT italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT italic_ω ( bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_ω ( bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT bold_′ end_FLOATSUPERSCRIPT ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUBSCRIPT end_ARG end_ARG | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , end_CELL end_ROW (3)

where N¯(𝒒)s=[exp(ω(𝒒)s/kBT)1]1¯𝑁subscript𝒒𝑠superscriptdelimited-[]Planck-constant-over-2-pi𝜔subscript𝒒𝑠subscript𝑘B𝑇11\bar{\mathsfit{N}}(\bm{q})_{s}{=}[\exp(\hbar\omega(\bm{q})_{s}/k_{\rm B}T){-}1% ]^{-1}over¯ start_ARG slanted_N end_ARG ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = [ roman_exp ( roman_ℏ italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT / italic_k start_POSTSUBSCRIPT roman_B end_POSTSUBSCRIPT italic_T ) - 1 ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT is the Bose-Einstein distribution at temperature T𝑇Titalic_T, Δ(𝒒+𝒒+𝒒)\Delta(\bm{q}{+}\bm{q^{\prime}\!}{+}\bm{q^{\prime}\!{}^{\prime}\!})roman_Δ ( bold_italic_q + bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT + bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT bold_′ end_FLOATSUPERSCRIPT ) is the Kronecker delta (equal to 1 if 𝒒+𝒒+𝒒\bm{q}{+}\bm{q^{\prime}\!}{+}\bm{q^{\prime}\!{}^{\prime}\!}bold_italic_q + bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT + bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT bold_′ end_FLOATSUPERSCRIPT is a reciprocal lattice vector, zero otherwise), δ𝛿\deltaitalic_δ is the Dirac delta. The last line in Eq. (1) accounts for the presence of isotopic-mass disorder and yields the following linewidth [40]

Γi(𝒒)s=π2Nc[ω(𝒒)s]2𝒒,sδ[ω(𝒒)sω(𝒒)s]×bg2b|α(𝒒)s,bα(𝒒)s,bα|2,Planck-constant-over-2-pisubscriptΓisubscript𝒒𝑠Planck-constant-over-2-pi𝜋2subscript𝑁𝑐superscriptdelimited-[]𝜔subscript𝒒𝑠2subscriptsuperscript𝒒bold-′superscript𝑠𝛿delimited-[]𝜔subscript𝒒𝑠𝜔subscriptsuperscript𝒒bold-′superscript𝑠subscript𝑏superscriptsubscript𝑔2𝑏superscriptsubscript𝛼subscriptsuperscript𝒒𝑠𝑏𝛼subscriptsuperscript𝒒bold-′superscript𝑠𝑏𝛼2\begin{split}\hbar\Gamma_{\rm{i}}(\bm{q})_{s}{=}&\frac{\hbar\pi}{2N_{c}}[% \omega(\bm{q})_{s}]^{2}{\textstyle\sum_{\bm{q^{\prime}},s^{\prime}}}\delta\big% {[}\omega(\bm{q})_{s}{-}\omega(\bm{q^{\prime}})_{s^{\prime}}\big{]}\\ &\times\textstyle\sum_{b}g_{2}^{b}\Big{|}\textstyle\sum_{\alpha}\mathcal{E}(% \bm{q})^{\star}_{s,b\alpha}\mathcal{E}(\bm{q^{\prime}})_{s^{\prime},b\alpha}% \Big{|}^{2},\end{split}start_ROW start_CELL roman_ℏ roman_Γ start_POSTSUBSCRIPT roman_i end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = end_CELL start_CELL divide start_ARG roman_ℏ italic_π end_ARG start_ARG 2 italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT end_ARG [ italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT , italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_δ [ italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT - italic_ω ( bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ] end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL × ∑ start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT | ∑ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT caligraphic_E ( bold_italic_q ) start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , italic_b italic_α end_POSTSUBSCRIPT caligraphic_E ( bold_italic_q start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_b italic_α end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , end_CELL end_ROW (4)

where g2b=ifi,b(Mbmi,bMb)2superscriptsubscript𝑔2𝑏subscript𝑖subscript𝑓𝑖𝑏superscriptsubscript𝑀𝑏subscript𝑚𝑖𝑏subscript𝑀𝑏2g_{2}^{b}=\sum_{i}f_{i,b}\big{(}\frac{M_{b}-m_{i,b}}{M_{b}}\big{)}^{2}italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_i , italic_b end_POSTSUBSCRIPT ( divide start_ARG italic_M start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT - italic_m start_POSTSUBSCRIPT italic_i , italic_b end_POSTSUBSCRIPT end_ARG start_ARG italic_M start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT describes the variance the isotopic masses of atom b𝑏bitalic_b (fi,bsubscript𝑓𝑖𝑏f_{i,b}italic_f start_POSTSUBSCRIPT italic_i , italic_b end_POSTSUBSCRIPT and mi,bsubscript𝑚𝑖𝑏m_{i,b}italic_m start_POSTSUBSCRIPT italic_i , italic_b end_POSTSUBSCRIPT are the mole fraction and mass, respectively, of the i𝑖iitalic_ith isotope of atom b𝑏bitalic_b; Mb=ifi,bmi,bsubscript𝑀𝑏subscript𝑖subscript𝑓𝑖𝑏subscript𝑚𝑖𝑏M_{b}=\sum_{i}f_{i,b}m_{i,b}italic_M start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_i , italic_b end_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT italic_i , italic_b end_POSTSUBSCRIPT is the weighted average mass).

The recently developed WTE [34, 35] allows to predict the thermal conductivity of solids accounting for the interplay between structural disorder, anharmonicity, and Bose-Einstein statistics of vibrations. This offers a comprehensive approach to describe ordered ‘simple crystals’ having phonon interband spacings much larger than the linewidths [41, 42], completely disordered glasses [43, 36, 44], as well as the intermediate regime of ‘complex crystals’ with interband spacings smaller than the linewidths [35, 45, 46]. To assess how the accuracy in the prediction of the conductivity is affected by the precision with which fMLPs describe harmonic and anharmonic (third-order) force constants in Eq. (1), it is sufficient to consider the conductivity obtained from the WTE solved in the single-mode relaxation-time approximation (SMA)

κ(T)=1𝒱Nc𝒒,sC(𝒒)s𝒗(𝒒)s,s23[Γ(𝒒)s]1+1𝒱Nc𝒒,ssω(𝒒)s+ω(𝒒)s4[C(𝒒)sω(𝒒)s+C(𝒒)sω(𝒒)s]𝒗(𝒒)s,s23×12[Γ(𝒒)s+Γ(𝒒)s][ω(𝒒)sω(𝒒)s]2+14[Γ(𝒒)s+Γ(𝒒)s]2;𝜅𝑇1𝒱subscript𝑁𝑐subscript𝒒𝑠𝐶subscript𝒒𝑠superscriptnorm𝒗subscript𝒒𝑠𝑠23superscriptdelimited-[]Γsubscript𝒒𝑠11𝒱subscript𝑁csubscript𝒒𝑠superscript𝑠𝜔subscript𝒒𝑠𝜔subscript𝒒superscript𝑠4delimited-[]𝐶subscript𝒒𝑠𝜔subscript𝒒𝑠𝐶subscript𝒒superscript𝑠𝜔subscript𝒒superscript𝑠superscriptnorm𝒗subscript𝒒𝑠superscript𝑠2312delimited-[]Γsubscript𝒒𝑠Γsubscript𝒒superscript𝑠superscriptdelimited-[]𝜔subscript𝒒𝑠𝜔subscript𝒒superscript𝑠214superscriptdelimited-[]Γsubscript𝒒𝑠Γsubscript𝒒superscript𝑠2\begin{split}&\kappa(T)=\frac{1}{\mathcal{V}N_{c}}\sum_{\bm{q},s}C(\bm{q})_{s}% \frac{|\!|\bm{\mathsfit{v}}(\bm{q})_{s,s}|\!|^{2}}{3}[\Gamma(\bm{q})_{s}]^{-1}% \\ &{+}\frac{1}{\mathcal{V}{N_{\rm c}}}{\sum_{\bm{q},s\neq s^{\prime}}}\frac{% \omega(\bm{q})_{s}{+}\omega(\bm{q})_{s^{\prime}}}{4}\!\left[\frac{C(\bm{q})_{s% }}{\omega(\bm{q})_{s}}{+}\frac{C(\bm{q})_{s^{\prime}\!}}{\omega(\bm{q})_{s^{% \prime}\!}}\right]\frac{|\!|\bm{\mathsfit{v}}(\bm{q})_{s,s^{\prime}}|\!|^{2}}{% 3}\\ &\hskip 42.67912pt\times\frac{\frac{1}{2}\big{[}\Gamma(\bm{q})_{s}{+}\Gamma(% \bm{q})_{s^{\prime}}\big{]}}{\big{[}\omega(\bm{q})_{s}{-}\omega(\bm{q})_{s^{% \prime}}\big{]}^{2}+\frac{1}{4}\big{[}\Gamma(\bm{q})_{s}{+}\Gamma(\bm{q})_{s^{% \prime}}\big{]}^{2}}\;;\end{split}start_ROW start_CELL end_CELL start_CELL italic_κ ( italic_T ) = divide start_ARG 1 end_ARG start_ARG caligraphic_V italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT bold_italic_q , italic_s end_POSTSUBSCRIPT italic_C ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT divide start_ARG | | bold_slanted_v ( bold_italic_q ) start_POSTSUBSCRIPT italic_s , italic_s end_POSTSUBSCRIPT | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 3 end_ARG [ roman_Γ ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + divide start_ARG 1 end_ARG start_ARG caligraphic_V italic_N start_POSTSUBSCRIPT roman_c end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT bold_italic_q , italic_s ≠ italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT divide start_ARG italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT + italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG 4 end_ARG [ divide start_ARG italic_C ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG start_ARG italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG + divide start_ARG italic_C ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_ARG ] divide start_ARG | | bold_slanted_v ( bold_italic_q ) start_POSTSUBSCRIPT italic_s , italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 3 end_ARG end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL × divide start_ARG divide start_ARG 1 end_ARG start_ARG 2 end_ARG [ roman_Γ ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT + roman_Γ ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ] end_ARG start_ARG [ italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT - italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG 1 end_ARG start_ARG 4 end_ARG [ roman_Γ ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT + roman_Γ ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ; end_CELL end_ROW (5)

where C(𝒒)s=2ω2(𝒒)skBT2N¯(𝒒)s[N¯(𝒒)s+1]𝐶subscript𝒒𝑠superscriptPlanck-constant-over-2-pi2superscript𝜔2subscript𝒒𝑠subscript𝑘Bsuperscript𝑇2¯𝑁subscript𝒒𝑠delimited-[]¯𝑁subscript𝒒𝑠1C(\bm{q})_{s}{=}\frac{\hbar^{2}\omega^{2}(\bm{q})_{s}}{k_{\rm B}T^{2}}\bar{% \mathsfit{N}}(\bm{q})_{s}\big{[}\bar{\mathsfit{N}}(\bm{q})_{s}{+}1\big{]}italic_C ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = divide start_ARG roman_ℏ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_ω start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG start_ARG italic_k start_POSTSUBSCRIPT roman_B end_POSTSUBSCRIPT italic_T start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG over¯ start_ARG slanted_N end_ARG ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT [ over¯ start_ARG slanted_N end_ARG ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT + 1 ] is the specific heat of the vibration with energy ω(𝒒)s𝜔subscript𝒒𝑠\omega(\bm{q})_{s}italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT and total linewidth Γ(𝒒)s=Γa(𝒒)s+Γi(𝒒)sΓsubscript𝒒𝑠subscriptΓasubscript𝒒𝑠subscriptΓisubscript𝒒𝑠\Gamma(\bm{q})_{s}{=}\Gamma_{\!\!\rm a}(\bm{q})_{s}{+}\Gamma_{\!\!\rm i}(\bm{q% })_{s}roman_Γ ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = roman_Γ start_POSTSUBSCRIPT roman_a end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT + roman_Γ start_POSTSUBSCRIPT roman_i end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT, vβ(𝒒)s,s=b,α,b,α(𝒒)s,bα[𝒒βD(𝒒)bα,bα](𝒒)s,bαsuperscript𝑣𝛽subscript𝒒𝑠superscript𝑠subscript𝑏𝛼superscript𝑏superscript𝛼superscriptsubscript𝒒𝑠𝑏𝛼delimited-[]subscriptsuperscript𝛽𝒒subscript𝐷𝒒𝑏𝛼superscript𝑏superscript𝛼subscript𝒒superscript𝑠superscript𝑏superscript𝛼\mathsfit{v}^{\beta}(\bm{q})_{s,s^{\prime}}{=}\!\sum_{\begin{subarray}{c}b,% \alpha,b^{\prime}\!,\alpha^{\prime}\end{subarray}}\!\mathcal{E}^{\star}(\bm{q}% )_{{s},b\alpha}[{{\nabla^{\beta}_{\bm{q}}\sqrt{\mathsfit{D}(\bm{q})}}}_{b% \alpha,b^{\prime}\alpha^{\prime}}]\mathcal{E}(\bm{q})_{s^{\prime},b^{\prime}% \alpha^{\prime}}slanted_v start_POSTSUPERSCRIPT italic_β end_POSTSUPERSCRIPT ( bold_slanted_q ) start_POSTSUBSCRIPT slanted_s , slanted_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL slanted_b , italic_α , slanted_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_CELL end_ROW end_ARG end_POSTSUBSCRIPT caligraphic_E start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( bold_slanted_q ) start_POSTSUBSCRIPT slanted_s , slanted_b italic_α end_POSTSUBSCRIPT [ ∇ start_POSTSUPERSCRIPT italic_β end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_slanted_q end_POSTSUBSCRIPT square-root start_ARG slanted_D ( bold_slanted_q ) end_ARG start_POSTSUBSCRIPT slanted_b italic_α , slanted_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ] caligraphic_E ( bold_slanted_q ) start_POSTSUBSCRIPT slanted_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , slanted_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT is the velocity operator coupling eigenstates s𝑠sitalic_s and ssuperscript𝑠s^{\prime}italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT at the same wavevector 𝒒𝒒\bm{q}bold_italic_q (its diagonal elements s=s𝑠superscript𝑠s=s^{\prime}italic_s = italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT are the usual phonon group velocities) [35], Ncsubscript𝑁cN_{\rm c}italic_N start_POSTSUBSCRIPT roman_c end_POSTSUBSCRIPT is the number of 𝒒𝒒\bm{q}bold_italic_q-points used to sample the Brillouin zone and 𝒱𝒱\mathcal{V}caligraphic_V is the crystal’s primitive-cell volume. The first line on the right-hand side of Eq (5) describes a conduction mechanism in which vibrations carry the heat C(𝒒)s𝐶subscript𝒒𝑠C(\bm{q})_{s}italic_C ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT by propagating particle-like with velocity 𝒗(𝒒)s,s𝒗subscript𝒒𝑠𝑠{\bm{\mathsfit{v}}(\bm{q})_{s,s}}bold_slanted_v ( bold_italic_q ) start_POSTSUBSCRIPT italic_s , italic_s end_POSTSUBSCRIPT over the lifetime [Γ(𝒒)s]1superscriptdelimited-[]Γsubscript𝒒𝑠1[\Gamma(\bm{q})_{s}]^{-1}[ roman_Γ ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT. It can be rigorously shown [34, 35] that it coincides with the conductivity emerging from the Peierls-Boltzmann equation [39], and will be henceforth referred to as κPsubscript𝜅P\kappa_{\rm P}italic_κ start_POSTSUBSCRIPT roman_P end_POSTSUBSCRIPT. The term on the second and third lines of Eq. (5) accounts for conduction through a wave-like tunneling mechanism between pairs of vibrational eigenstates. This term arises from the coherence between two different phonon modes s,s𝑠superscript𝑠s,s^{\prime}italic_s , italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT at the same wavevector 𝒒𝒒\bm{q}bold_italic_q (i.e., it becomes more signicant as their frequency difference ω(𝒒)sω(𝒒)s𝜔subscript𝒒𝑠𝜔subscript𝒒superscript𝑠\omega(\bm{q})_{s}-\omega(\bm{q})_{s^{\prime}}italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT - italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT becomes smaller) and is therefore referred to as ‘coherences conductivity’, κCsubscript𝜅C\kappa_{\rm C}italic_κ start_POSTSUBSCRIPT roman_C end_POSTSUBSCRIPT. It has been shown in Refs. [34, 35, 47] that in simple crystals κPκCmuch-greater-thansubscript𝜅Psubscript𝜅C\kappa_{{\rm P}}{\gg}\kappa_{{\rm C}}italic_κ start_POSTSUBSCRIPT roman_P end_POSTSUBSCRIPT ≫ italic_κ start_POSTSUBSCRIPT roman_C end_POSTSUBSCRIPT because particle-like propagation dominates over the wave-like tunneling. In contrast, in complex crystals both these mechanisms co-exist and can have comparable magnitude, implying κPκCsimilar-tosubscript𝜅Psubscript𝜅C\kappa_{{}_{\rm P}}{\sim}\kappa_{{}_{\rm C}}italic_κ start_POSTSUBSCRIPT start_FLOATSUBSCRIPT roman_P end_FLOATSUBSCRIPT end_POSTSUBSCRIPT ∼ italic_κ start_POSTSUBSCRIPT start_FLOATSUBSCRIPT roman_C end_FLOATSUBSCRIPT end_POSTSUBSCRIPT. Finally, Refs. [36, 44] have shown that in strongly disordered oxide glasses κPκCmuch-less-thansubscript𝜅Psubscript𝜅C\kappa_{{\rm P}}{\ll}\kappa_{{\rm C}}italic_κ start_POSTSUBSCRIPT roman_P end_POSTSUBSCRIPT ≪ italic_κ start_POSTSUBSCRIPT roman_C end_POSTSUBSCRIPT.

Vibrational and thermal properties from fMLPs
In general, κ(T)𝜅𝑇\kappa(T)italic_κ ( italic_T ) is highly sensitive to both harmonic and anharmonic vibrational properties [35, 45, 46, 10, 47, 48, 49, 37, 50]; therefore, the accuracy of fMLPs in describing these properties can be quantified by comparing their predictions for κ(T)𝜅𝑇\kappa(T)italic_κ ( italic_T ) with those obtained from reference first principles data. To accomplish this goal, we begin by calculating the harmonic and third-order anharmonic force constants that determine the solid’s vibrational energy (Eq. (1)). We do this by using either: (i) Density-Functional Theory (DFT) with the Perdew, Burke, and Ernzerhof (PBE) functional (see Refs. [30, 31] for details); or (ii) SOTA non-proprietary mp-fMLPs trained on DFT-PBE. Then, we employ these force constants to determine harmonic vibrational properties (frequencies, velocity operators, and isotopic linewidths, see Eq. (2) and Eq. (4)) and anharmonic linewidths (Eq. (3)). Subsequently, these atomistic vibrational properties are used in Eq. (5) to calculate the thermal conductivity of the 103 diverse compounds in the phononDB-PBE database. The database contains rocksalt, zincblende, and wurtzite binary compunds and involves 34 chemical species, including alkali metals (Cs, K, Li, Na, Rb), alkaline earth metals (Be, Mg, Ca, Sr, Ba), transition metals (Ag, Cu, Zn, Cd), post-transition metals (Al, Ga, In, Pb), metalloids (As, Sb, Si, Te, B), nonmetals (H, C, N, O, P, S, Se), and halogens (F, Cl, Br, I). Computational details are provided in the Methods.

Refer to caption
Figure 1: Thermal conductivity computed at 300 K from DFT-PBE or MACE-MP-0, for 103 binary compounds taken from the phononDB-PBE database, which have rocksalt (green), zincblende (orange), and wurtzite (blue) structure. Solid lines indicate perfect agreement, while dashed lines represent discrepancies of up to a factor of 2. The arrows highlight three selected materials, which will be analyzed in detail in terms of their microscopic vibrational properties later in the manuscript. The inset displays the distribution of relative deviations between the conductivities predicted by DFT-PBE and MACE-MP-0.
Refer to captiona) wurtzite BeO
Refer to captionb) zincblende BeTe
Refer to captionc) rocksalt RbH
Figure 2: Phonon bands, specific heat at constant volume, phonon frequency-linewidth distributions, and macroscopic thermal conductivity, for wurtzite BeO (panel a), zincblende BeTe (panel b), and rocksalt RbH (panel c). DFT-PBE values are computed using the data from Refs. [30, 31], while ‘MACE’ values are computed using the MACE-MP-0 fMLP[20]. For the DFT-PBE calculations, phonons are plotted considering the long-range non-analytical correction term (NAC, red lines or scatter points) or not (dotted green); MACE-MP-0 does not account for long-range contributions to the vibrational energy (see text). In the frequency-linewidths distributions at 300 K, we resolve the anharmonic (ΓasubscriptΓ𝑎\Gamma_{a}roman_Γ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT) and isotopic (ΓisubscriptΓ𝑖\Gamma_{i}roman_Γ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT) part of the linewidths; the areas of the markers are proportional to the contribution of the given phonon mode to the conductivity (see Eq. (6)). Specific heat and thermal conductivity are negligibly affected by considering or not the long-range NAC term.

In Fig. 1 we compare the thermal conductivity predicted from first principles (DFT-PBE) or from the mp-fMLP MACE-MP-0 (trained on DFT-PBE data) [20]. We highlight how predictions from DFT-PBE or MACE-MP-0 are generally consistent within a factor of 2 across most materials, see inset. We note that in about 20% of the compounds studied the semiclassical particle-like BTE fails to fully describe heat transport, and it is crucial to employ the more general WTE, see Fig. Thermal Conductivity Predictions with Foundation Atomistic Models in the Methods. These cases generally have ultra-low conductivity (κTOT2less-than-or-similar-tosubscript𝜅TOT2\kappa_{\rm TOT}\lesssim 2italic_κ start_POSTSUBSCRIPT roman_TOT end_POSTSUBSCRIPT ≲ 2 W/mK), and are characterized by having wave-like tunneling conductivity (κCsubscript𝜅𝐶\kappa_{C}italic_κ start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT, accounted for by the WTE but missing from the BTE) with magnitude comparable to the particle-like propagation conductivity (κPsubscript𝜅𝑃\kappa_{P}italic_κ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT, accounted for by both the WTE and BTE).

To understand the microscopic origin of the discrepancies in the macroscopic DFT-PBE and MACE-MP-0 conductivities, we select three representative materials — wurtzite BeO, zincblende BeTe, and rocksalt RbH — and show in Fig. 2 their phonon band structures, specific heat at constant volume, and the macroscopic thermal conductivity resolved in terms of contributions from microscopic phonon modes. Examining the phonon band structures in the first column, we observe that MACE-MP-0 tends to underestimate the high-frequency optical vibrational modes compared to DFT-PBE. To further investigate these differences, we plot the DFT-PBE phonons considering the non-analytical correction term [51] (NAC, red lines or scatter points), or not (dotted green line). This long-range interaction is responsible for the energy splitting between the longitudinal-optical and transversal-optical modes in polar dielectric materials [51], and is not fully considered in fMLPs trained using a radial force cutoff (e.g., for MACE-MP-0 such cutoff is 6 Å, while it is 5 Å  for SevenNet[21], CHGNet[19], and M3GNet[18]). This explains why the DFT-PBE phonons without NAC are in closer agreement with the MACE-MP-0 phonons (blue lines or scatter points). Importantly, it is noticeable that even without NAC, DFT-PBE frequencies tend to be higher than MACE-MP-0 frequencies, confirming the general tendency of MACE-MP-0 to underestimate vibrational frequencies discussed in Ref. [27]. Nevertheless, Fig. 2 shows that considering or not the NAC term has a negligible impact on the specific heat at constant volume and on the temperature-dependent conductivity (κ(T)𝜅𝑇\kappa(T)italic_κ ( italic_T )) of BeO, BeTe and RbH. This can be understood from the third column of Fig. 2, where we show that thermal transport in these materials is dominated by low-energy (acoustic) phonons, and these are negligibly affected by NAC. Specifically, these plots report the frequency-linewidth distributions, resolving both the anharmonic and isotopic contributions to the linewidths (Eq. 3 and Eq. 4, respectively), and also quantifying how much a single phonon mode contributes to the total conductivity with the following expression:

𝒦(𝒒)s=C(𝒒)s𝒗(𝒒)s,s23[Γ(𝒒)s]1+ssC(𝒒)sC(𝒒)s+C(𝒒)sω(𝒒)s+ω(𝒒)s2[C(𝒒sω(𝒒)s+C(𝒒sω(𝒒)s]×𝒗(𝒒)s,s2312[Γ(𝒒)s+Γ(𝒒)s][ω(𝒒)sω(𝒒)s]2+14[Γ(𝒒)s+Γ(𝒒)s]2.\begin{split}&{\mathcal{K}}(\bm{q})_{s}{=}C(\bm{q})_{s}\frac{|\!|\bm{\mathsfit% {v}}(\bm{q})_{s,s}|\!|^{2}}{3}[\Gamma(\bm{q})_{s}]^{-1}\\ &+\sum_{s^{\prime}\neq s}\!\frac{C(\bm{q})_{s}}{C(\bm{q})_{s}{+}C(\bm{q})_{s^{% \prime}\!}}\frac{\omega(\bm{q})_{s}{+}\omega(\bm{q})_{s^{\prime}\!}}{2}\!\!% \left[\frac{C(\bm{q}_{s}}{\omega(\bm{q})_{s}}{+}\frac{C(\bm{q}_{s^{\prime}\!}}% {\omega(\bm{q})_{s^{\prime}\!}}\right]\\ &\times\frac{|\!|\bm{\mathsfit{v}}(\bm{q})_{s,s^{\prime}}|\!|^{2}}{3}\frac{% \frac{1}{2}\left[\Gamma(\bm{q})_{s}{+}\Gamma(\bm{q})_{s^{\prime}\!}\right]}{[% \omega(\bm{q})_{s^{\prime}\!}{-}\omega(\bm{q})_{s}]^{2}{+}\frac{1}{4}[\Gamma(% \bm{q})_{s}{+}\Gamma(\bm{q})_{s^{\prime}\!}]^{2}}.\end{split}start_ROW start_CELL end_CELL start_CELL caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = italic_C ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT divide start_ARG | | bold_slanted_v ( bold_italic_q ) start_POSTSUBSCRIPT italic_s , italic_s end_POSTSUBSCRIPT | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 3 end_ARG [ roman_Γ ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + ∑ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≠ italic_s end_POSTSUBSCRIPT divide start_ARG italic_C ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG start_ARG italic_C ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT + italic_C ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_ARG divide start_ARG italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT + italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG [ divide start_ARG italic_C ( bold_italic_q start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG start_ARG italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG + divide start_ARG italic_C ( bold_italic_q start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_ARG ] end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL × divide start_ARG | | bold_slanted_v ( bold_italic_q ) start_POSTSUBSCRIPT italic_s , italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 3 end_ARG divide start_ARG divide start_ARG 1 end_ARG start_ARG 2 end_ARG [ roman_Γ ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT + roman_Γ ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ] end_ARG start_ARG [ italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG 1 end_ARG start_ARG 4 end_ARG [ roman_Γ ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT + roman_Γ ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG . end_CELL end_ROW (6)

This equation describes how much a single phonon mode (𝒒)ssubscript𝒒𝑠(\bm{q})_{s}( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT contributes to heat conduction. Specifically, the term on the first line accounts for the aforementioned particle-like conduction mechanism; the term on the second and third lines, instead, describes the wave-like conduction that originates from tunneling between two non-degenerate phonons (𝒒)ssubscript𝒒𝑠(\bm{q})_{s}( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT and (𝒒)ssubscript𝒒superscript𝑠(\bm{q})_{s^{\prime}}( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT — here the single-phonon contributions from (𝒒)ssubscript𝒒𝑠(\bm{q})_{s}( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT is resolved using the ratio between specific heats C(𝒒)sC(𝒒)s+C(𝒒)s𝐶subscript𝒒𝑠𝐶subscript𝒒𝑠𝐶subscript𝒒superscript𝑠\tfrac{C(\bm{q})_{s}}{C(\bm{q})_{s}+C(\bm{q})_{s^{\prime}}}divide start_ARG italic_C ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG start_ARG italic_C ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT + italic_C ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_ARG as weight, as discussed by Eq. (E3) in [35].

The frequency-linewidth distributions in Fig. 2a show overall agreement between DFT-PBE and MACE-MP-0 for the phonon modes that are mainly contributing to conduction — the acoustic modes and the optical modes with ω(𝒒)s<kBTPlanck-constant-over-2-pi𝜔subscript𝒒𝑠subscript𝑘𝐵𝑇\hbar\omega(\bm{q})_{s}<k_{B}Troman_ℏ italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT < italic_k start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT italic_T — and we see that this implies very similar values for the corresponding macroscopic κ(T)𝜅𝑇\kappa(T)italic_κ ( italic_T ) (with a relative difference <10%absentpercent10<10\%< 10 %). Importantly, Fig. 2b shows compatibility between κ(T)𝜅𝑇\kappa(T)italic_κ ( italic_T ) predicted from DFT-PBE or MACE-MP-0 can also result from cancellation of errors; specifically, in BeTe there are visible differences in the frequency-linewidth distributions, and these largely cancel out when integrated to determine the conductivity. Finally, Fig. 2c illustrates that the presence of systematic, non-compensating differences in the microscopic frequency-linewidth distributions can also directly translate into significant discrepancies on the macroscopic conductivities (difference of a factor of 2.7). Additionally, we note that depending on the the chemical composition, the anharmonic linewidths at room temperature can either dominate over the isotopic linewidth (e.g., in BeO) or not (e.g., in BeTe). In the former case, the thermal conductivity is practically unaffected by considering or not the isotopic contributions to the linewidth, while in the latter case considering isotopic scattering yields a reduction of κ(300K)𝜅300K\kappa(300\,\rm{K})italic_κ ( 300 roman_K ) of about 75%similar-toabsentpercent75{\sim}75\%∼ 75 % (see Table. 2).

The results above motivate us to investigate when good agreement between DFT and fMLP conductivities is obtained from accurately described microscopic harmonic and anharmonic vibrational properties, or because of compensation of errors. To achieve this goal, we resolve the discrepancies in the total macroscopic thermal conductivities using the Symmetric Relative Error (SRE),

SRE[κ]=2|κfMLPκDFT|κfMLP+κDFT.SREdelimited-[]𝜅2subscript𝜅fMLPsubscript𝜅DFTsubscript𝜅fMLPsubscript𝜅DFT\text{SRE}\big{[}\kappa\big{]}=2\frac{\left|{\kappa}_{\rm{fMLP}}-{\kappa}_{\rm% {DFT}}\right|}{{\kappa}_{\rm{fMLP}}+{\kappa}_{\rm{DFT}}}\,.SRE [ italic_κ ] = 2 divide start_ARG | italic_κ start_POSTSUBSCRIPT roman_fMLP end_POSTSUBSCRIPT - italic_κ start_POSTSUBSCRIPT roman_DFT end_POSTSUBSCRIPT | end_ARG start_ARG italic_κ start_POSTSUBSCRIPT roman_fMLP end_POSTSUBSCRIPT + italic_κ start_POSTSUBSCRIPT roman_DFT end_POSTSUBSCRIPT end_ARG . (7)

Then, to determine whether a low value for SRE[κ]SREdelimited-[]𝜅\text{SRE}\big{[}\kappa\big{]}SRE [ italic_κ ] originates from accurately described microscopic vibrational properties, or because of compensation of errors, we introduce the Symmetric Relative Mean Error (SRME) on the single-phonon conductivity contribution 𝒦(𝒒)s𝒦subscript𝒒𝑠\mathcal{K}(\bm{q})_{s}caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT:

SRME[{𝒦(𝒒)s}]=2Nc𝒱𝒒s|𝒦fMLP(𝒒)s𝒦DFT(𝒒)s|κfMLP+κDFT,SRMEdelimited-[]𝒦subscript𝒒𝑠2subscript𝑁𝑐𝒱subscript𝒒𝑠subscript𝒦fMLPsubscript𝒒𝑠subscript𝒦DFTsubscript𝒒𝑠subscript𝜅fMLPsubscript𝜅DFT\begin{split}\text{SRME}\big{[}\{\mathcal{K}(\bm{q})_{s}\}\big{]}{=}\frac{2}{N% _{c}\mathcal{V}}\frac{\sum_{\bm{q}s}\!\left|{\mathcal{K}}_{\rm{fMLP}}\!(\bm{q}% )_{s}{-}{\mathcal{K}}_{\rm{DFT}}\!(\bm{q})_{s}\right|}{{\kappa}_{\rm{fMLP}}+{% \kappa}_{\rm{DFT}}}\,,\;\;\end{split}start_ROW start_CELL SRME [ { caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT } ] = divide start_ARG 2 end_ARG start_ARG italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT caligraphic_V end_ARG divide start_ARG ∑ start_POSTSUBSCRIPT bold_italic_q italic_s end_POSTSUBSCRIPT | caligraphic_K start_POSTSUBSCRIPT roman_fMLP end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT - caligraphic_K start_POSTSUBSCRIPT roman_DFT end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT | end_ARG start_ARG italic_κ start_POSTSUBSCRIPT roman_fMLP end_POSTSUBSCRIPT + italic_κ start_POSTSUBSCRIPT roman_DFT end_POSTSUBSCRIPT end_ARG , end_CELL end_ROW (8)

where 𝒦DFT(𝒒)ssubscript𝒦DFTsubscript𝒒𝑠{\mathcal{K}}_{\rm{DFT}}\!(\bm{q})_{s}caligraphic_K start_POSTSUBSCRIPT roman_DFT end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT refers to Eq. (8) evaluated using DFT, and 𝒦fMLP(𝒒)ssubscript𝒦fMLPsubscript𝒒𝑠{\mathcal{K}}_{\rm{fMLP}}\!(\bm{q})_{s}caligraphic_K start_POSTSUBSCRIPT roman_fMLP end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT refers to the same equation evaluated using fMLP.

Refer to caption
Figure 3: Errors on conductivity: SRE & SRME, computed comparing DFT-PBE and fMLP (MACE-MP-0) predictions in 103 chemically and structurally diverse compounds. Large discrepancies between DFT-PBE and fMLP on microscopic, single-phonon conductivity contributions are described by large SRME (Eq. (8)), which implies a significant discrepancy between the macroscopic DFT-PBE and fMLP conductivities, and thus large SRE (Eq. (7)). Three materials are highlighted: wurtzite BeO shows low SRME and thus low SRE; rocksalt RbH displays high error in both SRME and SRE; zincblende BeTe exhibits high SRME but low SRE because of compensation of microscopic errors.

Fig. 3 illustrates that a large SRME generally implies large SRE. Importantly, knowing both SRE and SRME enables us to identify when microscopic error compensation occurs — this is indicated by a large SRME but small SRE. Three representative cases are highlighted in Fig. 3: wurtzite BeO shows low SRME and thus correspondingly low SRE; rocksalt RbH exhibits high error in both SRME and SRE; zincblende BeTe displays high SRME but low SRE due to compensation of microscopic errors. We note that the SRME[𝒦(𝒒)s𝒦subscript𝒒𝑠\mathcal{K}(\bm{q})_{s}caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT] error can stem from discrepancies in the harmonic (second-order) or anharmonic (third-order) force constants. Therefore, in Fig. 6 in the Methods we discuss how to disentangle the SRME and SRE into errors on harmonic vibrational properties and anharmonic vibrational properties.

Overall, this analysis highlights the importance of benchmarking the accuracy of fMLPs not only at the macroscopic but also at the microscopic level. In particular the macroscopic SRE[κ𝜅\kappaitalic_κ] alone is not a reliable descriptor for the fMLPs’ ability to capture the harmonic and anharmonic physics underlying heat conduction. However, achieving both a small macroscopic SRE[κ𝜅\kappaitalic_κ] and a small microscopic SRME[κ𝜅\kappaitalic_κ] is a sufficient condition for accurately describing both the microscopic physics and the macroscopic thermal conductivity.

Accuracy of various SOTA foundation models
The previous section demonstrated that the SRE and SRME descriptors can effectively measure the accuracy of fMLPs in describing vibrational and thermal properties. Therefore, here we utilize these descriptors to compare the accuracy of different non-proprietary mp-fMLPs on the 103 diverse structures contained in the phononDB-PBE database[30, 31]. Fig. 4 compares the accuracy of the four foundation models M3GNet[18], CHGNet[19], MACE-MP-0[20], and SevenNet[21] mp-fMLPs. To determine if a certain mp-fMLP tend to systematically overestimate or underestimate the conductivity, it is useful to rely on the Symmetric Relative Difference (SRD) in the total Wigner conductivity from DFT-PBE or mp-fMLP,

SRD[κ]=2κfMLPκDFTκfMLP+κDFT,SRDdelimited-[]𝜅2superscript𝜅fMLPsuperscript𝜅DFTsuperscript𝜅fMLPsuperscript𝜅DFT\rm{SRD}\big{[}\kappa\big{]}=2\frac{\kappa^{\rm fMLP}-\kappa^{\rm DFT}}{\kappa% ^{\rm fMLP}+\kappa^{\rm DFT}},roman_SRD [ italic_κ ] = 2 divide start_ARG italic_κ start_POSTSUPERSCRIPT roman_fMLP end_POSTSUPERSCRIPT - italic_κ start_POSTSUPERSCRIPT roman_DFT end_POSTSUPERSCRIPT end_ARG start_ARG italic_κ start_POSTSUPERSCRIPT roman_fMLP end_POSTSUPERSCRIPT + italic_κ start_POSTSUPERSCRIPT roman_DFT end_POSTSUPERSCRIPT end_ARG , (9)

which ranges from -2 to +2, resolving both overestimation and underestimation of the conductivity. Figure  4a summarizes the SRD for all 103 binary compounds in the phononDB-PBE database, as depicted in the violin plot [52]. In this plot, the width of each violin shape is related to the percentage of materials with SRD values indicated on the y-axis. The median SRD value is marked by a white scatter point, while the black boxes illustrate the interquartile range, and the whiskers show 1.51.51.51.5 times the interquartile range below and above the first and third quartile points. Unstable structures with negative phonon frequencies were included considering κfMLP=0superscript𝜅fMLP0\kappa^{\rm fMLP}=0italic_κ start_POSTSUPERSCRIPT roman_fMLP end_POSTSUPERSCRIPT = 0, i.e., SRD=22-2- 2. This analysis reveals that all the mp-fMLPs assessed in this study tend to underestimate the thermal conductivity. This finding is consistent with the observed systematic underestimation of vibrational frequencies noted in Ref. [27], and might also derived from overestimation of anharmonic linewidths.

Refer to captiona)
Refer to captionb)
Figure 4: Performance of mp-fMLPs in calculating thermal conductivity. Panel a presents violin plots depicting the Symmetric Relative Difference (SRD) in the total Wigner thermal conductivity (Eq. (9)) for mp-fMLPs SevenNet[21], MACE-MP-0[20], CHGNet[19], M3GNet[18]. Panel b shows the SRME for mode-resolved thermal conductivity. In both panels, the medians are marked with white scatter points, the widths of the boxes represent the interquartile range, and the whiskers show the range of data points without outliers.

To examine whether the SRD shown in Fig. 4a is influenced by error compensation, we analyze in Fig. 4b the violin plots for SRME[{𝒦(𝒒)s}]SRMEdelimited-[]𝒦subscript𝒒𝑠\text{SRME}\big{[}\{\mathcal{K}(\bm{q})_{s}\}\big{]}SRME [ { caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT } ]. We see that the SRME distribution for MACE-MP-0 is the closest to zero, followed by SevenNet, M3GNet, and CHGNet. In addition, to quantify the overall accuracy of a fMLPs in predicting the macroscopic conductivity over a materials’ database, without resolving the possible compensation of microscopic errors, it is informative to consider the mean of the modulus of the deviations, i.e., the mean of the distribution of SRE (7) — we prefer this over the mean of the SRD distribution, as the latter can be close to zero in the presence of very broad but symmetric distribution. Importantly, we note that the mean for SRE[κ𝜅\kappaitalic_κ] and mean for SRME[𝒦(𝒒)s𝒦subscript𝒒𝑠\mathcal{K}(\bm{q})_{s}caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT] are expected to be comparable in the absence of compensation of microscopic errors. In contrast, in the presence of compensation of microscopic errors, the mean SRE[κ𝜅\kappaitalic_κ] is significantly lower than mean SRME[𝒦(𝒒)s𝒦subscript𝒒𝑠\mathcal{K}(\bm{q})_{s}caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT]. The mean values for SRME[𝒦(𝒒)s𝒦subscript𝒒𝑠\mathcal{K}(\bm{q})_{s}caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT] and SRE[κ𝜅\kappaitalic_κ] are reported in Table Thermal Conductivity Predictions with Foundation Atomistic Models.

SevenNet MACE CHGNet M3GNet
Mean in
SRE[κ]SREdelimited-[]𝜅\rm{SRE}[\kappa]roman_SRE [ italic_κ ] 0.5970.5970.5970.597 0.5120.5120.5120.512 1.6951.6951.6951.695 1.3971.3971.3971.397
Mean in
SRME[{𝒦(𝐪)s}]SRMEdelimited-[]𝒦subscript𝐪s\rm{SRME}[\{\mathcal{K}(\bm{q})_{s}\}]roman_SRME [ { caligraphic_K ( bold_q ) start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT } ] 0.7670.7670.7670.767 0.6640.6640.6640.664 1.7171.7171.7171.717 1.4691.4691.4691.469
Table 1: Mean of SRE[κ]SREdelimited-[]𝜅\text{SRE}[\kappa]SRE [ italic_κ ] and SRME[{𝒦(q)s}]SRMEdelimited-[]𝒦subscript𝑞𝑠\text{SRME}[\{\mathcal{K}(\bm{q})_{s}\}]SRME [ { caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT } ] for different mp-fMLPs. Different (similar) values for SRE[κ]SREdelimited-[]𝜅\text{SRE}[\kappa]SRE [ italic_κ ] and SRME[{𝒦(𝒒)s}]SRMEdelimited-[]𝒦subscript𝒒𝑠\text{SRME}[\{\mathcal{K}(\bm{q})_{s}\}]SRME [ { caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT } ] indicate presence (absence) of compensation of microscopic errors. Smaller values for SRME[{𝒦(𝒒)s}]SRMEdelimited-[]𝒦subscript𝒒𝑠\text{SRME}[\{\mathcal{K}(\bm{q})_{s}\}]SRME [ { caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT } ] correspond to higher accuracy in predicting both microscopic vibrational properties, and macroscopic thermal conductivity.

In summary, our findings reveal that the mean SRE[κ𝜅\kappaitalic_κ] and the mean in SRME[𝒦(𝒒)s𝒦subscript𝒒𝑠\mathcal{K}(\bm{q})_{s}caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT] are effective metrics for assessing the accuracy of mp-fMLPs in predicting macroscopic thermal conductivity. These metrics not only indicate the precision in predicting the macroscopic conductivity (low SRE[κ𝜅\kappaitalic_κ]), but can also assess whether accurate conductivity predictions follow from accurate estimates of microscopic vibrational and thermal properties (low SRE[κ𝜅\kappaitalic_κ] and low SRME[𝒦(𝒒)s𝒦subscript𝒒𝑠\mathcal{K}(\bm{q})_{s}caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT]) or result from compensation of microscopic errors (low SRE[κ𝜅\kappaitalic_κ] and large SRME[𝒦(𝒒)s𝒦subscript𝒒𝑠\mathcal{K}(\bm{q})_{s}caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT]).

Discussion We have introduced a robust and automated computational framework for directly predicting the thermal conductivity of solids using foundation Machine Learning Potentials. This framework has been employed to benchmark the out-of-the-box accuracy of SOTA fMLPs trained on the Materials Project DFT-PBE database [38] (mp-fMLPs) — SevenNet[21], MACE-MP-0[20], CHGNet[19], and M3GNet[18] — in predicting both microscopic harmonic and anharmonic vibrational properties, and the macroscopic thermal conductivity. In particular, we compared predictions from mp-fMLP against those obtained from the first-principles phononDB-PBE databasem, which includes 103 diverse compounds, made up of 34 different chemical species and having wurtzite, zincblende, or rocksalt structure [30, 31]. Our results show that the overall most accurate mp-fMLPs, MACE-MP-0[20], yields conductivities compatible within a factor of two of the DFT-PBE values for 69% of the materials in the phononDB-PBE database[30, 31]. We have shown that non-proprietary SOTA mp-fMLPs yield thermal conductivity predictions that tend to underestimate the corresponding first-principles DFT-PBE values, quantifying these discrepancies with the parameter SRD[κ]delimited-[]𝜅[\kappa][ italic_κ ]. Most importantly, we have introduced a descriptor quantifies the performance of mp-fMLPs in predicting both microscopic harmonic and anharmonic vibrational properties, and the macroscopic, experimentally measurable thermal conductivity — the Symmetric Relative Mean Error on the conductivity SRME[𝒦(𝒒)s]delimited-[]𝒦subscript𝒒𝑠[{\mathcal{K}(\bm{q})_{s}}\big{]}[ caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ]. This descriptor provides a metric that could be useful to training or fine-tune fMLPs, and its mean computed over a database of materials can be used to benchmark the accuracy of different families of mp-fMLPs. We have also provided computational recipes for incorporating the Mean in SRME[𝒦(𝒒)s𝒦subscript𝒒𝑠\mathcal{K}(\bm{q})_{s}caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT] into standard benchmark frameworks used to assess the accuracy of fMLPs [25]. Finally, we have discussed how our automated framework to compute the Wigner conductivity can accurately describe materials that violate semiclassical Boltzmann transport. These often feature very low conductivity (2less-than-or-similar-toabsent2\lesssim 2≲ 2 W/mK) and are potentially relevant for e.g., heat-shielding [10] and thermoelectric [9] technologies. Ultimately, the computational framework and analyses that we presented pave the way for quantum-accurate, physics-aware predictions of thermal conductivity of materials with arbitrary composition and structure.

Figure 5: Failures of semiclassical Boltzmann transport in binary crystals with low conductivity and having rocksalt (green), zincblende (orange), and wurtzite (blue) structure. These occur when the conductivity described by the BTE in terms of particle-like propagation of phonons (κPsubscript𝜅𝑃\kappa_{P}italic_κ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT) does not dominate over the coherences conductivity determined by wave-like tunneling of phonons (κCsubscript𝜅𝐶\kappa_{C}italic_κ start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT). The WTE [35] accounts for both κPsubscript𝜅𝑃\kappa_{P}italic_κ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT and κCsubscript𝜅𝐶\kappa_{C}italic_κ start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT, and failures from semiclassical Boltzmann transport are highlighted by a relative strength of particle-wave transport (κPκC)/(κP+κC)subscript𝜅𝑃subscript𝜅𝐶subscript𝜅𝑃subscript𝜅𝐶(\kappa_{P}-\kappa_{C})/(\kappa_{P}+\kappa_{C})( italic_κ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT - italic_κ start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) / ( italic_κ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT + italic_κ start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT ) appreciably smaller than one. These conductivities were obtained from the DFT-PBE force constants in the phononDB-PBE dataset.

Methods Failures of semiclassical Boltzmann transport in binary crystals with low conductivity. Our automated framework can be used to identify materials in which the semiclassical BTE fails. In particular, the BTE [39] describes heat propagation exclusively in terms of particle-like phonons that propagate and scatter akin to the particles of a classical gas. While the BTE is accurate in weakly anharmonic and ordered ‘simple crystal’ (characterized by phonon interband spacings much larger than the linewidths[34, 41]), it fails in strongly anharmonic or disordered materials (with phonon interband spacings smaller than the linewidths). It has been recently shown [34, 35, 45, 48, 10] that in the latter cases heat conduction originates not only from particle-like propagation but also from wave-like interband tunneling of phonons. The recently developed WTE generalizes the BTE by accounting for both particle-like propagation and wave-like tunneling contributions to the conductivity (κPsubscript𝜅𝑃\kappa_{P}italic_κ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT and κCsubscript𝜅𝐶\kappa_{C}italic_κ start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT, respectively; the total conductivity is given by κTOT=κP+κCsubscript𝜅𝑇𝑂𝑇subscript𝜅𝑃subscript𝜅𝐶\kappa_{TOT}=\kappa_{P}+\kappa_{C}italic_κ start_POSTSUBSCRIPT italic_T italic_O italic_T end_POSTSUBSCRIPT = italic_κ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT + italic_κ start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT), providing a comprehensive approach that overcomes the failures of the BTE.

Our automated framework computes the Wigner thermal conductivity[35], and therefore can be used to find materials that have very low thermal conductivity, relevant for thermoelectric or heat-shielding technologies. In Fig. Thermal Conductivity Predictions with Foundation Atomistic Models we show that among the 103 compounds analyzed, the BTE fails in materials with low thermal conductivity (κTOT2less-than-or-similar-tosubscript𝜅TOT2\kappa_{\rm TOT}\lesssim 2italic_κ start_POSTSUBSCRIPT roman_TOT end_POSTSUBSCRIPT ≲ 2 W/mK), such as AgBr and AgCl, since the wave-like tunneling conductivity becomes comparable to the particle-like propagation conductivity. We also note that in simple crystals with large conductivity, the BTE is accurate as the population conductivity dominates over the wave-like tunneling conductivity.

Symmetric Relative Mean Error on harmonic and anharmonic vibrational properties. The analyses reported in the main text discusses SRME[𝒦(𝒒)s]delimited-[]𝒦subscript𝒒𝑠[{\mathcal{K}(\bm{q})_{s}}\big{]}[ caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ] as a descriptor that is informative of the accuracy of fMLPs in describing microscopic harmonic and anharmonic properties. In this section we provide the tools to resolve whether the SRME[𝒦(𝒒)s]delimited-[]𝒦subscript𝒒𝑠[{\mathcal{K}(\bm{q})_{s}}\big{]}[ caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ] originates from errors in microscopic harmonic properties, anharmonic properties, or both. To this aim, we employ the expression for the microscopic, single-phonon contributions to the thermal conductivity (6) to define descriptors that quantity the accuracy with which harmonic and anharmonic properties are predicted. In particular, we note that the conductivity contribution of a phonon (𝒒)ssubscript𝒒𝑠(\bm{q})_{s}( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT is a function of: (i) all the phonon frequencies {ω(𝒒s}\{\omega(\bm{q}_{s^{\prime}}\}{ italic_ω ( bold_italic_q start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } at a fixed wavevector 𝒒𝒒\bm{q}bold_italic_q and variable mode s=1,,Nssuperscript𝑠1subscript𝑁ss^{\prime}=1,\dots,N_{\rm s}italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 1 , … , italic_N start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT (Nssubscript𝑁sN_{\rm s}italic_N start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT is the number of phonon bands, equal to 3 times the number of atoms in the crystal’s unit cell [53]); (ii) all velocity-operator elements {𝒗(𝒒)s,s}𝒗subscript𝒒𝑠superscript𝑠\{\bm{\mathsfit{v}}(\bm{q})_{s,s^{\prime}}\}{ bold_slanted_v ( bold_italic_q ) start_POSTSUBSCRIPT italic_s , italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } at fixed 𝒒𝒒\bm{q}bold_italic_q, s𝑠sitalic_s and variable ssuperscript𝑠s^{\prime}italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT; (iii) all isotopic linewidths at fixed 𝒒𝒒\bm{q}bold_italic_q and variable ssuperscript𝑠s^{\prime}italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT, {Γi(𝒒)s}subscriptΓisubscript𝒒superscript𝑠\{\Gamma_{\!\rm i}(\bm{q})_{s^{\prime}}\}{ roman_Γ start_POSTSUBSCRIPT roman_i end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT }, which depend solely on harmonic properties, see Eq. 4; (iv) the anharmonic linewidths {Γa(𝒒)s}subscriptΓasubscript𝒒superscript𝑠\{\Gamma_{\!\rm a}(\bm{q})_{s^{\prime}}\}{ roman_Γ start_POSTSUBSCRIPT roman_a end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } at fixed 𝒒𝒒\bm{q}bold_italic_q and variable ssuperscript𝑠s^{\prime}italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. This can be summarized using the following notation: 𝒦(𝒒)s=𝒦(𝒒)s[{ω(𝒒)s},{𝒗(𝒒)s,s},{Γi(𝒒)s},{Γa(𝒒)s}]𝒦subscript𝒒𝑠𝒦subscript𝒒𝑠𝜔subscript𝒒superscript𝑠𝒗subscript𝒒𝑠superscript𝑠subscriptΓisubscript𝒒superscript𝑠subscriptΓasubscript𝒒superscript𝑠{\mathcal{K}}(\bm{q})_{s}={\mathcal{K}}(\bm{q})_{s}[\{\omega(\bm{q})_{s^{% \prime}}\},\{\bm{\mathsfit{v}}(\bm{q})_{s,s^{\prime}}\},\{\Gamma_{\rm i}(\bm{q% })_{s^{\prime}}\},\{\Gamma_{\rm a}(\bm{q})_{s^{\prime}}\}]caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT [ { italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } , { bold_slanted_v ( bold_italic_q ) start_POSTSUBSCRIPT italic_s , italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } , { roman_Γ start_POSTSUBSCRIPT roman_i end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } , { roman_Γ start_POSTSUBSCRIPT roman_a end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } ] where the curly brackets denote the set of values of a certain quantity over all the bands at fixed 𝒒𝒒\bm{q}bold_italic_q (e.g., {ω(𝒒)s}={ω(𝒒)1,ω(𝒒)2,,ω(𝒒)Ns}𝜔subscript𝒒superscript𝑠𝜔subscript𝒒1𝜔subscript𝒒2𝜔subscript𝒒subscript𝑁𝑠\{\omega(\bm{q})_{s^{\prime}}\}=\{\omega(\bm{q})_{1},\omega(\bm{q})_{2},\dots,% \omega(\bm{q})_{N_{s}}\}{ italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } = { italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT }).

To quantify the impact of errors on the harmonic (har) and anharmonic (anh) properties on the single-phonon conductivity contributions, we define the Symmetric Relative Mean Error (SRME) on these properties as follows:

SRME[har]=2𝒒s|𝒦(𝒒)s[{ωfMLP(𝒒)s},{𝒗fMLP(𝒒)s,s},{Γi,fMLP(𝒒)s},{Γa,DFT(𝒒)s}]𝒦DFT(𝒒)s|𝒒s(𝒦(𝒒)s[{ωfMLP(𝒒)s},{𝒗fMLP(𝒒)s,s},{Γi,fMLP(𝒒)s},{Γa,DFT(𝒒)s}]+𝒦DFT(𝒒)s),SRMEdelimited-[]har2subscript𝒒𝑠𝒦subscript𝒒𝑠subscript𝜔fMLPsubscript𝒒superscript𝑠subscript𝒗fMLPsubscript𝒒𝑠superscript𝑠subscriptΓifMLPsubscript𝒒superscript𝑠subscriptΓaDFTsubscript𝒒superscript𝑠subscript𝒦DFTsubscript𝒒𝑠subscript𝒒𝑠𝒦subscript𝒒𝑠subscript𝜔fMLPsubscript𝒒superscript𝑠subscript𝒗fMLPsubscript𝒒𝑠superscript𝑠subscriptΓifMLPsubscript𝒒superscript𝑠subscriptΓaDFTsubscript𝒒superscript𝑠subscript𝒦DFTsubscript𝒒𝑠\text{SRME}[{\rm{har}}]{=}2\frac{\sum_{\bm{q}s}\left|{\mathcal{K}}(\bm{q})_{s}% \big{[}\{\omega_{{\rm fMLP}}(\bm{q})_{s^{\prime}}\},\{\bm{\mathsfit{v}}_{{\rm fMLP% }}(\bm{q})_{s,s^{\prime}}\},\{\Gamma_{{\!\rm i,fMLP}}(\bm{q})_{s^{\prime}}\},% \{\Gamma_{{\!\rm a,DFT}}(\bm{q})_{s^{\prime}}\}\big{]}-{\mathcal{K}}_{\rm DFT}% (\bm{q})_{s}\right|}{\sum_{\bm{q}s}\left({\mathcal{K}}(\bm{q})_{s}\big{[}\{% \omega_{{\rm fMLP}}(\bm{q})_{s^{\prime}}\},\{\bm{\mathsfit{v}}_{{\rm fMLP}}(% \bm{q})_{s,s^{\prime}}\},\{\Gamma_{{\!\rm i,fMLP}}(\bm{q})_{s^{\prime}}\},\{% \Gamma_{{\!\rm a,DFT}}(\bm{q})_{s^{\prime}}\}\big{]}+{\mathcal{K}}_{\rm DFT}(% \bm{q})_{s}\right)},SRME [ roman_har ] = 2 divide start_ARG ∑ start_POSTSUBSCRIPT bold_italic_q italic_s end_POSTSUBSCRIPT | caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT [ { italic_ω start_POSTSUBSCRIPT roman_fMLP end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } , { bold_slanted_v start_POSTSUBSCRIPT roman_fMLP end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s , italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } , { roman_Γ start_POSTSUBSCRIPT roman_i , roman_fMLP end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } , { roman_Γ start_POSTSUBSCRIPT roman_a , roman_DFT end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } ] - caligraphic_K start_POSTSUBSCRIPT roman_DFT end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT | end_ARG start_ARG ∑ start_POSTSUBSCRIPT bold_italic_q italic_s end_POSTSUBSCRIPT ( caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT [ { italic_ω start_POSTSUBSCRIPT roman_fMLP end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } , { bold_slanted_v start_POSTSUBSCRIPT roman_fMLP end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s , italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } , { roman_Γ start_POSTSUBSCRIPT roman_i , roman_fMLP end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } , { roman_Γ start_POSTSUBSCRIPT roman_a , roman_DFT end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } ] + caligraphic_K start_POSTSUBSCRIPT roman_DFT end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) end_ARG , (10)
SRME[anh]=2𝒒s|𝒦(𝒒)s[{ωDFT(𝒒)s},{𝒗DFT(𝒒)s,s},{Γi,DFT(𝒒)s},{Γa,fMLP(𝒒)s}]𝒦DFT(𝒒)s|𝒒s(𝒦(𝒒)s[{ωDFT(𝒒)s},{𝒗DFT(𝒒)s,s},{Γi,DFT(𝒒)s},{Γa,fMLP(𝒒)s}]+𝒦DFT(𝒒)s),SRMEdelimited-[]anh2subscript𝒒𝑠𝒦subscript𝒒𝑠subscript𝜔DFTsubscript𝒒superscript𝑠subscript𝒗DFTsubscript𝒒𝑠superscript𝑠subscriptΓiDFTsubscript𝒒superscript𝑠subscriptΓafMLPsubscript𝒒superscript𝑠subscript𝒦DFTsubscript𝒒𝑠subscript𝒒𝑠𝒦subscript𝒒𝑠subscript𝜔DFTsubscript𝒒superscript𝑠subscript𝒗DFTsubscript𝒒𝑠superscript𝑠subscriptΓiDFTsubscript𝒒superscript𝑠subscriptΓafMLPsubscript𝒒superscript𝑠subscript𝒦DFTsubscript𝒒𝑠\text{SRME}[{\rm{anh}}]{=}2\frac{\sum_{\bm{q}s}\left|{\mathcal{K}}(\bm{q})_{s}% \big{[}\{\omega_{{\rm DFT}}(\bm{q})_{s^{\prime}}\},\{\bm{\mathsfit{v}}_{{\rm DFT% }}(\bm{q})_{s,s^{\prime}}\},\{\Gamma_{{\!\rm i,DFT}}(\bm{q})_{s^{\prime}}\},\{% \Gamma_{{\!\rm a,fMLP}}(\bm{q})_{s^{\prime}}\}\big{]}-{\mathcal{K}}_{\rm DFT}(% \bm{q})_{s}\right|}{\sum_{\bm{q}s}\left({\mathcal{K}}(\bm{q})_{s}\big{[}\{% \omega_{{\rm DFT}}(\bm{q})_{s^{\prime}}\},\{\bm{\mathsfit{v}}_{{\rm DFT}}(\bm{% q})_{s,s^{\prime}}\},\{\Gamma_{{\!\rm i,DFT}}(\bm{q})_{s^{\prime}}\},\{\Gamma_% {{\!\rm a,fMLP}}(\bm{q})_{s^{\prime}}\}\big{]}+{\mathcal{K}}_{\rm DFT}(\bm{q})% _{s}\right)},SRME [ roman_anh ] = 2 divide start_ARG ∑ start_POSTSUBSCRIPT bold_italic_q italic_s end_POSTSUBSCRIPT | caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT [ { italic_ω start_POSTSUBSCRIPT roman_DFT end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } , { bold_slanted_v start_POSTSUBSCRIPT roman_DFT end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s , italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } , { roman_Γ start_POSTSUBSCRIPT roman_i , roman_DFT end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } , { roman_Γ start_POSTSUBSCRIPT roman_a , roman_fMLP end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } ] - caligraphic_K start_POSTSUBSCRIPT roman_DFT end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT | end_ARG start_ARG ∑ start_POSTSUBSCRIPT bold_italic_q italic_s end_POSTSUBSCRIPT ( caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT [ { italic_ω start_POSTSUBSCRIPT roman_DFT end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } , { bold_slanted_v start_POSTSUBSCRIPT roman_DFT end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s , italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } , { roman_Γ start_POSTSUBSCRIPT roman_i , roman_DFT end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } , { roman_Γ start_POSTSUBSCRIPT roman_a , roman_fMLP end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } ] + caligraphic_K start_POSTSUBSCRIPT roman_DFT end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) end_ARG , (11)

where we have used the shorthand notation 𝒦DFT(𝒒)s=𝒦DFT(𝒒)s[{ωDFT(𝒒)s},{𝒗DFT(𝒒)s,s},{Γi,DFT(𝒒)s},{Γa,DFT(𝒒)s}]subscript𝒦DFTsubscript𝒒𝑠subscript𝒦DFTsubscript𝒒𝑠subscript𝜔DFTsubscript𝒒superscript𝑠subscript𝒗DFTsubscript𝒒𝑠superscript𝑠subscriptΓiDFTsubscript𝒒superscript𝑠subscriptΓaDFTsubscript𝒒superscript𝑠\mathcal{K}_{{\!\rm DFT}}\!(\bm{q})_{\!s}=\mathcal{K}_{{}_{\!\rm DFT}}\!(\bm{q% })_{\!s}\big{[}\{\!\omega_{{}_{\!\rm DFT}}\!(\bm{q})_{\!s^{\prime}}\!\}{,}\{\!% \bm{\mathsfit{v}}_{{}_{\!\rm DFT}}\!(\bm{q})_{\!s{,}s^{\prime}}\!\}{,}\{\!% \Gamma_{{\rm i{,}DFT}}\!(\bm{q})_{\!s^{\prime}}\!\}{,}\{\!\Gamma_{{\rm a{,}DFT% }}\!(\bm{q})_{\!s^{\prime}}\!\}\big{]}caligraphic_K start_POSTSUBSCRIPT roman_DFT end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = caligraphic_K start_POSTSUBSCRIPT start_FLOATSUBSCRIPT roman_DFT end_FLOATSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT [ { italic_ω start_POSTSUBSCRIPT start_FLOATSUBSCRIPT roman_DFT end_FLOATSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } , { bold_slanted_v start_POSTSUBSCRIPT start_FLOATSUBSCRIPT roman_DFT end_FLOATSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s , italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } , { roman_Γ start_POSTSUBSCRIPT roman_i , roman_DFT end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } , { roman_Γ start_POSTSUBSCRIPT roman_a , roman_DFT end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } ]. Intuitively, SRME[har] (10) is large when harmonic vibrational properties (ω(𝒒)s𝜔subscript𝒒𝑠\omega(\bm{q})_{s}italic_ω ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT, or 𝒗(𝒒)s,s𝒗subscript𝒒𝑠superscript𝑠\bm{\mathsfit{v}}(\bm{q})_{s,s^{\prime}}bold_slanted_v ( bold_italic_q ) start_POSTSUBSCRIPT italic_s , italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT, or Γi(𝒒)ssubscriptΓisubscript𝒒𝑠\Gamma_{{\!\rm i}}(\bm{q})_{s}roman_Γ start_POSTSUBSCRIPT roman_i end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT) differ between DFT and fMLP, while SRME[anh] (11) is large when the anharmonic linewidths (Γa(𝒒)ssubscriptΓasubscript𝒒𝑠\Gamma_{{\!\rm a}}(\bm{q})_{s}roman_Γ start_POSTSUBSCRIPT roman_a end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT) differ between DFT and fMLP. We show in Fig. 6 that small SRME[har] and small SRME[anh] (e.g., as in BeO), imply a small SRE[κ𝜅\kappaitalic_κ]

Refer to caption
Figure 6: Relation between harmonic and anharmonic errors, and thermal conductivity, calculated for 103 compounds using Eq. (10) and Eq. (11) considering DFT and MACE-MP-0[20]. The area of the marker is proportional to the symmetric percentage error in the average total conductivity (SRE[κ𝜅\kappaitalic_κ], see legend).We highlight how BeO (RbH) has low (large) SRME[har], SRME[anh], and SRE[κ𝜅\kappaitalic_κ]; in contrast, BeTe displays large SRME[har], SRME[anh], but low SRE[κ𝜅\kappaitalic_κ] due to compensation of errors (see text).

Fig. 6 also shows that often large SRME[har] and large SRME[anh] (e.g. RbH) translate into large SRE[κ]SREdelimited-[]𝜅\rm{SRE}\big{[}\kappa\big{]}roman_SRE [ italic_κ ]; however there also cases (e.g. BeTe) in which large SRME[har] and large SRME[anh] can also compensate each other and result in a small SRE[κ]SREdelimited-[]𝜅\rm{SRE}\big{[}\kappa\big{]}roman_SRE [ italic_κ ]. This confirms the statements made in the main text on SRE[κ]SREdelimited-[]𝜅\rm{SRE}\big{[}\kappa\big{]}roman_SRE [ italic_κ ] not being a reliable descriptor for the capability of fMLPs to capture the harmonic and anharmonic physics underlying heat conduction. In contrast, having both small SRME[har] and small SRME[anh] is a sufficient condition to accurately capture harmonic and anharmonic vibrational properties, as well as thermal conductivity.

Overall, these tests highlight the importance of benchmarking for the accuracy of fMLPs in predicting both microscopic harmonic and anharmonic vibrational properties, motivating the introduction of the SRME[{𝒦(𝒒)s}𝒦subscript𝒒𝑠\{\mathcal{K}(\bm{q})_{s}\}{ caligraphic_K ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT }].

Influence of isotopic scattering on conductivity.

wurtzite BeO zincblende BeTe rocksalt RbH
DFT MACE DFT MACE DFT MACE
with Γi(𝒒)ssubscriptΓisubscript𝒒𝑠\Gamma_{\rm{i}}(\bm{q})_{s}roman_Γ start_POSTSUBSCRIPT roman_i end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT 286.391 259.443 78.246 69.138 4.281 11.682
without Γi(𝒒)ssubscriptΓisubscript𝒒𝑠\Gamma_{\rm{i}}(\bm{q})_{s}roman_Γ start_POSTSUBSCRIPT roman_i end_POSTSUBSCRIPT ( bold_italic_q ) start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT 291.963 263.956 289.374 402.030 4.682 14.672
Table 2: Influence of isotopic scattering on the thermal conductivity at 300 K for BeO, BeTe, and RbH.

To analyze the influence of isotope scattering on the conductivity, we compare thermal conductivity values with and without considering the linewidths from isotope mass-disorder, as described in Eq. (4). The results are summarized in Table 2. In wurtzite BeO and rocksalt RbH, the impact of isotope scattering is minimal or small, respectively. However, in zincblende BeTe, as shown in Fig. 2b, isotopic and anharmonic linewidths have comparable values, both affecting the thermal conductivity. In this material, MACE-MP-0 significantly overestimates some linewidths compared to DFT-PBE, and this compensates for the overestimation in other regions. Consequently, MACE-MP-0 shows a significantly larger error in thermal conductivity when isotope scattering is not considered.

Automated Wigner Conductivity Workflow. The workflow, outlined in Fig. 7, includes structure relaxation, force constant computation, and thermal conductivity analysis. To automate the calculation of interatomic forces, we developed an interface between the ase[54] (version 3.23.0) and the phono3py[30, 55] (version 3.2.0) packages. Atomic positions and cell parameters were simultaneously relaxed to minimize stresses and ensure positive phonon frequencies. Since the determination of irreducible 𝒒𝒒\bm{q}bold_italic_q points during thermal conductivity calculations depends on the crystal symmetry, it is essential to consider symmetries. During the initial relaxation stage, symmetries are explicitly enforced as constraints while simultaneously relaxing atomic positions and cell parameters. After each step of atomic position relaxation, cell parameters are adjusted if the total forces exceed a a certain threshold (values are fMLP-specific and discussed later). In the second stage, the symmetry constraint is removed to allow for finer atomic adjustments and a complete relaxation of the structure. The joint relaxation of atomic positions and cell parameters is applied once more. If the symmetry is preserved, the final relaxed structure is used. If symmetry is broken, the structure from the first stage, with enforced symmetry, is retained. Finally, an additional relaxation of atomic positions with fixed cell parameters is performed using a stricter threshold to ensure the structure reaches a stable energy minimum at a fixed volume for accurate phonon calculations.

Refer to caption
Figure 7: Automated Wigner Conductivity Workflow. The first part of the workflow, involving force evaluation, is performed on a GPU. The input structures undergo relaxation in two stages. In the first stage, initial symmetries are enforced while simultaneously relaxing atomic positions and cell parameters. In the second stage, the structure is further relaxed by removing the symmetry constraint. If the symmetry group of the final structure changes, the structure with enforced symmetry relaxation is used instead. After initializing the displacement structures, the force sets are calculated for each displacement. CPUs handle the remainder of the workflow. Force constants are constructed from the displacement force sets, and thermal conductivity is calculated using phono3py. Structures with negative phonon frequencies are discarded from the final analysis (conducted on a PC).

To ensure positive phonons, strict thresholds were established, tailored to the performance of each individual model. M3GNet required thresholds of 2.0×103 eV/Å2.0superscript103 eV/Å2.0\times 10^{-3}\text{ eV/\AA}2.0 × 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT eV/Å and 8.0×104 eV/Å8.0superscript104 eV/Å8.0\times 10^{-4}\text{ eV/\AA}8.0 × 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT eV/Å. For CHGNet, MACE-MP-0, and SevenNet, the thresholds were set to 104 eV/Åsuperscript104 eV/Å10^{-4}\text{ eV/\AA}10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT eV/Å and 5.0×105 eV/Å5.0superscript105 eV/Å5.0\times 10^{-5}\text{ eV/\AA}5.0 × 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT eV/Å for initial and final values, respectively. In the supercell force-constant calculations, we used the same parameters used in the DFT reference data[30, 31].

Thermal conductivities were computed using phono3py following Ref. [30, 55]. A 19×19×1919191919\times 19\times 1919 × 19 × 19 q𝑞qitalic_q-mesh was used for rocksalt and zincblende structures, and a 19×19×1519191519\times 19\times 1519 × 19 × 15 q𝑞qitalic_q-mesh for wurtzite structures, with collision operator computed using the tetrahedron method. For Fig. 2, anharmonic linewidths were calculated on a 9×9×99999\times 9\times 99 × 9 × 9 q𝑞qitalic_q-mesh to ensure clarity. We utilized version 3 of phono3py for anharmonic linewidth calculations and employed the Wang method [56, 57] for the NAC term, as implemented in the phonopy[55, 58] package (version 2.26.6). When structures with negative phonon frequencies were found with mp-fMLPs, these were discarded from subsequent thermal-conductivity analysis.

DFT and mp-fMLP calculations. Structures were taken from the Materials Project (MP)[38]. The Perdew, Burke, and Ernzerhof (PBE) exchange-correlation functional [59] was employed for DFT calculations, consistent with the approach used for generating MP dataset. The computational details for the DFT calculations can be found in Refs. [30, 31]. As discussed in Refs. [18, 19, 20, 21], mp-fMLP were trained on the MP data using the PBE functional. M3GNet was used through the matgl[60] (version 1.1.2) package with the M3GNet-MP-2021.2.8-PES model. The CHGNet version 0.3.8 was used. The MACE-MP-0 2024-01-07-mace-128-L2 model was run through LAMMPS[61]. SevenNet was used through the sevenn package (version 0.9.2) with the SevenNet-0_11July2024 model.

Computational cost. The SevenNet, MACE, and CHGNet calculations were performed on an Nvidia Tesla A100 SXM4 80GB GPU, with the relaxation and force calculations for all 103 structures costing approximately 3similar-toabsent3\sim 3∼ 3 GPU hours per model. M3GNet was executed on AMD EPYC 7702 CPUs, with relaxation and force calculations costing around 30similar-toabsent30\sim 30∼ 30 CPU hours. Thermal conductivity calculations for all 103 structures required about 50similar-toabsent50\sim 50∼ 50 CPU hours per model, and similar resources were needed for evaluating thermal conductivity from the DFT force constants. Overall, the analysis of these four models required approximately 9similar-toabsent9\sim 9∼ 9 GPU hours and 280similar-toabsent280\sim 280∼ 280 CPU hours.

Data availability. The phononDB-PBE dataset including the displacements generated by phono3py and the corresponding force sets are available https://github.com/atztogo/phonondb [30, 31]. The remaining dataset will be made available in the future.

Code availability. The phono3py and phonopy packages are available at https://github.com/phonopy/; the ase package is available at https://gitlab.com/ase/ase. matgl containing the M3GNet model is available at https://github.com/materialsvirtuallab/matgl; the CHGNet model and package is available at https://github.com/CederGroupHub/chgnet; the SevenNet model and package is available at https://github.com/MDIL-SNU/SevenNet. The MACE used through LAMMPS[61] is available at https://github.com/ACEsuit/lammps, and the MACE-MP-0 model is available at https://github.com/ACEsuit/mace-mp. The automatic framework and the analysis code will be made available soon.

Competing interests. The authors declare no competing interests.

Author contributions. M.S. conceived and supervised the project. B.P. developed the automated computational framework and performed the numerical calculations with inputs from P.A., G.C., and M.S. B.P. and M.S. analyzed and organized the data, with inputs from P.A. All authors contributed to discussing the data, writing and editing the manuscript.

Acknowledgements

M. S. acknowledges support from: (i) Gonville and Caius College; (ii) the Swiss National Science Foundation (SNSF) project P500PT_203178; (iii) the Sulis Tier 2 HPC platform (funded by EPSRC Grant EP/T022108/1 and the HPC Midlands+consortium); (iv) the Kelvin2 HPC platform at the NI-HPC Centre (funded by EPSRC and jointly managed by Queen’s University Belfast and Ulster University). P.A. acknowledges support from SNSF through Post.Doc mobility fellowship P500PN_206693. We thank William Chuck Witt for the useful discussions, and we gratefully acknowledge Atsushi Togo for having provided harmonic and anharmonic force constants computed using density functional theory (PBE functional) from the phononDB-PBE database [30, 31].

References