Search | arXiv e-print repository

inlabru: software for fitting latent Gaussian models with non-linear predictors

Authors: Finn Lindgren, Fabian Bachl, Janine Illian, Man Ho Suen, Håvard Rue, Andrew E. Seaton

Abstract: The integrated nested Laplace approximation (INLA) method has become a popular approach for computationally efficient approximate Bayesian computation. In particular, by leveraging sparsity in random effect precision matrices, INLA is commonly used in spatial and spatio-temporal applications. However, the speed of INLA comes at the cost of restricting the user to the family of latent Gaussian models and the likelihoods currently implemented in {INLA}, the main software implementation of the INLA methodology. {inlabru} is a software package that extends the types of models that can be fitted using INLA by allowing the latent predictor to be non-linear in its parameters, moving beyond the additive linear predictor framework to allow more complex functional relationships. For inference it uses an approximate iterative method based on the first-order Taylor expansion of the non-linear predictor, fitting the model using INLA for each linearised model configuration. {inlabru} automates much of the workflow required to fit models using {R-INLA}, simplifying the process for users to specify, fit and predict from models. There is additional support for fitting joint likelihood models by building each likelihood individually. {inlabru} also supports the direct use of spatial data structures, such as those implemented in the {sf} and {terra} packages. In this paper we outline the statistical theory, model structure and basic syntax required for users to understand and develop their own models using {inlabru}. We evaluate the approximate inference method using a Bayesian method checking approach. We provide three examples modelling simulated spatial data that demonstrate the benefits of the additional flexibility provided by {inlabru}.

Submitted 30 June, 2024; originally announced July 2024.

MSC Class: 62-04

arXiv:2403.10680 [pdf, other]

Spatio-temporal Occupancy Models with INLA

Authors: Jafet Belmont, Sara Martino, Janine Illian, Håvard Rue

Abstract: Modern methods for quantifying and predicting species distribution play a crucial part in biodiversity conservation. Occupancy models are a popular choice for analyzing species occurrence data as they allow one to separate the observational error induced by imperfect detection from the sources of bias affecting the occupancy process. However, the spatial and temporal variation in occupancy not accounted for by environmental covariates is often ignored or modelled through simple spatial structures, as the computational cost of fitting explicit spatio-temporal models is too high. In this work, we demonstrate how INLA may be used to fit complex occupancy models and how the R-INLA package can provide a user-friendly interface to make such complex models available to users. We show how occupancy models, provided some simplification on the detection process, can be framed as latent Gaussian models and benefit from the powerful INLA machinery. A large selection of complex modelling features and random effect models have already been implemented in R-INLA. These become available for occupancy models, providing the user with an efficient and flexible toolbox. We illustrate how INLA provides a computationally efficient framework for developing and fitting complex occupancy models using two case studies. Through these, we show how different spatio-temporal models that include spatially varying trends, smooth terms, and spatio-temporal random effects can be fitted. At the cost of limiting the complexity of the detection model, INLA can incorporate a range of complex structures in the process. INLA-based occupancy models provide an alternative framework to fit complex spatio-temporal occupancy models. The need for new and more flexible computational approaches to fit such models makes INLA an attractive option for addressing complex ecological problems, and a promising area of research.

Submitted 15 March, 2024; originally announced March 2024.

arXiv:2402.08335 [pdf, other]

Joint Modeling of Multivariate Longitudinal and Survival Outcomes with the R package INLAjoint

Authors: Denis Rustand, Janet van Niekerk, Elias Teixeira Krainski, Håvard Rue

Abstract: This paper introduces the R package INLAjoint, designed as a toolbox for fitting a diverse range of regression models addressing both longitudinal and survival outcomes. INLAjoint relies on the computational efficiency of the integrated nested Laplace approximations methodology, an efficient alternative to Markov chain Monte Carlo for Bayesian inference, ensuring both speed and accuracy in parameter estimation and uncertainty quantification. The package facilitates the construction of complex joint models by treating individual regression models as building blocks, which can be assembled to address specific research questions. Joint models are relevant in biomedical studies where the collection of longitudinal markers alongside censored survival times is common. They have gained significant interest in recent literature, demonstrating the ability to rectify biases present in separate modeling approaches such as informative censoring by a survival event or confusion bias due to population heterogeneity. We provide a comprehensive overview of the joint modeling framework embedded in INLAjoint with illustrative examples. Through these examples, we demonstrate the practical utility of INLAjoint in handling complex data scenarios encountered in biomedical research.

Submitted 3 April, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

arXiv:2312.06289 [pdf, other]

A graphical framework for interpretable correlation matrix models

Authors: Anna Freni Sterrantino, Denis Rustand, Janet van Niekerk, Elias Teixeira Krainski, Håvard Rue

Abstract: In this work, we present a new approach for constructing models for correlation matrices with a user-defined graphical structure. The graphical structure makes correlation matrices interpretable and avoids the quadratic increase of parameters as a function of the dimension. We suggest an automatic approach to define a prior using a natural sequence of simpler models within the Penalized Complexity framework for the unknown parameters in these models. We illustrate this approach with three applications: a multivariate linear regression of four biomarkers, a multivariate disease mapping, and a multivariate longitudinal joint modelling. Each application underscores our method's intuitive appeal, signifying a substantial advancement toward a more cohesive and enlightening model that facilitates a meaningful interpretation of correlation matrices.

Submitted 11 December, 2023; originally announced December 2023.

arXiv:2312.01166 [pdf]

Enhanced spatial modeling on linear networks using Gaussian Whittle-Matérn fields

Authors: Somnath Chaudhuri, Maria A. Barceló, Pablo Juan, Diego Varga, David Bolin, Haavard Rue, Marc Saez

Abstract: Spatial statistics is traditionally based on stationary models on $\mathbb{R}^d$ like Matérn fields. The adaptation of traditional spatial statistical methods, originally designed for stationary models in Euclidean spaces, to effectively model phenomena on linear networks such as stream systems and urban road networks is challenging. The current study aims to analyze the incidence of traffic accidents on road networks using three different methodologies and compare the model performance for each methodology. Initially, we analyzed the application of spatial triangulation precisely on road networks instead of traditional continuous regions. However, this approach posed challenges in areas with complex boundaries, leading to the emergence of artificial spatial dependencies. To address this, we applied an alternative computational method to construct nonstationary barrier models. Finally, we explored a recently proposed class of Gaussian processes on compact metric graphs, the Whittle-Matérn fields, defined by a fractional SPDE on the metric graph. The latter fields are a natural extension of Gaussian fields with Matérn covariance functions on Euclidean domains to non-Euclidean metric graph settings. A ten-year period (2010-2019) of daily traffic-accident records from Barcelona, Spain has been used to evaluate the three models referred to above. While comparing model performance we observed that the Whittle-Matérn fields defined directly on the network outperformed the network triangulation and barrier models. Due to their flexibility, the Whittle-Matérn fields can be applied to a wide range of environmental problems on linear networks such as spatio-temporal modeling of water contamination in stream networks or modeling air quality or accidents on urban road networks.

Submitted 9 December, 2023; v1 submitted 2 December, 2023; originally announced December 2023.

Comments: 24 pages, 9 figures

arXiv:2311.17100 [pdf, other]

doi 10.1016/j.spasta.2024.100843

Automatic cross-validation in structured models: Is it time to leave out leave-one-out?

Authors: A. Adin, E. Krainski, A. Lenzi, Z. Liu, J. Martínez-Minaya, H. Rue

Abstract: Standard techniques such as leave-one-out cross-validation (LOOCV) might not be suitable for evaluating the predictive performance of models incorporating structured random effects. In such cases, the correlation between the training and test sets could have a notable impact on the model's prediction error. To overcome this issue, an automatic group construction procedure for leave-group-out cross validation (LGOCV) has recently emerged as a valuable tool for enhancing predictive performance measurement in structured models. The purpose of this paper is (i) to compare LOOCV and LGOCV within structured models, emphasizing model selection and predictive performance, and (ii) to provide real data applications in spatial statistics using complex structured models fitted with INLA, showcasing the utility of the automatic LGOCV method. First, we briefly review the key aspects of the recently proposed LGOCV method for automatic group construction in latent Gaussian models. We also demonstrate the effectiveness of this method for selecting the model with the highest predictive performance by simulating extrapolation tasks in both temporal and spatial data analyses. Finally, we provide insights into the effectiveness of the LGOCV method in modelling complex structured data, encompassing spatio-temporal multivariate count data, spatial compositional data, and spatio-temporal geospatial data.

Submitted 7 March, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

Journal ref: Spatial Statistics (2024)

arXiv:2311.08050 [pdf, other]

INLA+ -- Approximate Bayesian inference for non-sparse models using HPC

Authors: Esmail Abdul-Fattah, Janet Van Niekerk, Haavard Rue

Abstract: The integrated nested Laplace approximations (INLA) method has become a widely utilized tool for researchers and practitioners seeking to perform approximate Bayesian inference across various fields of application. To address the growing demand for incorporating more complex models and enhancing the method's capabilities, this paper introduces a novel framework that leverages dense matrices for performing approximate Bayesian inference based on INLA across multiple computing nodes using HPC. When dealing with non-sparse precision or covariance matrices, this new approach scales better compared to the current INLA method, capitalizing on the computational power offered by multiprocessors in shared and distributed memory architectures available in contemporary computing resources and specialized dense matrix algebra. To validate the efficacy of this approach, we conduct a simulation study and then apply it to analyze cancer mortality data in Spain, employing a three-way spatio-temporal interaction model.

Submitted 14 November, 2023; originally announced November 2023.

Comments: 29 pages, 13 figures

arXiv:2310.06130 [pdf, other]

Statistical inference for radially-stable generalized Pareto distributions and return level-sets in geometric extremes

Authors: Ioannis Papastathopoulos, Lambert de Monte, Ryan Campbell, Haavard Rue

Abstract: We use a functional analogue of the quantile function for probability measures admitting a continuous Lebesgue density on $\mathbb{R}^d$ to characterise the class of non-trivial limit distributions of radially recentered and rescaled multivariate exceedances. A new class of multivariate distributions is identified, termed radially-stable generalised Pareto distributions, and is shown to admit certain stability properties that permit extrapolation to extremal sets along any direction in cones such as $\mathbb{R}^d$ and $\mathbb{R}_+^d$. Leveraging the limit Poisson point process likelihood of the point process of radially renormalised exceedances, we develop parsimonious statistical models that exploit theoretical links between structural star-bodies and are amenable to Bayesian inference. Our framework sharpens statistical inference by suitably including additional information from the angular directions of the geometric exceedances and facilitates efficient computations in dimensions $d=2$ and $d=3$. Additionally, it naturally leads to the notion of return level-set, which is a canonical quantile set expressed in terms of its average recurrence interval, and a geometric analogue of the uni-dimensional return level. We illustrate our methods with a simulation study showing superior predictive performance of probabilities of rare events, and with two case studies, one associated with river flow extremes, and the other with oceanographic extremes.

Submitted 23 January, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

Comments: 80 pages, 32 figures

arXiv:2309.05435 [pdf, other]

Parallel Selected Inversion for Space-Time Gaussian Markov Random Fields

Authors: Abylay Zhumekenov, Elias T. Krainski, Håvard Rue

Abstract: Performing Bayesian inference on large spatio-temporal models requires extracting inverse elements of large sparse precision matrices for marginal variances. Although direct matrix factorizations can be used for the inversion, such methods fail to scale well for distributed problems when run on large computing clusters. In contrast, Krylov subspace methods for the selected inversion have been gaining traction. We propose a parallel hybrid approach based on domain decomposition, which extends the Rao-Blackwellized Monte Carlo estimator for distributed precision matrices. Our approach exploits the strength of Krylov subspace methods as global solvers and the efficiency of direct factorizations as base case solvers to compute the marginal variances using a divide-and-conquer strategy. By introducing subdomain overlaps, one can achieve greater accuracy at an increased computational effort with little to no additional communication. We demonstrate the speed improvements on both simulated models and a massive US daily temperature dataset.

Submitted 11 September, 2023; originally announced September 2023.

Comments: 17 pages, 7 figures

arXiv:2308.13928 [pdf, other]

doi 10.1007/s11222-024-10427-3

A flexible Bayesian tool for CoDa mixed models: logistic-normal distribution with Dirichlet covariance

Authors: Joaquín Martínez-Minaya, Haavard Rue

Abstract: Compositional Data Analysis (CoDa) has gained popularity in recent years. This type of data consists of values from disjoint categories that sum up to a constant. Both Dirichlet regression and logistic-normal regression have become popular as CoDa analysis methods. However, fitting this kind of multivariate model presents challenges, especially when structured random effects are included in the model, such as temporal or spatial effects. To overcome these challenges, we propose the logistic-normal Dirichlet Model (LNDM). We seamlessly incorporate this approach into the R-INLA package, facilitating model fitting and model prediction within the framework of Latent Gaussian Models (LGMs). Moreover, we explore metrics like the Deviance Information Criterion (DIC), the Watanabe-Akaike information criterion (WAIC), and the cross-validation measure conditional predictive ordinate (CPO) for model selection in R-INLA for CoDa. Illustrating LNDM through a simple simulated example and with an ecological case study on Arabidopsis thaliana in the Iberian Peninsula, we underscore its potential as an effective tool for managing CoDa and large CoDa databases.

Submitted 8 November, 2023; v1 submitted 26 August, 2023; originally announced August 2023.

Journal ref: Statistics and Computing (2024)

arXiv:2307.12365 [pdf, other]

Robustness, model checking and latent Gaussian models

Authors: Rafael Cabral, David Bolin, Håvard Rue

Abstract: Model checking is essential to evaluate the adequacy of statistical models and the validity of inferences drawn from them. Particularly, hierarchical models such as latent Gaussian models (LGMs) pose unique challenges as it is difficult to check assumptions about the distribution of the latent parameters. Discrepancy measures are often used to quantify the degree to which a model fit deviates from the observed data. We construct discrepancy measures by (a) defining an alternative model with relaxed assumptions and (b) deriving the discrepancy measure most sensitive to discrepancies induced by this alternative model. We also promote a workflow for model criticism that combines model checking with subsequent robustness analysis. As a result, we obtain a general recipe to check assumptions in LGMs and the impact of these assumptions on the results. We demonstrate the ideas by assessing the latent Gaussianity assumption, a crucial but often overlooked assumption in LGMs. We illustrate the methods via examples utilising Stan and provide functions for easy usage of the methods for general models fitted through R-INLA.

Submitted 23 July, 2023; originally announced July 2023.

Comments: 40 pages, 21 figures

MSC Class: 62A01; 62C10; 62F03; 62F35

arXiv:2306.17236 [pdf, other]

Non-stationary Bayesian Spatial Model for Disease Mapping based on Sub-regions

Authors: Esmail Abdul Fattah, Elias Krainski, Janet van Niekerk, Håvard Rue

Abstract: This paper aims to extend the Besag model, a widely used Bayesian spatial model in disease mapping, to a non-stationary spatial model for irregular lattice-type data. The goal is to improve the model's ability to capture complex spatial dependence patterns and increase interpretability. The proposed model uses multiple precision parameters, accounting for different intensities of spatial dependence in different sub-regions. We derive a joint penalized complexity prior for the flexible local precision parameters to prevent overfitting and ensure contraction to the stationary model at a user-defined rate. The proposed methodology can be used as a basis for the development of various other non-stationary effects over other domains such as time. An accompanying R package 'fbesag' equips the reader with the necessary tools for immediate use and application. We illustrate the novelty of the proposal by modeling the risk of dengue in Brazil, where the stationary spatial assumption fails and interesting risk profiles are estimated when accounting for spatial non-stationarity.

Submitted 29 June, 2023; originally announced June 2023.

arXiv:2303.15254 [pdf, other]

Integrated Nested Laplace Approximations for Large-Scale Spatial-Temporal Bayesian Modeling

Authors: Lisa Gaedke-Merzhäuser, Elias Krainski, Radim Janalik, Håvard Rue, Olaf Schenk

Abstract: Bayesian inference tasks continue to pose a computational challenge. This especially holds for spatial-temporal modeling where high-dimensional latent parameter spaces are ubiquitous. The methodology of integrated nested Laplace approximations (INLA) provides a framework for performing Bayesian inference applicable to a large subclass of additive Bayesian hierarchical models. In combination with the stochastic partial differential equations (SPDE) approach it gives rise to an efficient method for spatial-temporal modeling. In this work we build on the INLA-SPDE approach, by putting forward a performant distributed memory variant, INLA-DIST, for large-scale applications. To perform the arising computational kernel operations, consisting of Cholesky factorizations, solving linear systems, and selected matrix inversions, we present two numerical solver options, a sparse CPU-based library and a novel blocked GPU-accelerated approach which we propose. We leverage the recurring nonzero block structure in the arising precision (inverse covariance) matrices, which allows us to employ dense subroutines within a sparse setting. Both versions of INLA-DIST are highly scalable, capable of performing inference on models with millions of latent parameters. We demonstrate their accuracy and performance on synthetic as well as real-world climate dataset applications.

Submitted 27 March, 2023; originally announced March 2023.

Comments: 22 pages, 14 figures

arXiv:2303.15041 [pdf, other]

Towards black-box parameter estimation

Authors: Amanda Lenzi, Haavard Rue

Abstract: Deep learning algorithms have recently been shown to be a successful tool in estimating parameters of statistical models for which simulation is easy, but likelihood computation is challenging. But the success of these approaches depends on simulating parameters that sufficiently reproduce the observed data, and, at present, there is a lack of efficient methods to produce these simulations. We develop new black-box procedures to estimate parameters of statistical models based only on weak parameter structure assumptions. For well-structured likelihoods with frequent occurrences, such as in time series, this is achieved by pre-training a deep neural network on an extensive simulated database that covers a wide range of data sizes. For other types of complex dependencies, an iterative algorithm guides simulations to the correct parameter region in multiple rounds. These approaches can successfully estimate and quantify the uncertainty of parameters from non-Gaussian models with complex spatial and temporal dependencies. The success of our methods is a first step towards a fully flexible automatic black-box estimation framework.

Submitted 19 February, 2024; v1 submitted 27 March, 2023; originally announced March 2023.

arXiv:2212.10976 [pdf, other]

Bayesian Inference for Multivariate Spatial Models with R-INLA

Authors: Francisco Palmí-Perales, Virgilio Gómez-Rubio, Roger S Bivand, Michela Cameletti, Håvard Rue

Abstract: Bayesian methods and software for spatial data analysis are generally now well established in the scientific community. Despite the wide application of spatial models, the analysis of multivariate spatial data using R-INLA has not been widely described in the existing literature. Therefore, the main objective of this article is to demonstrate that R-INLA is a convenient toolbox to analyse different types of multivariate spatial datasets. Additionally, this will be illustrated by analysing three datasets which are publicly available. Furthermore, the details and the R code of these analyses are provided to exemplify how to adjust multivariate spatial datasets with R-INLA.

Submitted 21 December, 2022; originally announced December 2022.

Comments: Submitted to the RJournal (19 pages and 6 figures)

arXiv:2212.01900 [pdf, other]

Bayesian survival analysis with INLA

Authors: Danilo Alvares, Janet van Niekerk, Elias Teixeira Krainski, Håvard Rue, Denis Rustand

Abstract: This tutorial shows how various Bayesian survival models can be fitted using the integrated nested Laplace approximation in a clear, legible, and comprehensible manner using the INLA and INLAjoint R-packages. Such models include accelerated failure time, proportional hazards, mixture cure, competing risks, multi-state, frailty, and joint models of longitudinal and survival data, originally presented in the article "Bayesian survival analysis with BUGS" (Alvares et al., 2021). In addition, we illustrate the implementation of a new joint model for a longitudinal semicontinuous marker, recurrent events, and a terminal event. Our proposal aims to provide the reader with syntax examples for implementing survival models using a fast and accurate approximate Bayesian inferential approach.

Submitted 18 March, 2024; v1 submitted 4 December, 2022; originally announced December 2022.

arXiv:2211.11050 [pdf, other]

Fitting latent non-Gaussian models using variational Bayes and Laplace approximations

Authors: Rafael Cabral, David Bolin, Håvard Rue

Abstract: Latent Gaussian models (LGMs) are perhaps the most commonly used class of models in statistical applications. Nevertheless, in areas ranging from longitudinal studies in biostatistics to geostatistics, it is easy to find datasets that contain inherently non-Gaussian features, such as sudden jumps or spikes, that adversely affect the inferences and predictions made from an LGM. These datasets require more general latent non-Gaussian models (LnGMs) that can handle these non-Gaussian features automatically. However, fast implementations and easy-to-use software are lacking, which prevents LnGMs from becoming widely applicable. In this paper, we derive variational Bayes algorithms for fast and scalable inference of LnGMs. The approximation leads to an LGM that downweights extreme events in the latent process, reducing their impact and leading to more robust inferences. It can be applied to a wide range of models, such as autoregressive processes for time series, simultaneous autoregressive models for areal data, and spatial Matérn models. To facilitate Bayesian inference, we introduce the ngvb package, where LGMs implemented in R-INLA can be easily extended to LnGMs by adding a single line of code.

Submitted 20 November, 2022; originally announced November 2022.

Comments: 30 pages, 10 figures

MSC Class: 62M20; 62F15; 62F35

arXiv:2210.04482 [pdf, other]

Leave-group-out cross-validation for latent Gaussian models

Authors: Zhedong Liu, Haavard Rue

Abstract: Evaluating the predictive performance of a statistical model is commonly done using cross-validation. Although the leave-one-out method is frequently employed, its application is justified primarily for independent and identically distributed observations. However, this method tends to mimic interpolation rather than prediction when dealing with dependent observations. This paper proposes a modified cross-validation for dependent observations. This is achieved by excluding an automatically determined set of observations from the training set to mimic a more reasonable prediction scenario. Also, within the framework of latent Gaussian models, we illustrate a method to adjust the joint posterior for this modified cross-validation to avoid model refitting. This new approach is accessible in the R-INLA package (www.r-inla.org).

Submitted 12 October, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

arXiv:2210.03222 [pdf, other]

doi 10.1103/PhysRevLett.129.132701

Deep underground laboratory measurement of $^{13}$C($α$,$n$)$^{16}$O in the Gamow windows of the $s$- and $i$-processes

Authors: B. Gao, T. Y. Jiao, Y. T. Li, H. Chen, W. P. Lin, Z. An, L. H. Ru, Z. C. Zhang, X. D. Tang, X. Y. Wang, N. T. Zhang, X. Fang, D. H. Xie, Y. H. Fan, L. Ma, X. Zhang, F. Bai, P. Wang, Y. X. Fan, G. Liu, H. X. Huang, Q. Wu, Y. B. Zhu, J. L. Chai, J. Q. Li, et al. (50 additional authors not shown)

Abstract: The $^{13}$C($α$,$n$)$^{16}$O reaction is the main neutron source for the slow-neutron-capture (s-) process in Asymptotic Giant Branch stars and for the intermediate (i-) process. Direct measurements at astrophysical energies in above-ground laboratories are hindered by the extremely small cross sections and vast cosmic-ray induced background. We performed the first consistent direct measurement in the range of $E_{\rm c.m.}=$0.24 MeV to 1.9 MeV using the accelerators at the China Jinping Underground Laboratory (CJPL) and Sichuan University. Our measurement covers almost the entire i-process Gamow window in which the large uncertainty of the previous experiments has been reduced from 60\% down to 15\%, eliminates the large systematic uncertainty in the extrapolation arising from the inconsistency of existing data sets, and provides a more reliable reaction rate for the studies of the s- and i-processes along with the first direct determination of the alpha strength for the near-threshold state.

Submitted 6 October, 2022; originally announced October 2022.

Journal ref: Physical Review Letters 129, 132701 (2022)

arXiv:2209.10744 [pdf]

doi 10.1021/acs.nanolett.2c03587

Suppression and revival of superconducting phase coherence in monolayer FeSe/SrTiO$_3$

Authors: H. Ru, Z. J. Li, S. Y. Wang, B. K. Xiang, Y. H. Wang

Abstract: Monolayer FeSe grown on SrTiO$_3$ (FeSe/STO) is an interfacial high temperature superconductor distinctively different from bulk FeSe. Due to the fragility of this two-dimensional system in the atmosphere, the investigation of its intrinsic superconductivity and intertwined orders has largely been limited to surface-sensitive charge probes compatible with ultra-high vacuum environment. However, the superconducting phase coherence of the interface is challenging to probe. Here, we perform in-situ mutual inductance in ultra-high vacuum on FeSe/STO in combination with band mapping by angle-resolved photoemission spectroscopy (ARPES). We find that even though the monolayer showed a gap-closing temperature above 50 K, surprisingly no diamagnetism is visible down to 5 K. This is the case for few-layer FeSe/STO until it exceeds a critical number of 5 layers where diamagnetism suddenly appears. But the superfluid density does not saturate down to the base temperature in these thick samples. On the other hand, the suppression of diamagnetism in the few-layer FeSe/STO can be lifted by depositing a FeTe layer on top. The superconducting transition is much sharper than that in the thick FeSe/STO. However, Tc and superfluid density both decrease with increasing FeTe thickness. Shining ultraviolet light on the FeTe/FeSe/STO heterostructure enhances Tc similarly independent of the FeSe thickness, showing that the diamagnetism originates at the FeSe/STO interface. Our observation may be understood by a scenario in which interfacial superconducting phase coherence is highly anisotropic.

Submitted 21 September, 2022; originally announced September 2022.

arXiv:2208.00642 [pdf]

doi 10.1103/PhysRevX.13.011046

Direct observation of quantum anomalous vortex in Fe(Se,Te)

Authors: Y. S. Lin, S. Y. Wang, X. Zhang, Y. Feng, Y. P. Pan, H. Ru, J. J. Zhu, B. K. Xiang, K. Liu, C. L. Zheng, L. Y. Wei, M. X. Wang, Z. K. Liu, L. Chen, K. Jiang, Y. F. Guo, Ziqiang Wang, Y. H. Wang

Abstract: Vortices are topological defects of type-II superconductors in an external magnetic field. In a similar fashion to a quantum anomalous Hall insulator, quantum anomalous vortex (QAV) spontaneously nucleates due to orbital-and-spin exchange interaction between vortex core states and magnetic impurity moment, breaking time-reversal symmetry (TRS) of the vortex without an external field. Here, we used scanning superconducting quantum interference device microscopy (sSQUID) to search for its signatures in iron-chalcogenide superconductor Fe(Se,Te). Under zero magnetic field, we found a stochastic distribution of isolated anomalous vortices and antivortices with flux quanta $Φ_0$. By applying a small local magnetic field under the coil of the nano-SQUID device, we observed hysteretic flipping of the vortices reminiscent of the switching of ferromagnetic domains, suggesting locally broken-TRS. We further showed vectorial rotation of a flux line linking a paired vortex-antivortex with the local field. These unique properties of the anomalous vortices satisfied the defining criteria of QAV. Our observation suggests a quantum vortex phase with spontaneous broken-TRS in a high-temperature superconductor.

Submitted 1 August, 2022; originally announced August 2022.

Journal ref: PhysRevX.13.011046 (2023)

arXiv:2206.15140 [pdf, other]

doi 10.1103/PhysRevC.106.L011301

Calculation of the Thomas-Ehrman shift in $^{16}$F and $^{15}$O(p,p) cross section with the Gamow shell model

Authors: N. Michel, J. G. Li, L. H. Ru, W. Zuo

Abstract: The $^{16}$F nucleus is situated at the proton drip-line and is unbound by proton emission by only about 500 keV. Continuum coupling is then prominent in this nucleus. Added to that, its low-lying spectrum consists of narrow proton resonances as well. It is then a very good candidate to study nuclear structure and reactions at proton drip-line. The low-lying spectrum and scattering proton-proton cross section of $^{16}$F have then been calculated with the coupled-channel Gamow shell model framework for that matter using an effective Hamiltonian. Experimental data are very well reproduced, as well as in its mirror nucleus $^{16}$N. Isospin-symmetry breaking generated by the Coulomb interaction and continuum coupling explicitly appears in our calculations. In particular, the different continuum couplings in $^{16}$F and $^{16}$N involving $s_{1/2}$ partial waves allow to explain the different ordering of low-lying states in their spectrum.

Submitted 30 June, 2022; originally announced June 2022.

Comments: 7 pages, 2 figures, accepted for publication in Phys. Rev. C. (Letters)

arXiv:2206.09287 [pdf, other]

Approximate Bayesian Inference for the Interaction Types 1, 2, 3 and 4 with Application in Disease Mapping

Authors: Esmail Abdul Fattah, Haavard Rue

Abstract: We address in this paper a new approach for fitting spatiotemporal models with application in disease mapping using the interaction types 1, 2, 3, and 4. When we account for the spatiotemporal interactions in disease-mapping models, inference becomes more useful in revealing unknown patterns in the data. However, when the number of locations and/or the number of time points is large, the inference gets computationally challenging due to the high number of required constraints necessary for inference, and this holds for various inference architectures including Markov chain Monte Carlo (MCMC) and Integrated Nested Laplace Approximations (INLA). We re-formulate the INLA approach based on dense matrices to fit the intrinsic spatiotemporal models with the four interaction types and account for the sum-to-zero constraints, and discuss how the new approach can be implemented in a high-performance computing framework. The computing time using the new approach does not depend on the number of constraints and can reach a 40-fold faster speed compared to INLA in realistic scenarios. This approach is verified by a simulation study and a real data application, and it is implemented in the R package INLAPLUS and the Python header function: inla1234().

Submitted 18 June, 2022; originally announced June 2022.

MSC Class: 62H11 ACM Class: G.3.14

arXiv:2204.06797 [pdf, other]

A new avenue for Bayesian inference with INLA

Authors: Janet van Niekerk, Elias Krainski, Denis Rustand, Haavard Rue

Abstract: Integrated Nested Laplace Approximations (INLA) has been a successful approximate Bayesian inference framework since its proposal by Rue et al. (2009). The increased computational efficiency and accuracy when compared with sampling-based methods for Bayesian inference like MCMC methods, are some contributors to its success. Ongoing research in the INLA methodology and implementation thereof in the R package R-INLA, ensures continued relevance for practitioners and improved performance and applicability of INLA. The era of big data and some recent research developments, presents an opportunity to reformulate some aspects of the classic INLA formulation, to achieve even faster inference, improved numerical stability and scalability. The improvement is especially noticeable for data-rich models. We demonstrate the efficiency gains with various examples of data-rich models, like Cox's proportional hazards model, an item-response theory model, a spatial model including prediction, and a 3-dimensional model for fMRI data.

Submitted 14 April, 2022; originally announced April 2022.

arXiv:2204.04678 [pdf, ps, other]

Parallelized integrated nested Laplace approximations for fast Bayesian inference

Authors: Lisa Gaedke-Merzhäuser, Janet van Niekerk, Olaf Schenk, Håvard Rue

Abstract: There is a growing demand for performing larger-scale Bayesian inference tasks, arising from greater data availability and higher-dimensional model parameter spaces. In this work we present parallelization strategies for the methodology of integrated nested Laplace approximations (INLA), a popular framework for performing approximate Bayesian inference on the class of Latent Gaussian models. Our approach makes use of nested OpenMP parallelism, a parallel line search procedure using robust regression in INLA's optimization phase and the state-of-the-art sparse linear solver PARDISO. We leverage mutually independent function evaluations in the algorithm as well as advanced sparse linear algebra techniques. This way we can flexibly utilize the power of today's multi-core architectures. We demonstrate the performance of our new parallelization scheme on a number of different real-world applications. The introduction of parallelism leads to speedups of a factor 10 and more for all larger models. Our work is already integrated in the current version of the open-source R-INLA package, making its improved performance conveniently available to all users.

Submitted 10 April, 2022; originally announced April 2022.

Comments: 18 pages, 7 figures

arXiv:2203.14304 [pdf, other]

An Extended Simplified Laplace strategy for Approximate Bayesian inference of Latent Gaussian Models using R-INLA

Authors: Cristian Chiuchiolo, Janet van Niekerk, Håvard Rue

Abstract: Various computational challenges arise when applying Bayesian inference approaches to complex hierarchical models. Sampling-based inference methods, such as Markov Chain Monte Carlo strategies, are renowned for providing accurate results but with high computational costs and slow or questionable convergence. On the contrary, approximate methods like the Integrated Nested Laplace Approximation (INLA) construct a deterministic approximation to the univariate posteriors through nested Laplace Approximations. This method enables fast inference performance in Latent Gaussian Models, which encode a large class of hierarchical models. R-INLA software mainly consists of three strategies to compute all the required posterior approximations depending on the accuracy requirements. The Simplified Laplace approximation (SLA) is the most attractive because of its speed performance since it is based on a Taylor expansion up to order three of a full Laplace Approximation. Here we enhance the methodology by simplifying the computations necessary for the skewness and modal configuration. Then we propose an expansion up to order four and use the Extended Skew Normal distribution as a new parametric fit. The resulting approximations to the marginal posterior densities are more accurate than those calculated with the SLA, with essentially no additional cost.

Submitted 27 March, 2022; originally announced March 2022.

Comments: 22 pages, 11 figures

arXiv:2203.06256 [pdf, other]

doi 10.1093/biostatistics/kxad019

Fast and flexible inference for joint models of multivariate longitudinal and survival data using Integrated Nested Laplace Approximations

Authors: Denis Rustand, Janet van Niekerk, Elias Teixeira Krainski, Håvard Rue, Cécile Proust-Lima

Abstract: Modeling longitudinal and survival data jointly offers many advantages such as addressing measurement error and missing data in the longitudinal processes, understanding and quantifying the association between the longitudinal markers and the survival events and predicting the risk of events based on the longitudinal markers. A joint model involves multiple submodels (one for each longitudinal/survival outcome) usually linked together through correlated or shared random effects. Their estimation is computationally expensive (particularly due to a multidimensional integration of the likelihood over the random effects distribution) so that inference methods become rapidly intractable, and restricts applications of joint models to a small number of longitudinal markers and/or random effects. We introduce a Bayesian approximation based on the Integrated Nested Laplace Approximation algorithm implemented in the R package R-INLA to alleviate the computational burden and allow the estimation of multivariate joint models with fewer restrictions. Our simulation studies show that R-INLA substantially reduces the computation time and the variability of the parameter estimates compared to alternative estimation strategies. We further apply the methodology to analyze 5 longitudinal markers (3 continuous, 1 count, 1 binary, and 16 random effects) and competing risks of death and transplantation in a clinical trial on primary biliary cholangitis. R-INLA provides a fast and reliable inference technique for applying joint models to the complex multivariate data encountered in health research.

Submitted 12 July, 2023; v1 submitted 11 March, 2022; originally announced March 2022.

arXiv:2203.05510 [pdf, other]

Controlling the flexibility of non-Gaussian processes through shrinkage priors

Authors: Rafael Cabral, David Bolin, Håvard Rue

Abstract: The normal inverse Gaussian (NIG) and generalized asymmetric Laplace (GAL) distributions can be seen as skewed and semi-heavy-tailed extensions of the Gaussian distribution. Models driven by these more flexible noise distributions are then regarded as flexible extensions of simpler Gaussian models. Inferential procedures tend to overestimate the degree of non-Gaussianity in the data and therefore we propose controlling the flexibility of these non-Gaussian models by adding sensible priors in the inferential framework that contract the model towards Gaussianity. In our venture to derive sensible priors, we also propose a new intuitive parameterization of the non-Gaussian models and discuss how to implement them efficiently in $Stan$. The methods are derived for a generic class of non-Gaussian models that include spatial Matérn fields, autoregressive models for time series, and simultaneous autoregressive models for areal data. The results are illustrated with a simulation study and geostatistics application, where priors that penalize model complexity were shown to lead to more robust estimation and give preference to the Gaussian model, while at the same time allowing for non-Gaussianity if there is sufficient evidence in the data.

Submitted 29 October, 2022; v1 submitted 10 March, 2022; originally announced March 2022.

Comments: 34 pages, 12 figures; Added new section about Stan implementation and a cross-validation study in the application section, removed unnecessary details

MSC Class: 62F15; 62M20 (Primary); 62M40 (Secondary)

arXiv:2202.06502 [pdf, ps, other]

Joint Modeling and Prediction of Massive Spatio-Temporal Wildfire Count and Burnt Area Data with the INLA-SPDE Approach

Authors: Zhongwei Zhang, Elias Krainski, Peng Zhong, Håvard Rue, Raphaël Huser

Abstract: This paper describes the methodology used by the team RedSea in the data competition organized for the EVA 2021 conference. We develop a novel two-part model to jointly describe the wildfire count data and burnt area data provided by the competition organizers with covariates. Our proposed methodology relies on the integrated nested Laplace approximation combined with the stochastic partial differential equation (INLA-SPDE) approach. In the first part, a binary non-stationary spatio-temporal model is used to describe the underlying process that determines whether or not there is wildfire at a specific time and location. In the second part, we consider a non-stationary model that is based on log-Gaussian Cox processes for positive wildfire count data, and a non-stationary log-Gaussian model for positive burnt area data. Dependence between the positive count data and positive burnt area data is captured by a shared spatio-temporal random effect. Our two-part modeling approach performs well in terms of the prediction score criterion chosen by the data competition organizers. Moreover, our model results show that surface pressure is the most influential driver for the occurrence of a wildfire, whilst surface net solar radiation and surface pressure are the key drivers for large numbers of wildfires, and temperature and evaporation are the key drivers of large burnt areas.

Submitted 14 February, 2022; originally announced February 2022.

arXiv:2201.12902 [pdf, other]

Joint Quantile Disease Mapping with Application to Malaria and G6PD Deficiency

Authors: Hanan Alahmadi, Håvard Rue, Janet van Niekerk

Abstract: Statistical analysis based on quantile regression methods is more comprehensive, flexible, and less sensitive to outliers when compared to mean regression methods. When the link between different diseases is of interest, joint disease mapping is useful for measuring directional correlation between them. Most studies study this link through multiple correlated mean regressions. In this paper we propose a joint quantile regression framework for multiple diseases where different quantile levels can be considered. We are motivated by the theorized link between the presence of Malaria and the gene deficiency G6PD, where medical scientists have anecdotally discovered a possible link between high levels of G6PD and lower than expected levels of Malaria initially pointing towards the occurrence of G6PD inhibiting the occurrence of Malaria. This link cannot be investigated with mean regressions and thus the need for flexible joint quantile regression in a disease mapping framework. Our joint quantile disease mapping model can be used for linear and non-linear effects of covariates by stochastic splines, since we define it as a latent Gaussian model. We perform Bayesian inference of this model using the INLA framework embedded in the R software package INLA. Finally, we illustrate the applicability of the model by analyzing the malaria and G6PD deficiency incidences in 21 African countries using linked quantiles of different levels.

Submitted 30 January, 2022; originally announced January 2022.

arXiv:2112.04782 [pdf]

doi 10.1093/nsr/nwad249

Oscillating paramagnetic Meissner effect and Berezinskii-Kosterlitz-Thouless transition in $Bi_2Sr_2CaCu_2O_{8+δ}$ monolayer

Authors: S. Y. Wang, Y. Yu, J. X. Hao, Y. Feng, J. J. Zhu, Y. S. Lin, B. K. Xiang, H. Ru, Y. P. Pan, G. D. Gu, K. Watanabe, T. Taniguchi, Y. Qi, Y. Zhang, Y. H. Wang

Abstract: Monolayers of a prototypical cuprate high transition-temperature ($T_C$) superconductor $Bi_2Sr_2CaCu_2O_{8+δ}$ (Bi2212) were recently found to show $T_C$ and other electronic properties similar to those of the bulk. The robustness of superconductivity in an ideal two-dimensional (2D) system was an intriguing fact that defied the Mermin-Wagner theorem. Here, we took advantage of the high sensitivity of scanning SQUID susceptometry to image the phase stiffness throughout the phase transition of Bi2212 in the 2D limit. We found susceptibility oscillated with flux between diamagnetism and paramagnetism in a Fraunhofer-like pattern up till $T_C$. The temperature and sample size-dependence of the modulation period agreed well with our Coulomb gas analogy of a finite 2D system based on Berezinskii-Kosterlitz-Thouless (BKT) transition. In the multilayers, the susceptibility oscillation differed in a small temperature regime below $T_C$, consistent with a dimensional-crossover led by interlayer coupling. Serving as strong evidence of BKT transition in the bulk, there appeared a sharp superfluid density jump at zero-field and paramagnetism at small fields just below $T_C$. These results unified the phase transitions from the monolayer Bi2212 to the bulk as BKT transition with finite interlayer coupling. This elucidating picture favored the pre-formed pairs scenario for the underdoped cuprates regardless of lattice dimensionality.

Submitted 9 December, 2021; originally announced December 2021.

arXiv:2112.02861 [pdf, other]

Joint Posterior Inference for Latent Gaussian Models with R-INLA

Authors: Cristian Chiuchiolo, Janet van Niekerk, Haavard Rue

Abstract: Efficient Bayesian inference remains a computational challenge in hierarchical models. Simulation-based approaches such as Markov Chain Monte Carlo methods are still popular but have a large computational cost. When dealing with the large class of Latent Gaussian Models, the INLA methodology embedded in the R-INLA software provides accurate Bayesian inference by computing deterministic mixture representation to approximate the joint posterior, from which marginals are computed. The INLA approach has from the beginning been targeting to approximate univariate posteriors. In this paper we lay out the development foundation of the tools for also providing joint approximations for subsets of the latent field. These approximations inherit Gaussian copula structure and additionally provide corrections for skewness. The same idea is carried forward also to sampling from the mixture representation, which we now can adjust for skewness.

Submitted 6 December, 2021; originally announced December 2021.

Comments: 33 pages, 11 figures

arXiv:2111.12945 [pdf, other]

Low-rank variational Bayes correction to the Laplace method

Authors: Janet van Niekerk, Haavard Rue

Abstract: Approximate inference methods like the Laplace method, Laplace approximations and variational methods, amongst others, are popular methods when exact inference is not feasible due to the complexity of the model or the abundance of data. In this paper we propose a hybrid approximate method called Low-Rank Variational Bayes correction (VBC), that uses the Laplace method and subsequently a Variational Bayes correction in a lower dimension, to the joint posterior mean. The cost is essentially that of the Laplace method which ensures scalability of the method, in both model complexity and data size. Models with fixed and unknown hyperparameters are considered, for simulated and real examples, for small and large datasets.

Submitted 14 November, 2023; v1 submitted 25 November, 2021; originally announced November 2021.

arXiv:2111.12552 [pdf, ps, other]

Development of a low-background neutron detector array

Authors: Y. T. Li, W. P. Lin, B. Gao, H. Chen, H. Huang, Y. Huang, T. Y. Jiao, K. A. Li, X. D. Tang, X. Y. Wang, X. Fang, H. X. Huang, J. Ren, L. H. Ru, X. C. Ruan, N. T. Zhang, Z. C. Zhang

Abstract: A low-background neutron detector array was developed to measure the cross section of the $^{13}$C($α$,n)$^{16}$O reaction, which is the neutron source for the $s$-process in AGB stars, in the Gamow window ($E_{c.m.}$ = 190 $\pm$ 40 keV) at the China Jinping Underground Laboratory (CJPL). The detector array consists of 24 $^{3}$He proportional counters embedded in a polyethylene cube. Due to the deep underground location and a borated polyethylene shield around the detector array, a low background of 4.5(2)/hour was achieved. The $^{51}$V(p, n)$^{51}$Cr reaction was used to determine the neutron detection efficiency of the array for neutrons with energy $E_n$ $<$ 1 MeV. Geant4 simulations, which were shown to well reproduce experimental results, were used to extrapolate the detection efficiency to higher energies for neutrons emitted in the $^{13}$C($α$,n) $^{16}$O reaction. The theoretical angular distributions of the $^{13}$C($α$,n)$^{16}$O reaction were shown to be important in estimating the uncertainties of the detection efficiency.

Submitted 16 March, 2022; v1 submitted 20 November, 2021; originally announced November 2021.

arXiv:2111.01084 [pdf, other]

doi 10.1016/j.spasta.2022.100599

The SPDE approach for Gaussian and non-Gaussian fields: 10 years and still running

Authors: Finn Lindgren, David Bolin, Håvard Rue

Abstract: Gaussian processes and random fields have a long history, covering multiple approaches to representing spatial and spatio-temporal dependence structures, such as covariance functions, spectral representations, reproducing kernel Hilbert spaces, and graph based models. This article describes how the stochastic partial differential equation approach to generalising Matérn covariance models via Hilbert space projections connects with several of these approaches, with each connection being useful in different situations. In addition to an overview of the main ideas, some important extensions, theory, applications, and other recent developments are discussed. The methods include both Markovian and non-Markovian models, non-Gaussian random fields, non-stationary fields and space-time fields on arbitrary manifolds, and practical computational considerations.

Submitted 4 January, 2022; v1 submitted 1 November, 2021; originally announced November 2021.

Comments: 33 pages, 1 figure

MSC Class: 60G60 (Primary); 60G60; 62M40; 62-08 (Secondary)

arXiv:2110.06417 [pdf]

doi 10.3390/ma14164584

Oxygen adsorption induced superconductivity in ultrathin FeTe film on SrTiO3(001)

Authors: Wei Ren, Hao Ru, Kun Peng, Huifang Li, Shuai Lu, Aixi Chen, Pengdong Wang, Xinwei Fang, Zhiyun Li, Rong Huang, Li Wang, Yihua Wang, Fangsen Li

Abstract: The phenomenon of oxygen incorporation induced superconductivity in iron telluride (Fe1+yTe, with antiferromagnetic (AFM) orders) is intriguing and quite different from the case of FeSe. Until now, the microscopic origin of the induced superconductivity and the role of oxygen are far from clear. Here, by combining in-situ scanning tunneling microscopy/spectroscopy (STM/STS) and x-ray photoemission spectroscopy (XPS) on oxygenated FeTe, we found physically adsorbed O2 molecules crystallized into c(2/3x2) structure as an oxygen overlayer at low temperature, which was vital for superconductivity. The O2 overlayer was not epitaxial on the FeTe lattice, which implied weak O2-FeTe interaction but strong molecular interactions. Energy shift observed in the STS and XPS measurements indicated hole doping effect from the O2 overlayer to the FeTe layer, leading to a superconducting gap of 4.5 meV opened across the Fermi level. Our direct microscopic probe clarified the role of oxygen on FeTe and emphasized the importance of charge transfer effect to induce superconductivity in iron-chalcogenide thin films.

Submitted 12 October, 2021; originally announced October 2021.

Comments: 15 pages,4 figures

Journal ref: Materials 14, 4584 (2021)

arXiv:2109.13374 [pdf, other]

doi 10.1177/09622802221099642

Variance partitioning in spatio-temporal disease mapping models

Authors: Maria Franco-Villoria, Massimo Ventrucci, Håvard Rue

Abstract: Bayesian disease mapping, while undeniably useful to describe variation in risk over time and space, comes with the hurdle of prior elicitation on hard-to-interpret random effect precision parameters. We introduce a reparametrized version of the popular spatio-temporal interaction models, based on Kronecker product intrinsic Gaussian Markov Random Fields, that we name the variance partitioning (VP) model. The VP model includes a mixing parameter that balances the contribution of the main and interaction effects to the total (generalized) variance and enhances interpretability. The use of a penalized complexity prior on the mixing parameter aids in coding prior information in an intuitive way. We illustrate the advantages of the VP model using two case studies.

Submitted 17 May, 2022; v1 submitted 27 September, 2021; originally announced September 2021.

arXiv:2109.11870 [pdf, other]

Quantification of empirical determinacy: the impact of likelihood weighting on posterior location and spread in Bayesian meta-analysis estimated with JAGS and INLA

Authors: Sona Hunanyan, Håvard Rue, Martyn Plummer, Małgorzata Roos

Abstract: The popular Bayesian meta-analysis expressed by Bayesian normal-normal hierarchical model (NNHM) synthesizes knowledge from several studies and is highly relevant in practice. Moreover, NNHM is the simplest Bayesian hierarchical model (BHM), which illustrates problems typical in more complex BHMs. Until now, it has been unclear to what extent the data determines the marginal posterior distributions of the parameters in NNHM. To address this issue we computed the second derivative of the Bhattacharyya coefficient with respect to the weighted likelihood, defined the total empirical determinacy (TED), the proportion of the empirical determinacy of location to TED (pEDL), and the proportion of the empirical determinacy of spread to TED (pEDS). We implemented this method in the R package \texttt{ed4bhm} and considered two case studies and one simulation study. We quantified TED, pEDL and pEDS under different modeling conditions such as model parametrization, the primary outcome, and the prior. This clarified to what extent the location and spread of the marginal posterior distributions of the parameters are determined by the data. Although these investigations focused on Bayesian NNHM, the method proposed is applicable more generally to complex BHMs.

Submitted 24 September, 2021; originally announced September 2021.

Comments: 22 pages, 1 figure

arXiv:2108.10737 [pdf]

doi 10.1364/OE.443942

Spatially homogeneous few-cycle compression of Yb lasers via all-solid-state free-space soliton management

Authors: Bingbing Zhu, Zongyuan Fu, Yudong Chen, Sainan Peng, Cheng Jin, Guangyu Fan, Sheng Zhang, Shunjia Wang, Hao Ru, Chuanshan Tian, Yihua Wang, Henry Kapteyn, Margaret Murnane, Zhensheng Tao

Abstract: The high power and variable repetition rate of Yb femtosecond lasers make them very attractive for ultrafast science. However, for capturing sub-200 fs dynamics, efficient, high-fidelity, and high-stability pulse compression techniques are essential. Spectral broadening using an all-solid-state free-space geometry is particularly attractive, as it is simple, robust, and low-cost. However, spatial and temporal losses caused by spatio-spectral inhomogeneities have been a major challenge to date, due to coupled space-time dynamics associated with unguided nonlinear propagation. In this work, we use all-solid-state free-space compressors to demonstrate compression of 170 fs pulses at a wavelength of 1030nm from a Yb:KGW laser to ~9.2 fs, with a highly spatially homogeneous mode. This is achieved by ensuring that the nonlinear beam propagation in periodic layered Kerr media occurs in soliton modes and confining the nonlinear phase through each material layer to less than 1.0 rad. A remarkable spatio-spectral homogeneity of ~0.87 can be realized, which yields a high efficiency of >50% for few-cycle compression. The universality of the method is demonstrated by implementing high-quality pulse compression under a wide range of laser conditions. The high spatiotemporal quality and the exceptional stability of the compressed pulses are further verified by high-harmonic generation. This work represents the highest efficiency and the best spatio-spectral quality ever achieved by an all-solid-state free-space pulse compressor for few-cycle-pulse generation.

Submitted 24 August, 2021; originally announced August 2021.

Journal ref: Optics Express (2022)

arXiv:2108.04553 [pdf, other]

doi 10.1103/PhysRevLett.127.172701

Advancement of Photospheric Radius Expansion and Clocked Type-I X-Ray Burst Models with the New $^{22}$Mg$(α,p)^{25}$Al Reaction Rate Determined at Gamow Energy

Authors: J. Hu, H. Yamaguchi, Y. H. Lam, A. Heger, D. Kahl, A. M. Jacobs, Z. Johnston, S. W. Xu, N. T. Zhang, S. B. Ma, L. H. Ru, E. Q. Liu, T. Liu, S. Hayakawa, L. Yang, H. Shimizu, C. B. Hamill, A. St J. Murphy, J. Su, X. Fang, K. Y. Chae, M. S. Kwag, S. M. Cha, N. N. Duy, N. K. Uyen , et al. (12 additional authors not shown)

Abstract: We report the first (in)elastic scattering measurement of $^{25}\mathrm{Al}+p$ with the capability to select and measure in a broad energy range the proton resonances in $^{26}$Si contributing to the $^{22}$Mg$(α,p)$ reaction at type I x-ray burst energies. We measured spin-parities of four resonances above the $α$ threshold of $^{26}$Si that are found to strongly impact the $^{22}$Mg$(α,p)$ rate. The new rate advances a state-of-the-art model to remarkably reproduce light curves of the GS 1826$-$24 clocked burster with mean deviation $<9$ % and permits us to discover a strong correlation between the He abundance in the accreting envelope of photospheric radius expansion burster and the dominance of $^{22}$Mg$(α,p)$ branch.

Submitted 20 October, 2021; v1 submitted 10 August, 2021; originally announced August 2021.

Comments: accepted by Physical Review Letters on 5 August 2021, published 19 October 2021

Journal ref: Phys. Rev. Lett. 127 (2021) 172701

arXiv:2107.04562 [pdf, other]

The Bayesian Learning Rule

Authors: Mohammad Emtiyaz Khan, Håvard Rue

Abstract: We show that many machine-learning algorithms are specific instances of a single algorithm called the \emph{Bayesian learning rule}. The rule, derived from Bayesian principles, yields a wide range of algorithms from fields such as optimization, deep learning, and graphical models. This includes classical algorithms such as ridge regression, Newton's method, and the Kalman filter, as well as modern deep-learning algorithms such as stochastic-gradient descent, RMSprop, and Dropout. The key idea in deriving such algorithms is to approximate the posterior using candidate distributions estimated by using natural gradients. Different candidate distributions result in different algorithms and further approximations to natural gradients give rise to variants of those algorithms. Our work not only unifies, generalizes, and improves existing algorithms, but also helps us design new ones.

Submitted 8 June, 2024; v1 submitted 9 July, 2021; originally announced July 2021.

Journal ref: Journal of Machine Learning Research 24, no. 281 (2023): 1-46

arXiv:2106.13110 [pdf, other]

Practical strategies for GEV-based regression models for extremes

Authors: Daniela Castro-Camilo, Raphaël Huser, Håvard Rue

Abstract: The generalised extreme value (GEV) distribution is a three parameter family that describes the asymptotic behaviour of properly renormalised maxima of a sequence of independent and identically distributed random variables. If the shape parameter $ξ$ is zero, the GEV distribution has unbounded support, whereas if $ξ$ is positive, the limiting distribution is heavy-tailed with infinite upper endpoint but finite lower endpoint. In practical applications, we assume that the GEV family is a reasonable approximation for the distribution of maxima over blocks, and we fit it accordingly. This implies that GEV properties, such as finite lower endpoint in the case $ξ>0$, are inherited by the finite-sample maxima, which might not have bounded support. This is particularly problematic when predicting extreme observations based on multiple and interacting covariates. To tackle this usually overlooked issue, we propose a blended GEV distribution, which smoothly combines the left tail of a Gumbel distribution (GEV with $ξ=0$) with the right tail of a Fréchet distribution (GEV with $ξ>0$) and, therefore, has unbounded support. Using a Bayesian framework, we reparametrise the GEV distribution to offer a more natural interpretation of the (possibly covariate-dependent) model parameters. Independent priors over the new location and spread parameters induce a joint prior distribution for the original location and scale parameters.
We introduce the concept of property-preserving penalised complexity (P$^3$C) priors and apply it to the shape parameter to preserve first and second moments. We illustrate our methods with an application to NO$_2$ pollution levels in California, which reveals the robustness of the bGEV distribution, as well as the suitability of the new parametrisation and the P$^3$C prior framework.

Submitted 7 May, 2022; v1 submitted 24 June, 2021; originally announced June 2021.

Comments: 19 pages, 3 figures

arXiv:2106.07313 [pdf, other]

doi 10.3934/fods.2021037

Smart Gradient -- An Adaptive Technique for Improving Gradient Estimation

Authors: Esmail Abdul Fattah, Janet Van Niekerk, Haavard Rue

Abstract: Computing the gradient of a function provides fundamental information about its behavior. This information is essential for several applications and algorithms across various fields. One common class of applications that requires gradients is optimization, with techniques such as stochastic gradient descent, Newton's method and trust region methods. However, these methods usually require a numerical computation of the gradient at every iteration of the method, which is prone to numerical errors. We propose a simple limited-memory technique for improving the accuracy of a numerically computed gradient in this gradient-based optimization framework by exploiting (1) a coordinate transformation of the gradient and (2) the history of previously taken descent directions. The method is verified empirically by extensive experimentation on both test functions and on real data applications. The proposed method is implemented in the R package smartGrad and in C++.

Submitted 14 June, 2021; originally announced June 2021.

arXiv:2105.09062 [pdf, other]

doi 10.1007/s13253-022-00500-7

Modelling sub-daily precipitation extremes with the blended generalised extreme value distribution

Authors: Silius M. Vandeskog, Sara Martino, Daniela Castro-Camilo, Håvard Rue

Abstract: A new method is proposed for modelling the yearly maxima of sub-daily precipitation, with the aim of producing spatial maps of return level estimates. Yearly precipitation maxima are modelled using a Bayesian hierarchical model with a latent Gaussian field, with the blended generalised extreme value (bGEV) distribution used as a substitute for the more standard generalised extreme value (GEV) distribution. Inference is made less wasteful with a novel two-step procedure that performs separate modelling of the scale parameter of the bGEV distribution using peaks over threshold data. Fast inference is performed using integrated nested Laplace approximations (INLA) together with the stochastic partial differential equation (SPDE) approach, both implemented in R-INLA. Heuristics for improving the numerical stability of R-INLA with the GEV and bGEV distributions are also presented. The model is fitted to yearly maxima of sub-daily precipitation from the south of Norway, and is able to quickly produce high-resolution return level maps with uncertainty. The proposed two-step procedure provides an improved model fit over standard inference techniques when modelling the yearly maxima of sub-daily precipitation with the bGEV distribution.

Submitted 21 May, 2022; v1 submitted 19 May, 2021; originally announced May 2021.

arXiv:2103.02721 [pdf, other]

Importance Sampling with the Integrated Nested Laplace Approximation

Authors: Martin Outzen Berild, Sara Martino, Virgilio Gómez-Rubio, Håvard Rue

Abstract: The Integrated Nested Laplace Approximation (INLA) is a deterministic approach to Bayesian inference on latent Gaussian models (LGMs) and focuses on fast and accurate approximation of posterior marginals for the parameters in the models. Recently, methods have been developed to extend this class of models to those that can be expressed as conditional LGMs by fixing some of the parameters in the models to descriptive values. These methods differ in the manner descriptive values are chosen. This paper proposes to combine importance sampling with INLA (IS-INLA), and extends this approach with the more robust adaptive multiple importance sampling algorithm combined with INLA (AMIS-INLA). This paper gives a comparison between these approaches and existing methods on a series of applications with simulated and observed datasets and evaluates their performance based on accuracy, efficiency, and robustness. The approaches are validated by exact posteriors in a simple bivariate linear model; then, they are applied to a Bayesian lasso model, a Bayesian imputation of missing covariate values, and lastly, in parametric Bayesian quantile regression. The applications show that the AMIS-INLA approach, in general, outperforms the other methods, but the IS-INLA algorithm could be considered for faster inference when good proposals are available.

Submitted 3 March, 2021; originally announced March 2021.

arXiv:2010.13704 [pdf, other]

Bayesian Estimation of Two-Part Joint Models for a Longitudinal Semicontinuous Biomarker and a Terminal Event with R-INLA: Interests for Cancer Clinical Trial Evaluation

Authors: Denis Rustand, Janet van Niekerk, Håvard Rue, Christophe Tournigand, Virginie Rondeau, Laurent Briollais

Abstract: Two-part joint models for a longitudinal semicontinuous biomarker and a terminal event have been recently introduced based on frequentist estimation. The biomarker distribution is decomposed into a probability of positive value and the expected value among positive values. Shared random effects can represent the association structure between the biomarker and the terminal event. The computational burden increases compared to standard joint models with a single regression model for the biomarker. In this context, the frequentist estimation implemented in the R package frailtypack can be challenging for complex models (i.e., large number of parameters and dimension of the random effects). As an alternative, we propose a Bayesian estimation of two-part joint models based on the Integrated Nested Laplace Approximation (INLA) algorithm to alleviate the computational burden and fit more complex models. Our simulation studies confirm that INLA provides accurate approximation of posterior estimates and leads to reduced computation time and variability of estimates compared to frailtypack in the situations considered. We contrast the Bayesian and frequentist approaches in the analysis of two randomized cancer clinical trials (GERCOR and PRIME studies), where INLA has a reduced variability for the association between the biomarker and the risk of event. Moreover, the Bayesian approach was able to characterize subgroups of patients associated with different responses to treatment in the PRIME study.
Our study suggests that the Bayesian approach using the INLA algorithm makes it possible to fit complex joint models that might be of interest in a wide range of clinical applications.

Submitted 27 January, 2023; v1 submitted 26 October, 2020; originally announced October 2020.

arXiv:2010.05870 [pdf, ps, other]

Model-based bias correction for short AR(1) and AR(2) processes

Authors: Sigrunn H. Sørbye, Pedro G. Nicolau, Håvard Rue

Abstract: The class of autoregressive (AR) processes is extensively used to model temporal dependence in observed time series. Such models are easily available and routinely fitted using freely available statistical software like R. A potential caveat in analyzing short time series is that commonly applied estimators for the coefficients of AR processes are severely biased. This paper suggests a model-based approach for bias correction of well-known estimators for the coefficients of first and second-order stationary AR processes, taking the sampling distribution of the original estimator into account. This is achieved by modeling the relationship between the true and estimated AR coefficients using weighted orthogonal polynomial regression, fitted to a huge number of simulations. The finite-sample distributions of the new estimators are approximated using transformations of skew-normal densities and their properties are demonstrated by simulations and in the analysis of a real ecological data set. The new estimators are easily available in our accompanying R-package ARbiascorrect for time series of length n = 10, 11, ... , 50, where original estimates are found using exact or conditional maximum likelihood, Burg's method or the Yule-Walker equations.

Submitted 12 October, 2020; originally announced October 2020.

Comments: 21 pages, 11 figures

MSC Class: 62M10; 62F10; 62E17

arXiv:2009.09414 [pdf, other]

Skewed probit regression -- Identifiability, contraction and reformulation

Authors: Janet van Niekerk, Haavard Rue

Abstract: Skewed probit regression is but one example of a statistical model that generalizes a simpler model, like probit regression. All skew-symmetric distributions and link functions arise from symmetric distributions by incorporating a skewness parameter through some skewing mechanism. In this work we address some fundamental issues in skewed probit regression, and more generally skew-symmetric distributions or skew-symmetric link functions. We address the issue of identifiability of the skewed probit model parameters by reformulating the intercept from first principles. A new standardization of the skew link function is given to provide an anchored interpretation of the inference. Possible skewness parameters are investigated and the penalizing complexity priors of these are derived. This prior is invariant under reparameterization of the skewness parameter and quantifies the contraction of the skewed probit model to the probit model. The proposed results are available in the R-INLA package and we illustrate the use and effects of this work using simulated data, and well-known datasets using the link as well as the likelihood.

Submitted 20 September, 2020; originally announced September 2020.

arXiv:2006.06305 [pdf, other]

doi 10.1088/1674-1137/abae56

The modified astrophysical S-factor of the ${}^{12}$C+${}^{12}$C fusion reaction at sub-barrier energies

Authors: Y. J. Li, X. Fang, B. Bucher, K. A. Li, L. H. Ru, X. D. Tang

Abstract: The $^{12}$C+$^{12}$C fusion reaction plays a crucial role in stellar evolution and explosions. Its open reaction channels mainly include $α$, $p$, $n$, and ${}^{8}$Be. Despite more than a half century of efforts, large discrepancies remain among the experimental data measured using various techniques. In this work, we analyze the existing data using the statistical model. Our calculation shows: 1) the relative systematic uncertainties of the predicted branching ratios get smaller as the predicted ratios increase; 2) the total modified astrophysical S-factors (S$^*$ factors) of the $p$ and $α$ channels can each be obtained by summing the S$^*$ factors of their corresponding ground-state transitions and the characteristic $γ$ rays while taking into account the contributions of the missing channels to the latter. After applying corrections based on branching ratios predicted by the statistical model, an agreement is achieved among the different data sets at ${E}_{cm}>$4 MeV, while some discrepancies remain at lower energies suggesting the need for better measurements in the near future. We find that the recent S$^*$ factor obtained from an indirect measurement is inconsistent with the direct measurement at energies below 2.6 MeV. We recommend upper and lower limits for the ${}^{12}$C+${}^{12}$C S$^*$ factor based on the existing models. A new $^{12}$C+$^{12}$C reaction rate is also recommended.

Submitted 15 July, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

Journal ref: Chinese Physics C, 44 No.11 (2020): 115001

arXiv:2006.04917 [pdf, other]

A diffusion-based spatio-temporal extension of Gaussian Matérn fields

Authors: Finn Lindgren, Haakon Bakka, David Bolin, Elias Krainski, Håvard Rue

Abstract: Gaussian random fields with Matérn covariance functions are popular models in spatial statistics and machine learning. In this work, we develop a spatio-temporal extension of the Gaussian Matérn fields formulated as solutions to a stochastic partial differential equation. The spatially stationary subset of the models have marginal spatial Matérn covariances, and the model also extends to Whittle-Matérn fields on curved manifolds, and to more general non-stationary fields. In addition to the parameters of the spatial dependence (variance, smoothness, and practical correlation range), it has parameters controlling the practical correlation range in time, the smoothness in time, and the type of non-separability of the spatio-temporal covariance. Through the separability parameter, the model also allows for separable covariance functions. We provide a sparse representation based on a finite element approximation, that is well suited for statistical inference and which is implemented in the R-INLA software. The flexibility of the model is illustrated in an application to spatio-temporal modeling of global temperature data.

Submitted 5 April, 2023; v1 submitted 8 June, 2020; originally announced June 2020.

Comments: 40 pages, 10 figures

MSC Class: 60G60 (Primary); 62M20; 62M30; 62M40; 62-08 (Secondary)

Showing 1–50 of 115 results for author: Rue, H