Search | arXiv e-print repository

Topological Analysis of Seizure-Induced Changes in Brain Hierarchy Through Effective Connectivity

Authors: Anass B. El-Yaagoubi, Moo K. Chung, Hernando Ombao

Abstract: Traditional Topological Data Analysis (TDA) methods, such as Persistent Homology (PH), rely on distance measures (e.g., cross-correlation, partial correlation, coherence, and partial coherence) that are symmetric by definition. While useful for studying topological patterns in functional brain connectivity, the main limitation of these methods is their inability to capture the directional dynamics… ▽ More Traditional Topological Data Analysis (TDA) methods, such as Persistent Homology (PH), rely on distance measures (e.g., cross-correlation, partial correlation, coherence, and partial coherence) that are symmetric by definition. While useful for studying topological patterns in functional brain connectivity, the main limitation of these methods is their inability to capture the directional dynamics - which is crucial for understanding effective brain connectivity. We propose the Causality-Based Topological Ranking (CBTR) method, which integrates Causal Inference (CI) to assess effective brain connectivity with Hodge Decomposition (HD) to rank brain regions based on their mutual influence. Our simulations confirm that the CBTR method accurately and consistently identifies hierarchical structures in multivariate time series data. Moreover, this method effectively identifies brain regions showing the most significant interaction changes with other regions during seizures using electroencephalogram (EEG) data. These results provide novel insights into the brain's hierarchical organization and illuminate the impact of seizures on its dynamics. △ Less

Submitted 18 July, 2024; originally announced July 2024.

arXiv:2406.02360 [pdf, other]

A Practical Approach for Exploring Granger Connectivity in High-Dimensional Networks of Time Series

Authors: Sipan Aslan, Hernando Ombao

Abstract: This manuscript presents a novel method for discovering effective connectivity between specified pairs of nodes in a high-dimensional network of time series. To accurately perform Granger causality analysis from the first node to the second node, it is essential to eliminate the influence of all other nodes within the network. The approach proposed is to create a low-dimensional representation of… ▽ More This manuscript presents a novel method for discovering effective connectivity between specified pairs of nodes in a high-dimensional network of time series. To accurately perform Granger causality analysis from the first node to the second node, it is essential to eliminate the influence of all other nodes within the network. The approach proposed is to create a low-dimensional representation of all other nodes in the network using frequency-domain-based dynamic principal component analysis (spectral DPCA). The resulting scores are subsequently removed from the first and second nodes of interest, thus eliminating the confounding effect of other nodes within the high-dimensional network. To conduct hypothesis testing on Granger causality, we propose a permutation-based causality test. This test enhances the accuracy of our findings when the error structures are non-Gaussian. The approach has been validated in extensive simulation studies, which demonstrate the efficacy of the methodology as a tool for causality analysis in complex time series networks. The proposed methodology has also been demonstrated to be both expedient and viable on real datasets, with particular success observed on multichannel EEG networks. △ Less

Submitted 4 June, 2024; originally announced June 2024.

arXiv:2404.09157 [pdf, other]

Statistics of Extremes for Neuroscience

Authors: Paolo V. Redondo, Matheus B. Guerrero, Raphaël Huser, Hernando Ombao

Abstract: This chapter illustrates how tools from univariate and multivariate statistics of extremes can complement classical methods used to study brain signals and enhance the understanding of brain activity and connectivity during specific cognitive tasks or abnormal episodes, such as an epileptic seizure. This chapter illustrates how tools from univariate and multivariate statistics of extremes can complement classical methods used to study brain signals and enhance the understanding of brain activity and connectivity during specific cognitive tasks or abnormal episodes, such as an epileptic seizure. △ Less

Submitted 14 April, 2024; originally announced April 2024.

arXiv:2401.16928 [pdf, other]

Dynamic MRI reconstruction using low-rank plus sparse decomposition with smoothness regularization

Authors: Chee-Ming Ting, Fuad Noman, Raphaël C. -W. Phan, Hernando Ombao

Abstract: The low-rank plus sparse (L+S) decomposition model has enabled better reconstruction of dynamic magnetic resonance imaging (dMRI) with separation into background (L) and dynamic (S) component. However, use of low-rank prior alone may not fully explain the slow variations or smoothness of the background part at the local scale. In this paper, we propose a smoothness-regularized L+S (SR-L+S) model f… ▽ More The low-rank plus sparse (L+S) decomposition model has enabled better reconstruction of dynamic magnetic resonance imaging (dMRI) with separation into background (L) and dynamic (S) component. However, use of low-rank prior alone may not fully explain the slow variations or smoothness of the background part at the local scale. In this paper, we propose a smoothness-regularized L+S (SR-L+S) model for dMRI reconstruction from highly undersampled k-t-space data. We exploit joint low-rank and smooth priors on the background component of dMRI to better capture both its global and local temporal correlated structures. Extending the L+S formulation, the low-rank property is encoded by the nuclear norm, while the smoothness by a general \ell_{p}-norm penalty on the local differences of the columns of L. The additional smoothness regularizer can promote piecewise local consistency between neighboring frames. By smoothing out the noise and dynamic activities, it allows accurate recovery of the background part, and subsequently more robust dMRI reconstruction. Extensive experiments on multi-coil cardiac and synthetic data shows that the SR-L+S model outp △ Less

Submitted 30 January, 2024; originally announced January 2024.

Comments: 9 pages

arXiv:2401.05343 [pdf, other]

Spectral Topological Data Analysis of Brain Signals

Authors: Anass B. El-Yaagoubi, Shuhao Jiao, Moo K. Chung, Hernando Ombao

Abstract: Topological data analysis (TDA) has become a powerful approach over the last twenty years, mainly due to its ability to capture the shape and the geometry inherent in the data. Persistence homology, which is a particular tool in TDA, has been demonstrated to be successful in analyzing functional brain connectivity. One limitation of standard approaches is that they use arbitrarily chosen threshold… ▽ More Topological data analysis (TDA) has become a powerful approach over the last twenty years, mainly due to its ability to capture the shape and the geometry inherent in the data. Persistence homology, which is a particular tool in TDA, has been demonstrated to be successful in analyzing functional brain connectivity. One limitation of standard approaches is that they use arbitrarily chosen threshold values for analyzing connectivity matrices. To overcome this weakness, TDA provides a filtration of the weighted brain network across a range of threshold values. However, current analyses of the topological structure of functional brain connectivity primarily rely on overly simplistic connectivity measures, such as the Pearson orrelation. These measures do not provide information about the specific oscillators that drive dependence within the brain network. Here, we develop a frequency-specific approach that utilizes coherence, a measure of dependence in the spectral domain, to evaluate the functional connectivity of the brain. Our approach, the spectral TDA (STDA), has the ability to capture more nuanced and detailed information about the underlying brain networks. The proposed STDA method leads to a novel topological summary, the spectral landscape, which is a 2D-generalization of the persistence landscape. Using the novel spectral landscape, we analyze the EEG brain connectivity of patients with attention deficit hyperactivity disorder (ADHD) and shed light on the frequency-specific differences in the topology of brain connectivity between the controls and ADHD patients. △ Less

Submitted 1 December, 2023; originally announced January 2024.

Comments: 28 pages, 23 figures

arXiv:2310.05398 [pdf, other]

Statistical Inference for Modulation Index in Phase-Amplitude Coupling

Authors: Marco Antonio Pinto-Orellana, Hernando Ombao, Beth Lopour

Abstract: Phase-amplitude coupling is a phenomenon observed in several neurological processes, where the phase of one signal modulates the amplitude of another signal with a distinct frequency. The modulation index (MI) is a common technique used to quantify this interaction by assessing the Kullback-Leibler divergence between a uniform distribution and the empirical conditional distribution of amplitudes w… ▽ More Phase-amplitude coupling is a phenomenon observed in several neurological processes, where the phase of one signal modulates the amplitude of another signal with a distinct frequency. The modulation index (MI) is a common technique used to quantify this interaction by assessing the Kullback-Leibler divergence between a uniform distribution and the empirical conditional distribution of amplitudes with respect to the phases of the observed signals. The uniform distribution is an ideal representation that is expected to appear under the absence of coupling. However, it does not reflect the statistical properties of coupling values caused by random chance. In this paper, we propose a statistical framework for evaluating the significance of an observed MI value based on a null hypothesis that a MI value can be entirely explained by chance. Significance is obtained by comparing the value with a reference distribution derived under the null hypothesis of independence (i.e., no coupling) between signals. We derived a closed-form distribution of this null model, resulting in a scaled beta distribution. To validate the efficacy of our proposed framework, we conducted comprehensive Monte Carlo simulations, assessing the significance of MI values under various experimental scenarios, including amplitude modulation, trains of spikes, and sequences of high-frequency oscillations. Furthermore, we corroborated the reliability of our model by comparing its statistical significance thresholds with reported values from other research studies conducted under different experimental settings. Our method offers several advantages such as meta-analysis reliability, simplicity and computational efficiency, as it provides p-values and significance levels without resorting to generating surrogate data through sampling procedures. △ Less

Submitted 9 October, 2023; originally announced October 2023.

arXiv:2307.16275 [pdf, other]

Stylized Projected GAN: A Novel Architecture for Fast and Realistic Image Generation

Authors: Md Nurul Muttakin, Malik Shahid Sultan, Robert Hoehndorf, Hernando Ombao

Abstract: Generative Adversarial Networks are used for generating the data using a generator and a discriminator, GANs usually produce high-quality images, but training GANs in an adversarial setting is a difficult task. GANs require high computation power and hyper-parameter regularization for converging. Projected GANs tackle the training difficulty of GANs by using transfer learning to project the genera… ▽ More Generative Adversarial Networks are used for generating the data using a generator and a discriminator, GANs usually produce high-quality images, but training GANs in an adversarial setting is a difficult task. GANs require high computation power and hyper-parameter regularization for converging. Projected GANs tackle the training difficulty of GANs by using transfer learning to project the generated and real samples into a pre-trained feature space. Projected GANs improve the training time and convergence but produce artifacts in the generated images which reduce the quality of the generated samples, we propose an optimized architecture called Stylized Projected GANs which integrates the mapping network of the Style GANs with Skip Layer Excitation of Fast GAN. The integrated modules are incorporated within the generator architecture of the Fast GAN to mitigate the problem of artifacts in the generated images. △ Less

Submitted 30 July, 2023; originally announced July 2023.

Comments: We present a new architecture for generating realistic images by combining mapping network of Style GANs and Projected GANs

arXiv:2306.07912 [pdf, other]

doi 10.1007/978-981-99-0803-5

Topological Data Analysis for Directed Dependence Networks of Multivariate Time Series Data

Authors: Anass B. El-Yaagoubi, Hernando Ombao

Abstract: Topological data analysis (TDA) approaches are becoming increasingly popular for studying the dependence patterns in multivariate time series data. In particular, various dependence patterns in brain networks may be linked to specific tasks and cognitive processes, which can be altered by various neurological impairments such as epileptic seizures. Existing TDA approaches rely on the notion of dis… ▽ More Topological data analysis (TDA) approaches are becoming increasingly popular for studying the dependence patterns in multivariate time series data. In particular, various dependence patterns in brain networks may be linked to specific tasks and cognitive processes, which can be altered by various neurological impairments such as epileptic seizures. Existing TDA approaches rely on the notion of distance between data points that is symmetric by definition for building graph filtrations. For brain dependence networks, this is a major limitation that constrains practitioners to using only symmetric dependence measures, such as correlations or coherence. However, it is known that the brain dependence network may be very complex and can contain a directed flow of information from one brain region to another. Such dependence networks are usually captured by more advanced measures of dependence such as partial directed coherence, which is a Granger causality based dependence measure. These dependence measures will result in a non-symmetric distance function, especially during epileptic seizures. In this paper we propose to solve this limitation by decomposing the weighted connectivity network into its symmetric and anti-symmetric components using matrix decomposition and comparing the anti-symmetric component prior to and post seizure. Our analysis of epileptic seizure EEG data shows promising results. △ Less

Submitted 13 June, 2023; originally announced June 2023.

arXiv:2305.19511 [pdf, other]

An MCMC Approach to Bayesian Image Analysis in Fourier Space

Authors: Konstantinos Bakas, John Kornak, Hernando Ombao

Abstract: Bayesian methods are commonly applied to solve image analysis problems such as noise-reduction, feature enhancement and object detection. A primary limitation of these approaches is the computational complexity due to the interdependence of neighboring pixels which limits the ability to perform full posterior sampling through Markov chain Monte Carlo (MCMC). To alleviate this problem, we develop a… ▽ More Bayesian methods are commonly applied to solve image analysis problems such as noise-reduction, feature enhancement and object detection. A primary limitation of these approaches is the computational complexity due to the interdependence of neighboring pixels which limits the ability to perform full posterior sampling through Markov chain Monte Carlo (MCMC). To alleviate this problem, we develop a new posterior sampling method that is based on modeling the prior and likelihood in the space of the Fourier transform of the image. One advantage of Fourier-based methods is that many spatially correlated processes in image space can be represented via independent processes over Fourier space. A recent approach known as Bayesian Image Analysis in Fourier Space (or BIFS), has introduced parameter functions to describe prior expectations about image properties in Fourier space. To date BIFS has relied on Maximum a Posteriori (MAP) estimation for generating posterior estimates; providing just a single point estimate. The work presented here develops a posterior sampling approach for BIFS that can explore the full posterior distribution while continuing to take advantage of the independence modeling over Fourier space. As a result computational efficiency is improved over that for conventional Bayesian image analysis and mixing concerns that commonly have to be dealt with in high dimensional Markov chain Monte Carlo sampling problems are avoided. Implementation results and details are provided using simulated data. △ Less

Submitted 30 May, 2023; originally announced May 2023.

Comments: 15 pages, 5 figures

arXiv:2305.10878 [pdf, other]

Multi-scale wavelet coherence with its applications

Authors: Haibo Wu, MI Knight, H Ombao

Abstract: The goal in this paper is to develop a novel statistical approach to characterize functional interactions between channels in a brain network. Wavelets are effective for capturing transient properties of non-stationary signals because they have compact support that can be compressed or stretched according to the dynamic properties of the signal. Wavelets give a multi-scale decomposition of signals… ▽ More The goal in this paper is to develop a novel statistical approach to characterize functional interactions between channels in a brain network. Wavelets are effective for capturing transient properties of non-stationary signals because they have compact support that can be compressed or stretched according to the dynamic properties of the signal. Wavelets give a multi-scale decomposition of signals and thus can be few for studying potential cross-scale interactions between signals. To achieve this, we develop the scale-specific sub-processes of a multivariate locally stationary wavelet stochastic process. Under this proposed framework, a novel cross-scale dependence measure is developed. This provides a measure for dependence structure of components at different scales of multivariate time series. Extensive simulation studies are conducted to demonstrate that the theoretical properties hold in practice. The proposed cross-scale analysis is applied to the electroencephalogram (EEG) data to study alterations in the functional connectivity structure in children diagnosed with attention deficit hyperactivity disorder (ADHD). Our approach identified novel interesting cross-scale interactions between channels in the brain network. The proposed framework can be applied to other signals, which can also capture the statistical association between the stocks at different time scales. △ Less

Submitted 18 May, 2023; originally announced May 2023.

Comments: 43 pages, 20 figures in the paper

arXiv:2305.08790 [pdf, other]

Bayesian Nonparametric Multivariate Mixture of Autoregressive Processes: With Application to Brain Signals

Authors: Guillermo Granados-Garcia, Raquel Prado, Hernando Ombao

Abstract: One of the goals of neuroscience is to study interactions between different brain regions during rest and while performing specific cognitive tasks. The Multivariate Bayesian Autoregressive Decomposition (MBMARD) is proposed as an intuitive and novel Bayesian non-parametric model to represent high-dimensional signals as a low-dimensional mixture of univariate uncorrelated latent oscillations. Each… ▽ More One of the goals of neuroscience is to study interactions between different brain regions during rest and while performing specific cognitive tasks. The Multivariate Bayesian Autoregressive Decomposition (MBMARD) is proposed as an intuitive and novel Bayesian non-parametric model to represent high-dimensional signals as a low-dimensional mixture of univariate uncorrelated latent oscillations. Each latent oscillation captures a specific underlying oscillatory activity and hence will be modeled as a unique second-order autoregressive process due to a compelling property that its spectral density has a shape characterized by a unique frequency peak and bandwidth, which are parameterized by a location and a scale parameter. The posterior distributions of the parameters of the latent oscillations are computed via a metropolis-within-Gibbs algorithm. One of the advantages of MBMARD is its robustness against misspecification of standard models which is demonstrated in simulation studies. The main scientific questions addressed by MBMARD are the effects of long-term abuse of alcohol consumption on memory by analyzing EEG records of alcoholic and non-alcoholic subjects performing a visual recognition experiment. The MBMARD model exhibited novel interesting findings including identifying subject-specific clusters of low and high-frequency oscillations among different brain regions. △ Less

Submitted 15 May, 2023; originally announced May 2023.

arXiv:2303.06384 [pdf, other]

Measuring Information Transfer Between Nodes in a Brain Network through Spectral Transfer Entropy

Authors: Paolo Victor Redondo, Raphael Huser, Hernando Ombao

Abstract: Brain connectivity reflects how different regions of the brain interact during performance of a cognitive task. In studying brain signals such as electroencephalograms (EEG), this may be explored via an information-theoretic causal measure, called transfer entropy (TE), which does not impose any distributional assumption on the variables and covers any form of relationship (beyond linear) between… ▽ More Brain connectivity reflects how different regions of the brain interact during performance of a cognitive task. In studying brain signals such as electroencephalograms (EEG), this may be explored via an information-theoretic causal measure, called transfer entropy (TE), which does not impose any distributional assumption on the variables and covers any form of relationship (beyond linear) between them. To improve utility of TE in brain signal analysis, we propose a novel methodology to capture cross-channel information transfer in the frequency domain. Specifically, we introduce a new causal measure, the spectral transfer entropy (STE), to quantify the magnitude and direction of information flow from a certain frequency-band oscillation of a channel to an oscillation of another channel. In contrast with previous works on TE in the frequency domain, we differentiate our work by considering an extreme value perspective that employs the maximum magnitude of filtered series within time blocks. The main advantages of our proposed approach is that it is robust to the inherent problems of linear filtering and allows adjustments for multiple comparisons to control family-wise error rate (FWER). Another novel contribution is a simple yet efficient estimation method based on the combination vine copulas and extreme value theory that enables estimates to capture zero (boundary point) without the need for bias adjustments. With the vine copula representation, a null copula model, which exhibits zero STE, is defined, making significance testing for STE straightforward through a standard resampling approach. Lastly, we illustrate the advantage of our proposed measure through some numerical experiments and provide interesting and novel findings on the analysis of EEG recordings linked to a visual task. △ Less

Submitted 25 May, 2023; v1 submitted 11 March, 2023; originally announced March 2023.

arXiv:2302.09978 [pdf, other]

An Improved Unbiased Particle Filter

Authors: Ajay Jasra, Mohamed Maama, Hernando Ombao

Abstract: In this paper we consider the filtering of partially observed multi-dimensional diffusion processes that are observed regularly at discrete times. We assume that, for numerical reasons, one has to time-discretize the diffusion process which typically leads to filtering that is subject to discretization bias. The approach in [16] establishes that when only having access to the time-discretized diff… ▽ More In this paper we consider the filtering of partially observed multi-dimensional diffusion processes that are observed regularly at discrete times. We assume that, for numerical reasons, one has to time-discretize the diffusion process which typically leads to filtering that is subject to discretization bias. The approach in [16] establishes that when only having access to the time-discretized diffusion it is possible to remove the discretization bias with an estimator of finite variance. We improve on the method in [16] by introducing a modified estimator based on the recent work of [17]. We show that this new estimator is unbiased and has finite variance. Moreover, we conjecture and verify in numerical simulations that substantial gains are obtained. That is, for a given mean square error (MSE) and a particular class of multi-dimensional diffusion, the cost to achieve the said MSE falls. △ Less

Submitted 20 February, 2023; originally announced February 2023.

arXiv:2301.12371 [pdf, other]

doi 10.1017/apr.2024.12

Antithetic Multilevel Particle Filters

Authors: Ajay Jasra, Mohamed Maama, Hernando Ombao

Abstract: In this paper we consider the filtering of partially observed multi-dimensional diffusion processes that are observed regularly at discrete times. This is a challenging problem which requires the use of advanced numerical schemes based upon time-discretization of the diffusion process and then the application of particle filters. Perhaps the state-of-the-art method for moderate dimensional problem… ▽ More In this paper we consider the filtering of partially observed multi-dimensional diffusion processes that are observed regularly at discrete times. This is a challenging problem which requires the use of advanced numerical schemes based upon time-discretization of the diffusion process and then the application of particle filters. Perhaps the state-of-the-art method for moderate dimensional problems is the multilevel particle filter of \cite{mlpf}. This is a method that combines multilevel Monte Carlo and particle filters. The approach in that article is based intrinsically upon an Euler discretization method. We develop a new particle filter based upon the antithetic truncated Milstein scheme of \cite{ml_anti}. We show that for a class of diffusion problems, for $ε>0$ given, that the cost to produce a mean square error (MSE) in estimation of the filter, of $\mathcal{O}(ε^2)$ is $\mathcal{O}(ε^{-2}\log(ε)^2)$. In the case of multidimensional diffusions with non-constant diffusion coefficient, the method of \cite{mlpf} has a cost of $\mathcal{O}(ε^{-2.5})$ to achieve the same MSE. We support our theory with numerical results in several examples. △ Less

Submitted 29 January, 2023; originally announced January 2023.

arXiv:2212.05316 [pdf, other]

Graph-Regularized Manifold-Aware Conditional Wasserstein GAN for Brain Functional Connectivity Generation

Authors: Yee-Fan Tan, Chee-Ming Ting, Fuad Noman, Raphaël C. -W. Phan, Hernando Ombao

Abstract: Common measures of brain functional connectivity (FC) including covariance and correlation matrices are semi-positive definite (SPD) matrices residing on a cone-shape Riemannian manifold. Despite its remarkable success for Euclidean-valued data generation, use of standard generative adversarial networks (GANs) to generate manifold-valued FC data neglects its inherent SPD structure and hence the in… ▽ More Common measures of brain functional connectivity (FC) including covariance and correlation matrices are semi-positive definite (SPD) matrices residing on a cone-shape Riemannian manifold. Despite its remarkable success for Euclidean-valued data generation, use of standard generative adversarial networks (GANs) to generate manifold-valued FC data neglects its inherent SPD structure and hence the inter-relatedness of edges in real FC. We propose a novel graph-regularized manifold-aware conditional Wasserstein GAN (GR-SPD-GAN) for FC data generation on the SPD manifold that can preserve the global FC structure. Specifically, we optimize a generalized Wasserstein distance between the real and generated SPD data under an adversarial training, conditioned on the class labels. The resulting generator can synthesize new SPD-valued FC matrices associated with different classes of brain networks, e.g., brain disorder or healthy control. Furthermore, we introduce additional population graph-based regularization terms on both the SPD manifold and its tangent space to encourage the generator to respect the inter-subject similarity of FC patterns in the real data. This also helps in avoiding mode collapse and produces more stable GAN training. Evaluated on resting-state functional magnetic resonance imaging (fMRI) data of major depressive disorder (MDD), qualitative and quantitative results show that the proposed GR-SPD-GAN clearly outperforms several state-of-the-art GANs in generating more realistic fMRI-based FC samples. When applied to FC data augmentation for MDD identification, classification models trained on augmented data generated by our approach achieved the largest margin of improvement in classification accuracy among the competing GANs over baselines without data augmentation. △ Less

Submitted 10 December, 2022; originally announced December 2022.

Comments: 10 pages, 4 figures

arXiv:2212.04338 [pdf, other]

Club Exco: clustering brain extreme communities from multi-channel EEG data

Authors: Matheus B. Guerrero, Hernando Ombao, Raphaël Huser

Abstract: Current methods for clustering nodes over time in a brain network are determined by cross-dependence measures, which are computed from the entire range of values of the electroencephalogram (EEG) signals, from low to high amplitudes. We here developed the Club Exco method for clustering brain communities that exhibit synchronized extreme behaviors. To cluster multi-channel EEG data, Club-Exco uses… ▽ More Current methods for clustering nodes over time in a brain network are determined by cross-dependence measures, which are computed from the entire range of values of the electroencephalogram (EEG) signals, from low to high amplitudes. We here developed the Club Exco method for clustering brain communities that exhibit synchronized extreme behaviors. To cluster multi-channel EEG data, Club-Exco uses a spherical $k$-means procedure applied to the ``pseudo-angles,'' derived from extreme absolute amplitudes of EEG signals. With this approach, a cluster center is considered an ``extremal prototype,'' revealing a community of EEG nodes sharing the same extreme behavior, a feature that traditional methods fail to identify. Hence, Club Exco serves as an exploratory tool to classify EEG channels into mutually asymptotically dependent or asymptotically independent groups. It provides insights into how the brain network organizes itself during an extreme event (e.g., an epileptic seizure) in contrast to a baseline state. We apply the Club Exco method to investigate temporal differences in EEG brain connectivity networks of a patient diagnosed with epilepsy, a chronic neurological disorder affecting more than 50 million people globally. Our extreme-value method reveals substantial differences in alpha (8--12 Hertz) oscillations across the brain network compared to coherence-based methods. △ Less

Submitted 8 December, 2022; originally announced December 2022.

arXiv:2211.00296 [pdf, other]

Bayesian Parameter Inference for Partially Observed SDEs driven by Fractional Brownian Motion

Authors: Mohamed Maama, Ajay Jasra, Hernando Ombao

Abstract: In this paper we consider Bayesian parameter inference for partially observed fractional Brownian motion (fBM) models. The approach we follow is to time-discretize the hidden process and then to design Markov chain Monte Carlo (MCMC) algorithms to sample from the posterior density on the parameters given data. We rely on a novel representation of the time discretization, which seeks to sample from… ▽ More In this paper we consider Bayesian parameter inference for partially observed fractional Brownian motion (fBM) models. The approach we follow is to time-discretize the hidden process and then to design Markov chain Monte Carlo (MCMC) algorithms to sample from the posterior density on the parameters given data. We rely on a novel representation of the time discretization, which seeks to sample from an approximation of the posterior and then corrects via importance sampling; the approximation reduces the time (in terms of total observation time T) by O(T). This method is extended by using a multilevel MCMC method which can reduce the computational cost to achieve a given mean square error (MSE) versus using a single time discretization. Our methods are illustrated on simulated and real data. △ Less

Submitted 1 November, 2022; originally announced November 2022.

arXiv:2210.09092 [pdf, other]

Dynamic Topological Data Analysis of Functional Human Brain Networks

Authors: Moo K. Chung, Soumya Das, Hernando Ombao

Abstract: Developing reliable methods to discriminate different transient brain states that change over time is a key neuroscientific challenge in brain imaging studies. Topological data analysis (TDA), a novel framework based on algebraic topology, can handle such a challenge. However, existing TDA has been somewhat limited to capturing the static summary of dynamically changing brain networks. We propose… ▽ More Developing reliable methods to discriminate different transient brain states that change over time is a key neuroscientific challenge in brain imaging studies. Topological data analysis (TDA), a novel framework based on algebraic topology, can handle such a challenge. However, existing TDA has been somewhat limited to capturing the static summary of dynamically changing brain networks. We propose a novel dynamic-TDA framework that builds persistent homology over a time series of brain networks. We construct a Wasserstein distance based inference procedure to discriminate between time series of networks. The method is applied to the resting-state functional magnetic resonance images of human brain. We demonstrate that our proposed dynamic-TDA approach can distinctly discriminate between the topological patterns of male and female brain networks. MATLAB code for implementing this method is available at https://github.com/laplcebeltrami/PH-STAT. △ Less

Submitted 18 December, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

Comments: In press in journal Foundations of Data Science

arXiv:2209.10416 [pdf, other]

Modeling and Simulating Dependence in Networks Using Topological Data Analysis

Authors: Anass El Yaagoubi Bourakna, Moo K. Chung, Hernando Ombao

Abstract: Topological data analysis (TDA) approaches are becoming increasingly popular for studying the dependence patterns in multivariate time series data. In particular, various dependence patterns in brain networks may be linked to specific tasks and cognitive processes, which can be altered by various neurological and cognitive impairments such as Alzheimer's and Parkinson's diseases, as well as attent… ▽ More Topological data analysis (TDA) approaches are becoming increasingly popular for studying the dependence patterns in multivariate time series data. In particular, various dependence patterns in brain networks may be linked to specific tasks and cognitive processes, which can be altered by various neurological and cognitive impairments such as Alzheimer's and Parkinson's diseases, as well as attention deficit hyperactivity disorder (ADHD). Because there is no ground-truth with known dependence patterns in real brain signals, testing new TDA methods on multivariate time series is still a challenge. Simulations are crucial for evaluating the performance of proposed TDA methods and testing procedures as well as for creating computation-based confidence intervals. To our knowledge, there are no methods that simulate multivariate time series data with specific and manually imposed connectivity patterns. In this paper we present a novel approach to simulate multivariate time series with specific number of cycles/holes in its dependence network. Furthermore, we also provide a procedure for generating higher dimensional topological features. △ Less

Submitted 21 September, 2022; originally announced September 2022.

arXiv:2208.03703 [pdf, other]

Granger Causality using Neural Networks

Authors: Samuel Horvath, Malik Shahid Sultan, Hernando Ombao

Abstract: The Granger Causality (GC) test is a famous statistical hypothesis test for investigating if the past of one time series affects the future of the other. It helps in answering the question whether one time series is helpful in forecasting. Standard traditional approaches to Granger causality detection commonly assume linear dynamics, but such simplification does not hold in many real-world applica… ▽ More The Granger Causality (GC) test is a famous statistical hypothesis test for investigating if the past of one time series affects the future of the other. It helps in answering the question whether one time series is helpful in forecasting. Standard traditional approaches to Granger causality detection commonly assume linear dynamics, but such simplification does not hold in many real-world applications, e.g., neuroscience or genomics that are inherently non-linear. In such cases, imposing linear models such as Vector Autoregressive (VAR) models can lead to inconsistent estimation of true Granger Causal interactions. Machine Learning (ML) can learn the hidden patterns in the datasets specifically Deep Learning (DL) has shown tremendous promise in learning the non-linear dynamics of complex systems. Recent work of Tank et al propose to overcome the issue of linear simplification in VAR models by using neural networks combined with sparsity-inducing penalties on the learn-able weights. In this work, we build upon ideas introduced by Tank et al. We propose several new classes of models that can handle underlying non-linearity. Firstly, we present the Learned Kernal VAR(LeKVAR) model-an extension of VAR models that also learns kernel parametrized by a neural net. Secondly, we show one can directly decouple lags and individual time series importance via decoupled penalties. This decoupling provides better scaling and allows us to embed lag selection into RNNs. Lastly, we propose a new training algorithm that supports mini-batching, and it is compatible with commonly used adaptive optimizers such as Adam.he proposed techniques are evaluated on several simulated datasets inspired by real-world applications.We also apply these methods to the Electro-Encephalogram (EEG) data for an epilepsy patient to study the evolution of GC before , during and after seizure across the 19 EEG channels. △ Less

Submitted 7 August, 2022; originally announced August 2022.

Comments: To be Submitted to a Journal work Presented at JSM. arXiv admin note: text overlap with arXiv:1802.05842 by other authors

arXiv:2208.02024 [pdf, other]

Time-Varying Dispersion Integer-Valued GARCH Models

Authors: Wagner Barreto-Souza, Luiza S. C. Piancastelli, Konstantinos Fokianos, Hernando Ombao

Abstract: We propose a general class of INteger-valued Generalized AutoRegressive Conditionally Heteroscedastic (INGARCH) processes by allowing time-varying mean and dispersion parameters, which we call time-varying dispersion INGARCH (tv-DINGARCH) models. More specifically, we consider mixed Poisson INGARCH models and allow for dynamic modeling of the dispersion parameter (as well as the mean), similar to… ▽ More We propose a general class of INteger-valued Generalized AutoRegressive Conditionally Heteroscedastic (INGARCH) processes by allowing time-varying mean and dispersion parameters, which we call time-varying dispersion INGARCH (tv-DINGARCH) models. More specifically, we consider mixed Poisson INGARCH models and allow for dynamic modeling of the dispersion parameter (as well as the mean), similar to the spirit of the ordinary GARCH models. We derive conditions to obtain first and second-order stationarity, and ergodicity as well. Estimation of the parameters is addressed and their associated asymptotic properties are established as well. A restricted bootstrap procedure is proposed for testing constant dispersion against time-varying dispersion. Monte Carlo simulation studies are presented for checking point estimation, standard errors, and the performance of the restricted bootstrap approach. We apply the tv-DINGARCH process to model the weekly number of reported measles infections in North Rhine-Westphalia, Germany, from January 2001 to May 2013, and compare its performance to the ordinary INGARCH approach. △ Less

Submitted 30 May, 2024; v1 submitted 3 August, 2022; originally announced August 2022.

Comments: Paper submitted for publication

arXiv:2208.00292 [pdf, other]

Functional-Coefficient Models for Multivariate Time Series in Designed Experiments: with Applications to Brain Signals

Authors: Paolo Victor Redondo, Raphaël Huser, Hernando Ombao

Abstract: To study the neurophysiological basis of attention deficit hyperactivity disorder (ADHD), clinicians use electroencephalography (EEG) which record neuronal electrical activity on the cortex. The most commonly-used metric in ADHD is the theta-to-beta spectral power ratio (TBR) that is based on a single-channel analysis. However, initial findings for this measure have not been replicated in other st… ▽ More To study the neurophysiological basis of attention deficit hyperactivity disorder (ADHD), clinicians use electroencephalography (EEG) which record neuronal electrical activity on the cortex. The most commonly-used metric in ADHD is the theta-to-beta spectral power ratio (TBR) that is based on a single-channel analysis. However, initial findings for this measure have not been replicated in other studies. Thus, instead of focusing on single-channel spectral power, a novel model for investigating interactions (dependence) between channels in the entire network is proposed. Although dependence measures such as coherence and partial directed coherence (PDC) are well explored in studying brain connectivity, these measures only capture linear dependence. Moreover, in designed clinical experiments, these dependence measures are observed to vary across subjects even within a homogeneous group. To address these limitations, we propose the mixed-effects functional-coefficient autoregressive (MX-FAR) model which captures between-subject variation by incorporating subject-specific random effects. The advantages of the MX-FAR model are the following: (1.) it captures potential non-linear dependence between channels; (2.) it is nonparametric and hence flexible and robust to model mis-specification; (3.) it can capture differences between groups when they exist; (4.) it accounts for variation across subjects; (5.) the framework easily incorporates well-known inference methods from mixed-effects models; (6.) it can be generalized to accommodate various covariates and factors. Finally, we apply the proposed MX-FAR model to analyze multichannel EEG signals and report novel findings on altered brain functional networks in ADHD. △ Less

Submitted 8 August, 2022; v1 submitted 30 July, 2022; originally announced August 2022.

arXiv:2204.13799 [pdf, other]

doi 10.3390/e25111509

Topological Data Analysis for Multivariate Time Series Data

Authors: Anass El Yaagoubi Bourakna, Moo K. Chung, Hernando Ombao

Abstract: Over the last two decades, topological data analysis (TDA) has emerged as a very powerful data analytic approach which can deal with various data modalities of varying complexities. One of the most commonly used tools in TDA is persistent homology (PH) which can extract topological properties from data at various scales. Our aim in this article is to introduce TDA concepts to a statistical audienc… ▽ More Over the last two decades, topological data analysis (TDA) has emerged as a very powerful data analytic approach which can deal with various data modalities of varying complexities. One of the most commonly used tools in TDA is persistent homology (PH) which can extract topological properties from data at various scales. Our aim in this article is to introduce TDA concepts to a statistical audience and provide an approach to analyze multivariate time series data. The application focus will be on multivariate brain signals and brain connectivity networks. Finally, the paper concludes with an overview of some open problems and potential application of TDA to modeling directionality in a brain network as well as the casting of TDA in the context of mixed effects models to capture variations in the topological properties of data collected from multiple subjects △ Less

Submitted 28 April, 2022; originally announced April 2022.

arXiv:2202.10162 [pdf, other]

Poisson-Birnbaum-Saunders Regression Model for Clustered Count Data

Authors: Jussiane Nader Gonçalves, Wagner Barreto-Souza, Hernando Ombao

Abstract: The premise of independence among subjects in the same cluster/group often fails in practice, and models that rely on such untenable assumption can produce misleading results. To overcome this severe deficiency, we introduce a new regression model to handle overdispersed and correlated clustered counts. To account for correlation within clusters, we propose a Poisson regression model where the obs… ▽ More The premise of independence among subjects in the same cluster/group often fails in practice, and models that rely on such untenable assumption can produce misleading results. To overcome this severe deficiency, we introduce a new regression model to handle overdispersed and correlated clustered counts. To account for correlation within clusters, we propose a Poisson regression model where the observations within the same cluster are driven by the same latent random effect that follows the Birnbaum-Saunders distribution with a parameter that controls the strength of dependence among the individuals. This novel multivariate count model is called Clustered Poisson Birnbaum-Saunders (CPBS) regression. As illustrated in this paper, the CPBS model is analytically tractable, and its moment structure can be explicitly obtained. Estimation of parameters is performed through the maximum likelihood method, and an Expectation-Maximization (EM) algorithm is also developed. Simulation results to evaluate the finite-sample performance of our proposed estimators are presented. We also discuss diagnostic tools for checking model adequacy. An empirical application concerning the number of inpatient admissions by individuals to hospital emergency rooms, from the Medical Expenditure Panel Survey (MEPS) conducted by the United States Agency for Health Research and Quality, illustrates the usefulness of our proposed methodology. △ Less

Submitted 21 February, 2022; originally announced February 2022.

Comments: Paper submitted for publication

arXiv:2107.12838 [pdf, other]

doi 10.1109/JBHI.2024.3351177

Graph Autoencoders for Embedding Learning in Brain Networks and Major Depressive Disorder Identification

Authors: Fuad Noman, Chee-Ming Ting, Hakmook Kang, Raphael C. -W. Phan, Brian D. Boyd, Warren D. Taylor, Hernando Ombao

Abstract: Brain functional connectivity (FC) reveals biomarkers for identification of various neuropsychiatric disorders. Recent application of deep neural networks (DNNs) to connectome-based classification mostly relies on traditional convolutional neural networks using input connectivity matrices on a regular Euclidean grid. We propose a graph deep learning framework to incorporate the non-Euclidean infor… ▽ More Brain functional connectivity (FC) reveals biomarkers for identification of various neuropsychiatric disorders. Recent application of deep neural networks (DNNs) to connectome-based classification mostly relies on traditional convolutional neural networks using input connectivity matrices on a regular Euclidean grid. We propose a graph deep learning framework to incorporate the non-Euclidean information about graph structure for classifying functional magnetic resonance imaging (fMRI)-derived brain networks in major depressive disorder (MDD). We design a novel graph autoencoder (GAE) architecture based on the graph convolutional networks (GCNs) to embed the topological structure and node content of large-sized fMRI networks into low-dimensional latent representations. In network construction, we employ the Ledoit-Wolf (LDW) shrinkage method to estimate the high-dimensional FC metrics efficiently from fMRI data. We consider both supervised and unsupervised approaches for the graph embedding learning. The learned embeddings are then used as feature inputs for a deep fully-connected neural network (FCNN) to discriminate MDD from healthy controls. Evaluated on two resting-state fMRI (rs-fMRI) MDD datasets, results show that the proposed GAE-FCNN model significantly outperforms several state-of-the-art methods for brain connectome classification, achieving the best accuracy using the LDW-FC edges as node features. The graph embeddings of fMRI FC networks learned by the GAE also reveal apparent group differences between MDD and HC. Our new framework demonstrates feasibility of learning graph embeddings on brain networks to provide discriminative information for diagnosis of brain disorders. △ Less

Submitted 2 June, 2022; v1 submitted 27 July, 2021; originally announced July 2021.

arXiv:2107.09160 [pdf, other]

BICNet: A Bayesian Approach for Estimating Task Effects on Intrinsic Connectivity Networks in fMRI Data

Authors: Meini Tang, Chee-Ming Ting, Hernando Ombao

Abstract: Intrinsic connectivity networks (ICNs) are specific dynamic functional brain networks that are consistently found under various conditions including rest and task. Studies have shown that some stimuli actually activate intrinsic connectivity through either suppression, excitation, moderation or modification. Nevertheless, the structure of ICNs and task-related effects on ICNs are not yet fully und… ▽ More Intrinsic connectivity networks (ICNs) are specific dynamic functional brain networks that are consistently found under various conditions including rest and task. Studies have shown that some stimuli actually activate intrinsic connectivity through either suppression, excitation, moderation or modification. Nevertheless, the structure of ICNs and task-related effects on ICNs are not yet fully understood. In this paper, we propose a Bayesian Intrinsic Connectivity Network (BICNet) model to identify the ICNs and quantify the task-related effects on the ICN dynamics. Using an extended Bayesian dynamic sparse latent factor model, the proposed BICNet has the following advantages: (1) it simultaneously identifies the individual ICNs and group-level ICN spatial maps; (2) it robustly identifies ICNs by jointly modeling resting-state functional magnetic resonance imaging (rfMRI) and task-related functional magnetic resonance imaging (tfMRI); (3) compared to independent component analysis (ICA)-based methods, it can quantify the difference of ICNs amplitudes across different states; (4) it automatically performs feature selection through the sparsity of the ICNs rather than ad-hoc thresholding. The proposed BICNet was applied to the rfMRI and language tfMRI data from the Human Connectome Project (HCP) and the analysis identified several ICNs related to distinct language processing functions. △ Less

Submitted 19 July, 2021; originally announced July 2021.

arXiv:2107.07561 [pdf, other]

Multivariate Conway-Maxwell-Poisson Distribution: Sarmanov Method and Doubly-Intractable Bayesian Inference

Authors: Luiza S. C. Piancastelli, Nial Friel, Wagner Barreto-Souza, Hernando Ombao

Abstract: In this paper, a multivariate count distribution with Conway-Maxwell (COM)-Poisson marginals is proposed. To do this, we develop a modification of the Sarmanov method for constructing multivariate distributions. Our multivariate COM-Poisson (MultCOMP) model has desirable features such as (i) it admits a flexible covariance matrix allowing for both negative and positive non-diagonal entries; (ii) i… ▽ More In this paper, a multivariate count distribution with Conway-Maxwell (COM)-Poisson marginals is proposed. To do this, we develop a modification of the Sarmanov method for constructing multivariate distributions. Our multivariate COM-Poisson (MultCOMP) model has desirable features such as (i) it admits a flexible covariance matrix allowing for both negative and positive non-diagonal entries; (ii) it overcomes the limitation of the existing bivariate COM-Poisson distributions in the literature that do not have COM-Poisson marginals; (iii) it allows for the analysis of multivariate counts and is not just limited to bivariate counts. Inferential challenges are presented by the likelihood specification as it depends on a number of intractable normalizing constants involving the model parameters. These obstacles motivate us to propose a Bayesian inferential approach where the resulting doubly-intractable posterior is dealt with via the exchange algorithm and the Grouped Independence Metropolis-Hastings algorithm. Numerical experiments based on simulations are presented to illustrate the proposed Bayesian approach. We analyze the potential of the MultCOMP model through a real data application on the numbers of goals scored by the home and away teams in the Premier League from 2018 to 2021. Here, our interest is to assess the effect of a lack of crowds during the COVID-19 pandemic on the well-known home team advantage. A MultCOMP model fit shows that there is evidence of a decreased number of goals scored by the home team, not accompanied by a reduced score from the opponent. Hence, our analysis suggests a smaller home team advantage in the absence of crowds, which agrees with the opinion of several football experts. △ Less

Submitted 15 July, 2021; originally announced July 2021.

Comments: Paper submitted for publication

arXiv:2106.05092 [pdf, other]

Markov-Switching State-Space Models with Applications to Neuroimaging

Authors: David Degras, Chee-Ming Ting, Hernando Ombao

Abstract: State-space models (SSM) with Markov switching offer a powerful framework for detecting multiple regimes in time series, analyzing mutual dependence and dynamics within regimes, and asserting transitions between regimes. These models however present considerable computational challenges due to the exponential number of possible regime sequences to account for. In addition, high dimensionality of t… ▽ More State-space models (SSM) with Markov switching offer a powerful framework for detecting multiple regimes in time series, analyzing mutual dependence and dynamics within regimes, and asserting transitions between regimes. These models however present considerable computational challenges due to the exponential number of possible regime sequences to account for. In addition, high dimensionality of time series can hinder likelihood-based inference. This paper proposes novel statistical methods for Markov-switching SSMs using maximum likelihood estimation, Expectation-Maximization (EM), and parametric bootstrap. We develop solutions for initializing the EM algorithm, accelerating convergence, and conducting inference that are ideally suited to massive spatio-temporal data such as brain signals. We evaluate these methods in simulations and present applications to EEG studies of epilepsy and of motor imagery. All proposed methods are implemented in a MATLAB toolbox available at https://github.com/ddegras/switch-ssm. △ Less

Submitted 9 June, 2021; originally announced June 2021.

arXiv:2106.01104 [pdf, other]

Filtrated Common Functional Principal Components for Multivariate Functional data

Authors: Shuhao Jiao, Ron D. Frostig, Hernando Ombao

Abstract: Local field potentials (LFPs) are signals that measure electrical activity in localized cortical regions from implanted tetrodes in the human or animal brain. The LFP signals are curves observed at multiple tetrodes which are implanted across a patch on the surface of the cortex. Hence, they can be treated as multi-group functional data, where the trajectories collected across temporal epochs from… ▽ More Local field potentials (LFPs) are signals that measure electrical activity in localized cortical regions from implanted tetrodes in the human or animal brain. The LFP signals are curves observed at multiple tetrodes which are implanted across a patch on the surface of the cortex. Hence, they can be treated as multi-group functional data, where the trajectories collected across temporal epochs from one tetrode are viewed as a group of functions. In many cases, multi-tetrode LFP trajectories contain both global variation patterns (which are shared in common to all groups, due to signal synchrony) and isolated variation patterns (common only to a small subset of groups), and such structure is very informative to the analysis of such data. Therefore, one goal in this paper is to develop an efficient procedure that is able to capture and quantify both global and isolated features. We propose a novel tree-structured functional principal components (filt-fPC) analysis through finite-dimensional functional representation - specifically via filtration. A major advantage of the proposed filt-fPC method is the ability to extract the components that are common to multiple groups (or tetrodes) in a flexible "multi-resolution" manner and simultaneously preserve the idiosyncratic individual components of different tetrodes. The proposed filt-fPC approach is highly data-driven and no "ground-truth" model pre-specification is needed, making it a suitable approach for analyzing multi-group functional data that is complex. In addition, the filt-fPC method is able to produce a parsimonious, interpretable, and efficient low dimensional representation of multi-group functional data with orthonormal basis functions. Here, the proposed filt-fPCA method is employed to study the impact of a shock (induced stroke) on the synchrony structure of the rat brain. △ Less

Submitted 26 November, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

arXiv:2105.06418 [pdf, other]

SCAU: Modeling spectral causality for multivariate time series with applications to electroencephalograms

Authors: Marco Antonio Pinto-Orellana, Peyman Mirtaheri, Hugo L. Hammer, Hernando Ombao

Abstract: Electroencephalograms (EEG) are noninvasive measurement signals of electrical neuronal activity in the brain. One of the current major statistical challenges is formally measuring functional dependency between those complex signals. This paper, proposes the spectral causality model (SCAU), a robust linear model, under a causality paradigm, to reflect inter- and intra-frequency modulation effects t… ▽ More Electroencephalograms (EEG) are noninvasive measurement signals of electrical neuronal activity in the brain. One of the current major statistical challenges is formally measuring functional dependency between those complex signals. This paper, proposes the spectral causality model (SCAU), a robust linear model, under a causality paradigm, to reflect inter- and intra-frequency modulation effects that cannot be identifiable using other methods. SCAU inference is conducted with three main steps: (a) signal decomposition into frequency bins, (b) intermediate spectral band mapping, and (c) dependency modeling through frequency-specific autoregressive models (VAR). We apply SCAU to study complex dependencies during visual and lexical fluency tasks (word generation and visual fixation) in 26 participants' EEGs. We compared the connectivity networks estimated using SCAU with respect to a VAR model. SCAU networks show a clear contrast for both stimuli while the magnitude links also denoted a low variance in comparison with the VAR networks. Furthermore, SCAU dependency connections not only were consistent with findings in the neuroscience literature, but it also provided further evidence on the directionality of the spatio-spectral dependencies such as the delta-originated and theta-induced links in the fronto-temporal brain network. △ Less

Submitted 13 May, 2021; originally announced May 2021.

arXiv:2105.00351 [pdf, other]

doi 10.1007/978-3-030-87444-5_8

Lattice Paths for Persistent Diagrams

Authors: Moo K. Chung, Hernando Ombao

Abstract: Persistent homology has undergone significant development in recent years. However, one outstanding challenge is to build a coherent statistical inference procedure on persistent diagrams. In this paper, we first present a new lattice path representation for persistent diagrams. We then develop a new exact statistical inference procedure for lattice paths via combinatorial enumerations. The lattic… ▽ More Persistent homology has undergone significant development in recent years. However, one outstanding challenge is to build a coherent statistical inference procedure on persistent diagrams. In this paper, we first present a new lattice path representation for persistent diagrams. We then develop a new exact statistical inference procedure for lattice paths via combinatorial enumerations. The lattice path method is applied to the topological characterization of the protein structures of the COVID-19 virus. We demonstrate that there are topological changes during the conformational change of spike proteins. △ Less

Submitted 30 July, 2021; v1 submitted 1 May, 2021; originally announced May 2021.

arXiv:2103.17240 [pdf, other]

Spectral Dependence

Authors: Hernando Ombao, Marco Pinto

Abstract: This paper presents a general framework for modeling dependence in multivariate time series. Its fundamental approach relies on decomposing each signal in a system into various frequency components and then studying the dependence properties through these oscillatory activities.The unifying theme across the paper is to explore the strength of dependence and possible lead-lag dynamics through filte… ▽ More This paper presents a general framework for modeling dependence in multivariate time series. Its fundamental approach relies on decomposing each signal in a system into various frequency components and then studying the dependence properties through these oscillatory activities.The unifying theme across the paper is to explore the strength of dependence and possible lead-lag dynamics through filtering. The proposed framework is capable of representing both linear and non-linear dependencies that could occur instantaneously or after some delay(lagged dependence). Examples for studying dependence between oscillations are illustrated through multichannel electroencephalograms. These examples emphasized that some of the most prominent frequency domain measures such as coherence, partial coherence,and dual-frequency coherence can be derived as special cases under this general framework.This paper also introduces related approaches for modeling dependence through phase-amplitude coupling and causality of (one-sided) filtered signals. △ Less

Submitted 31 March, 2021; originally announced March 2021.

arXiv:2103.03818 [pdf, other]

Time-varying $\ell_0$ optimization for Spike Inference from Multi-Trial Calcium Recordings

Authors: Tong Shen, Kevin Johnston, Gyorgy Lur, Michele Guindani, Hernando Ombao, Zhaoxia Yu

Abstract: Optical imaging of genetically encoded calcium indicators is a powerful tool to record the activity of a large number of neurons simultaneously over a long period of time from freely behaving animals. However, determining the exact time at which a neuron spikes and estimating the underlying firing rate from calcium fluorescence data remains challenging, especially for calcium imaging data obtained… ▽ More Optical imaging of genetically encoded calcium indicators is a powerful tool to record the activity of a large number of neurons simultaneously over a long period of time from freely behaving animals. However, determining the exact time at which a neuron spikes and estimating the underlying firing rate from calcium fluorescence data remains challenging, especially for calcium imaging data obtained from a longitudinal study. We propose a multi-trial time-varying $\ell_0$ penalized method to jointly detect spikes and estimate firing rates by robustly integrating evolving neural dynamics across trials. Our simulation study shows that the proposed method performs well in both spike detection and firing rate estimation. We demonstrate the usefulness of our method on calcium fluorescence trace data from two studies, with the first study showing differential firing rate functions between two behaviors and the second study showing evolving firing rate function across trials due to learning. △ Less

Submitted 1 March, 2021; originally announced March 2021.

arXiv:2103.02156 [pdf, other]

Ridge-penalized adaptive Mantel test and its application in imaging genetics

Authors: Dustin Pluta, Tong Shen, Gui Xue, Chuansheng Chen, Hernando Ombao, Zhaoxia Yu

Abstract: We propose a ridge-penalized adaptive Mantel test (AdaMant) for evaluating the association of two high-dimensional sets of features. By introducing a ridge penalty, AdaMant tests the association across many metrics simultaneously. We demonstrate how ridge penalization bridges Euclidean and Mahalanobis distances and their corresponding linear models from the perspective of association measurement a… ▽ More We propose a ridge-penalized adaptive Mantel test (AdaMant) for evaluating the association of two high-dimensional sets of features. By introducing a ridge penalty, AdaMant tests the association across many metrics simultaneously. We demonstrate how ridge penalization bridges Euclidean and Mahalanobis distances and their corresponding linear models from the perspective of association measurement and testing. This result is not only theoretically interesting but also has important implications in penalized hypothesis testing, especially in high dimensional settings such as imaging genetics. Applying the proposed method to an imaging genetic study of visual working memory in health adults, we identified interesting associations of brain connectivity (measured by EEG coherence) with selected genetic features. △ Less

Submitted 20 March, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

arXiv:2103.00209 [pdf, ps, other]

Statistical Inference for Local Granger Causality

Authors: Yan Liu, Masanobu Taniguchi, Hernando Ombao

Abstract: Granger causality has been employed to investigate causality relations between components of stationary multiple time series. We generalize this concept by developing statistical inference for local Granger causality for multivariate locally stationary processes. Our proposed local Granger causality approach captures time-evolving causality relationships in nonstationary processes. The proposed lo… ▽ More Granger causality has been employed to investigate causality relations between components of stationary multiple time series. We generalize this concept by developing statistical inference for local Granger causality for multivariate locally stationary processes. Our proposed local Granger causality approach captures time-evolving causality relationships in nonstationary processes. The proposed local Granger causality is well represented in the frequency domain and estimated based on the parametric time-varying spectral density matrix using the local Whittle likelihood. Under regularity conditions, we demonstrate that the estimators converge to multivariate normal in distribution. Additionally, the test statistic for the local Granger causality is shown to be asymptotically distributed as a quadratic form of a multivariate normal distribution. The finite sample performance is confirmed with several simulation studies for multivariate time-varying autoregressive models. For practical demonstration, the proposed local Granger causality method uncovered new functional connectivity relationships between channels in brain signals. Moreover, the method was able to identify structural changes in financial data. △ Less

Submitted 4 August, 2021; v1 submitted 27 February, 2021; originally announced March 2021.

Comments: 64 pages, 6 figures

MSC Class: 62M10; 62M15

arXiv:2102.12290 [pdf, other]

doi 10.4310/22-sii729

Smooth Online Parameter Estimation for time varying VAR models with application to rat's LFP data

Authors: Anass El Yaagoubi Bourakna, Marco Pinto, Norbert Fortin, Hernando Ombao

Abstract: Multivariate time series data appear often as realizations of non-stationary processes where the covariance matrix or spectral matrix smoothly evolve over time. Most of the current approaches estimate the time-varying spectral properties only retrospectively - that is, after the entire data has been observed. Retrospective estimation is a major limitation in many adaptive control applications wher… ▽ More Multivariate time series data appear often as realizations of non-stationary processes where the covariance matrix or spectral matrix smoothly evolve over time. Most of the current approaches estimate the time-varying spectral properties only retrospectively - that is, after the entire data has been observed. Retrospective estimation is a major limitation in many adaptive control applications where it is important to estimate these properties and detect changes in the system as they happen in real-time. One major obstacle in online estimation is the computational cost due to the high-dimensionality of the parameters. Existing methods such as the Kalman filter or local least squares are feasible. However, they are not always suitable because they provide noisy estimates and can become prohibitively costly as the dimension of the time series increases. In our brain signal application, it is critical to develop a robust method that can estimate, in real-time, the properties of the underlying stochastic process, in particular, the spectral brain connectivity measures. For these reasons we propose a new smooth online parameter estimation approach (SOPE) that has the ability to control for the smoothness of the estimates with a reasonable computational complexity. Consequently, the models are fit in real-time even for high dimensional time series. We demonstrate that our proposed SOPE approach is as good as the Kalman filter in terms of mean-squared error for small dimensions. However, unlike the Kalman filter, the SOPE has lower computational cost and hence scalable for higher dimensions. Finally, we apply the SOPE method to a rat's local field potential data during a hippocampus-dependent sequence-memory task. As demonstrated in the video, the proposed SOPE method is able to capture the dynamics of the connectivity as the rat performs the sequence of non-spatial working memory tasks. △ Less

Submitted 5 March, 2022; v1 submitted 24 February, 2021; originally announced February 2021.

arXiv:2102.11971 [pdf, other]

Brain Waves Analysis Via a Non-parametric Bayesian Mixture of Autoregressive Kernels

Authors: Guillermo Granados-Garcia, Mark Fiecas, Babak Shahbaba, Norbert Fortin, Hernando Ombao

Abstract: The standard approach to analyzing brain electrical activity is to examine the spectral density function (SDF) and identify predefined frequency bands that have the most substantial relative contributions to the overall variance of the signal. However, a limitation of this approach is that the precise frequency and bandwidth of oscillations vary with cognitive demands. Thus they should not be arbi… ▽ More The standard approach to analyzing brain electrical activity is to examine the spectral density function (SDF) and identify predefined frequency bands that have the most substantial relative contributions to the overall variance of the signal. However, a limitation of this approach is that the precise frequency and bandwidth of oscillations vary with cognitive demands. Thus they should not be arbitrarily defined a priori in an experiment. In this paper, we develop a data-driven approach that identifies (i) the number of prominent peaks, (ii) the frequency peak locations, and (iii) their corresponding bandwidths (or spread of power around the peaks). We propose a Bayesian mixture auto-regressive decomposition method (BMARD), which represents the standardized SDFas a Dirichlet process mixture based on a kernel derived from second-order auto-regressive processes which completely characterize the location (peak)and scale (bandwidth) parameters. We present a Metropolis-Hastings within Gibbs algorithm to sample from the posterior distribution of the mixture parameters. Simulation studies demonstrate the robustness and performance of the BMARD method. Finally, we use the proposed BMARD method to analyze local field potential (LFP) activity from the hippocampus of laboratory rats across different conditions in a non-spatial sequence memory experiment to identify the most interesting frequency bands and examine the link between specific patterns of activity and trial-specific cognitive demands. △ Less

Submitted 25 March, 2021; v1 submitted 23 February, 2021; originally announced February 2021.

Comments: 21 pages, 7 Figures, 3 tables

arXiv:2102.10331 [pdf, other]

Separating Stimulus-Induced and Background Components of Dynamic Functional Connectivity in Naturalistic fMRI

Authors: Chee-Ming Ting, Jeremy I. Skipper, Steven L. Small, Hernando Ombao

Abstract: We consider the challenges in extracting stimulus-related neural dynamics from other intrinsic processes and noise in naturalistic functional magnetic resonance imaging (fMRI). Most studies rely on inter-subject correlations (ISC) of low-level regional activity and neglect varying responses in individuals. We propose a novel, data-driven approach based on low-rank plus sparse (L+S) decomposition t… ▽ More We consider the challenges in extracting stimulus-related neural dynamics from other intrinsic processes and noise in naturalistic functional magnetic resonance imaging (fMRI). Most studies rely on inter-subject correlations (ISC) of low-level regional activity and neglect varying responses in individuals. We propose a novel, data-driven approach based on low-rank plus sparse (L+S) decomposition to isolate stimulus-driven dynamic changes in brain functional connectivity (FC) from the background noise, by exploiting shared network structure among subjects receiving the same naturalistic stimuli. The time-resolved multi-subject FC matrices are modeled as a sum of a low-rank component of correlated FC patterns across subjects, and a sparse component of subject-specific, idiosyncratic background activities. To recover the shared low-rank subspace, we introduce a fused version of principal component pursuit (PCP) by adding a fusion-type penalty on the differences between the rows of the low-rank matrix. The method improves the detection of stimulus-induced group-level homogeneity in the FC profile while capturing inter-subject variability. We develop an efficient algorithm via a linearized alternating direction method of multipliers to solve the fused-PCP. Simulations show accurate recovery by the fused-PCP even when a large fraction of FC edges are severely corrupted. When applied to natural fMRI data, our method reveals FC changes that were time-locked to auditory processing during movie watching, with dynamic engagement of sensorimotor systems for speech-in-noise. It also provides a better mapping to auditory content in the movie than ISC. △ Less

Submitted 24 January, 2021; originally announced February 2021.

Comments: Main paper: 10 pages, 8 figures. Supplemental file: 3 pages

arXiv:2101.09352 [pdf, other]

Conex-Connect: Learning Patterns in Extremal Brain Connectivity From Multi-Channel EEG Data

Authors: Matheus B. Guerrero, Raphaël Huser, Hernando Ombao

Abstract: Epilepsy is a chronic neurological disorder affecting more than 50 million people globally. An epileptic seizure acts like a temporary shock to the neuronal system, disrupting normal electrical activity in the brain. Epilepsy is frequently diagnosed with electroencephalograms (EEGs). Current methods study the time-varying spectra and coherence but do not directly model changes in extreme behavior.… ▽ More Epilepsy is a chronic neurological disorder affecting more than 50 million people globally. An epileptic seizure acts like a temporary shock to the neuronal system, disrupting normal electrical activity in the brain. Epilepsy is frequently diagnosed with electroencephalograms (EEGs). Current methods study the time-varying spectra and coherence but do not directly model changes in extreme behavior. Thus, we propose a new approach to characterize brain connectivity based on the joint tail behavior of the EEGs. Our proposed method, the conditional extremal dependence for brain connectivity (Conex-Connect), is a pioneering approach that links the association between extreme values of higher oscillations at a reference channel with the other brain network channels. Using the Conex-Connect method, we discover changes in the extremal dependence driven by the activity at the foci of the epileptic seizure. Our model-based approach reveals that, pre-seizure, the dependence is notably stable for all channels when conditioning on extreme values of the focal seizure area. Post-seizure, by contrast, the dependence between channels is weaker, and dependence patterns are more "chaotic". Moreover, in terms of spectral decomposition, we find that high values of the high-frequency Gamma-band are the most relevant features to explain the conditional extremal dependence of brain connectivity. △ Less

Submitted 3 January, 2021; originally announced January 2021.

arXiv:2101.04334 [pdf, other]

Change-point detection using spectral PCA for multivariate time series

Authors: Shuhao Jiao, Tong Shen, Zhaoxia Yu, Hernando Ombao

Abstract: We propose a two-stage approach Spec PC-CP to identify change points in multivariate time series. In the first stage, we obtain a low-dimensional summary of the high-dimensional time series by Spectral Principal Component Analysis (Spec-PCA). In the second stage, we apply cumulative sum-type test on the Spectral PCA component using a binary segmentation algorithm. Compared with existing approaches… ▽ More We propose a two-stage approach Spec PC-CP to identify change points in multivariate time series. In the first stage, we obtain a low-dimensional summary of the high-dimensional time series by Spectral Principal Component Analysis (Spec-PCA). In the second stage, we apply cumulative sum-type test on the Spectral PCA component using a binary segmentation algorithm. Compared with existing approaches, the proposed method is able to capture the lead-lag relationship in time series. Our simulations demonstrate that the Spec PC-CP method performs significantly better than competing methods for detecting change points in high-dimensional time series. The results on epileptic seizure EEG data and stock data also indicate that our new method can efficiently {detect} change points corresponding to the onset of the underlying events. △ Less

Submitted 12 January, 2021; originally announced January 2021.

arXiv:2011.08799 [pdf, other]

Flexible Bivariate INGARCH Process With a Broad Range of Contemporaneous Correlation

Authors: Luiza S. C. Piancastelli, Wagner Barreto-Souza, Hernando Ombao

Abstract: We propose a novel flexible bivariate conditional Poisson (BCP) INteger-valued Generalized AutoRegressive Conditional Heteroscedastic (INGARCH) model for correlated count time series data. Our proposed BCP-INGARCH model is mathematically tractable and has as the main advantage over existing bivariate INGARCH models its ability to capture a broad range (both negative and positive) of contemporaneou… ▽ More We propose a novel flexible bivariate conditional Poisson (BCP) INteger-valued Generalized AutoRegressive Conditional Heteroscedastic (INGARCH) model for correlated count time series data. Our proposed BCP-INGARCH model is mathematically tractable and has as the main advantage over existing bivariate INGARCH models its ability to capture a broad range (both negative and positive) of contemporaneous cross-correlation which is a non-trivial advancement. Properties of stationarity and ergodicity for the BCP-INGARCH process are developed. Estimation of the parameters is performed through conditional maximum likelihood (CML) and finite sample behavior of the estimators are investigated through simulation studies. Asymptotic properties of the CML estimators are derived. Additional simulation studies compare and contrast methods of obtaining standard errors of the parameter estimates, where a bootstrap option is demonstrated to be advantageous. Hypothesis testing methods for the presence of contemporaneous correlation between the time series are presented and evaluated. We apply our methodology to monthly counts of hepatitis cases at two nearby Brazilian cities, which are highly cross-correlated. The data analysis demonstrates the importance of considering a bivariate model allowing for a wide range of contemporaneous correlation in real-life applications. △ Less

Submitted 17 November, 2020; originally announced November 2020.

Comments: Paper submitted for publication

arXiv:2010.13458 [pdf, other]

Structural Brain Asymmetries in Youths with Combined and Inattentive Presentations of Attention Deficit Hyperactivity Disorder

Authors: Cintya Nirvana Dutta, Pamela K. Douglas, Hernando Ombao

Abstract: Alterations in structural brain laterality are reported in attention-deficit/hyperactivity disorder (ADHD). However, few studies examined differences within presentations of ADHD. We investigate asymmetry index (AI) across 13 subcortical and 33 cortical regions from anatomical metrics of volume, surface area, and thickness. Structural T1-weighted MRI data were obtained from youths with inattentive… ▽ More Alterations in structural brain laterality are reported in attention-deficit/hyperactivity disorder (ADHD). However, few studies examined differences within presentations of ADHD. We investigate asymmetry index (AI) across 13 subcortical and 33 cortical regions from anatomical metrics of volume, surface area, and thickness. Structural T1-weighted MRI data were obtained from youths with inattentive (n = 64) and combined (n = 51) presentations, and aged-matched controls (n = 298). We used a linear mixed effect model that accounts for data site heterogeneity, while studying associations between AI and covariates of presentation and age. Our paper contributes to the functional results seen among ADHD presentations evidencing disrupted connectivity in motor networks from ADHD-C and cingulo-frontal networks from ADHD-I, as well as new findings in the temporal cortex and default mode subnetworks. Age patterns of structural asymmetries vary with presentation type. Linear mixed effects model is a practical tool for characterizing associations between brain asymmetries, diagnosis, and neurodevelopment. △ Less

Submitted 26 October, 2020; originally announced October 2020.

Comments: 5 pages, 3 figures, 1 table, submitted to ISBI conference

arXiv:2007.14078 [pdf, other]

Clustering Brain Signals: A Robust Approach Using Functional Data Ranking

Authors: Tianbo Chen, Ying Sun, Carolina Euan, Hernando Ombao

Abstract: In this paper, we analyze electroencephalograms (EEG) which are recordings of brain electrical activity. We develop new clustering methods for identifying synchronized brain regions, where the EEGs show similar oscillations or waveforms according to their spectral densities. We treat the estimated spectral densities from many epochs or trials as functional data and develop clustering algorithms ba… ▽ More In this paper, we analyze electroencephalograms (EEG) which are recordings of brain electrical activity. We develop new clustering methods for identifying synchronized brain regions, where the EEGs show similar oscillations or waveforms according to their spectral densities. We treat the estimated spectral densities from many epochs or trials as functional data and develop clustering algorithms based on functional data ranking. The two proposed clustering algorithms use different dissimilarity measures: distance of the functional medians and the area of the central region. The performance of the proposed algorithms is examined by simulation studies. We show that, when contaminations are present, the proposed methods for clustering spectral densities are more robust than the mean-based methods. The developed methods are applied to two stages of resting state EEG data from a male college student, corresponding to early exploration of functional connectivity in the human brain. △ Less

Submitted 28 July, 2020; originally announced July 2020.

arXiv:2007.00437 [pdf, other]

Levels and trends in the sex ratio at birth in seven provinces of Nepal between 1980 and 2016 with probabilistic projections to 2050: a Bayesian modeling approach

Authors: Fengqing Chao, Samir KC, Hernando Ombao

Abstract: The sex ratio at birth (SRB; ratio of male to female births) in Nepal has been reported without imbalance on the national level. However, the national SRB could mask the disparity within the country. Given the demographic and cultural heterogeneities in Nepal, it is crucial to model Nepal SRB on the subnational level. Prior studies on subnational SRB in Nepal are mostly based on reporting observed… ▽ More The sex ratio at birth (SRB; ratio of male to female births) in Nepal has been reported without imbalance on the national level. However, the national SRB could mask the disparity within the country. Given the demographic and cultural heterogeneities in Nepal, it is crucial to model Nepal SRB on the subnational level. Prior studies on subnational SRB in Nepal are mostly based on reporting observed values from surveys and census, and no study has provided probabilistic projections. We aim to estimate and project SRB for the seven provinces of Nepal from 1980 to 2050 using a Bayesian modeling approach. We compiled an extensive database on provincial SRB of Nepal, consisting 2001, 2006, 2011, and 2016 Nepal Demographic and Health Surveys and 2011 Census. We adopted a Bayesian hierarchical time series model to estimate and project the provincial SRB, with a focus on modelling the potential SRB imbalance. In 2016, the highest SRB is estimated in Province 5 at 1.102 with a 95% credible interval (1.044, 1.127) and the lowest SRB is in Province 2 at 1.053 (1.035, 1.109). The SRB imbalance probabilities in all provinces are generally low and vary from 16% in Province 2 to 81% in Province 5. SRB imbalances are estimated to have begun at the earliest in 2001 in Province 5 with a 95% credible interval (1992, 2022) and the latest in 2017 (1998, 2040) in Province 2. We project SRB in all provinces to begin converging back to the national baseline in the mid-2030s. Our findings imply that the majority of provinces in Nepal have low risks of SRB imbalance for the period 1980-2016. However, we identify a few provinces with higher probabilities of having SRB inflation. The projected SRB is an important illustration of potential future prenatal sex discrimination and shows the need to monitor SRB in provinces with higher possibilities of SRB imbalance. △ Less

Submitted 30 August, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

MSC Class: 62P25 (Primary) 91D20; 62F15; 62M10 (Secondary)

Journal ref: BMC Public Health 2022, Vol. 22, No. 1, 358

arXiv:2006.13887 [pdf, other]

doi 10.1111/sjos.12589

Break Point Detection for Functional Covariance

Authors: Shuhao Jiao, Ron D. Frostig, Hernando Ombao

Abstract: Many experiments record sequential trajectories where each trajectory consists of oscillations and fluctuations around zero. Such trajectories can be viewed as zero-mean functional data. When there are structural breaks (on the sequence of trajectories) in higher order moments, it is not always easy to spot these by mere visual inspection. Motivated by this challenging problem in brain signal anal… ▽ More Many experiments record sequential trajectories where each trajectory consists of oscillations and fluctuations around zero. Such trajectories can be viewed as zero-mean functional data. When there are structural breaks (on the sequence of trajectories) in higher order moments, it is not always easy to spot these by mere visual inspection. Motivated by this challenging problem in brain signal analysis, we propose a detection and testing procedure to find the change point in functional covariance. The detection procedure is based on the cumulative sum statistics (CUSUM). The classical testing procedure for functional data depends on a null distribution which depends on infinitely many unknown parameters, though in practice only a finite number of these can be included for the hypothesis test of the existence of change point. This paper provides some theoretical insights on the influence of the number of parameters. Meanwhile, the asymptotic properties of the estimated change point are developed. The effectiveness of the proposed method is numerically validated in simulation studies and an application to investigate changes in rat brain signals following an experimentally-induced stroke. △ Less

Submitted 4 February, 2022; v1 submitted 24 June, 2020; originally announced June 2020.

arXiv:2005.09440 [pdf, other]

Multiscale modelling of replicated nonstationary time series

Authors: Jonathan Embleton, Marina I. Knight, Hernando Ombao

Abstract: Within the neurosciences, to observe variability across time in the dynamics of an underlying brain process is neither new nor unexpected. Wavelets are essential in analyzing brain signals because, even within a single trial, brain signals exhibit nonstationary behaviour. However, neurological signals generated within an experiment may also potentially exhibit evolution across trials (replicates).… ▽ More Within the neurosciences, to observe variability across time in the dynamics of an underlying brain process is neither new nor unexpected. Wavelets are essential in analyzing brain signals because, even within a single trial, brain signals exhibit nonstationary behaviour. However, neurological signals generated within an experiment may also potentially exhibit evolution across trials (replicates). As neurologists consider localised spectra of brain signals to be most informative, here we develop a novel wavelet-based tool capable to formally represent process nonstationarities across both time and replicate dimensions. Specifically, we propose the Replicate Locally Stationary Wavelet (RLSW) process, that captures the potential nonstationary behaviour within and across trials. Estimation using wavelets gives a natural desired time- and replicate-localisation of the process dynamics. We develop the associated spectral estimation framework and establish its asymptotic properties. By means of thorough simulation studies, we demonstrate the theoretical estimator properties hold in practice. A real data investigation into the evolutionary dynamics of the hippocampus and nucleus accumbens during an associative learning experiment, demonstrate the applicability of our proposed methodology, as well as the new insights it provides. △ Less

Submitted 19 May, 2020; originally announced May 2020.

Comments: 24 pages and 13 figures (main paper), supplementary material included

arXiv:2004.11470 [pdf, other]

doi 10.1016/j.ijforecast.2020.12.007

Semiparametric time series models driven by latent factor

Authors: Gisele O. Maia, Wagner Barreto-Souza, Fernando S. Bastos, Hernando Ombao

Abstract: We introduce a class of semiparametric time series models by assuming a quasi-likelihood approach driven by a latent factor process. More specifically, given the latent process, we only specify the conditional mean and variance of the time series and enjoy a quasi-likelihood function for estimating parameters related to the mean. This proposed methodology has three remarkable features: (i) no para… ▽ More We introduce a class of semiparametric time series models by assuming a quasi-likelihood approach driven by a latent factor process. More specifically, given the latent process, we only specify the conditional mean and variance of the time series and enjoy a quasi-likelihood function for estimating parameters related to the mean. This proposed methodology has three remarkable features: (i) no parametric form is assumed for the conditional distribution of the time series given the latent process; (ii) able for modelling non-negative, count, bounded/binary and real-valued time series; (iii) dispersion parameter is not assumed to be known. Further, we obtain explicit expressions for the marginal moments and for the autocorrelation function of the time series process so that a method of moments can be employed for estimating the dispersion parameter and also parameters related to the latent process. Simulated results aiming to check the proposed estimation procedure are presented. Real data analysis on unemployment rate and precipitation time series illustrate the potencial for practice of our methodology. △ Less

Submitted 23 April, 2020; originally announced April 2020.

Journal ref: International Journal of Forecasting (2021)

arXiv:2004.08667 [pdf, other]

Integer-valued autoregressive process with flexible marginal and innovation distributions

Authors: Matheus B. Guerrero, Wagner Barreto-Souza, Hernando Ombao

Abstract: INteger Auto-Regressive (INAR) processes are usually defined by specifying the innovations and the operator, which often leads to difficulties in deriving marginal properties of the process. In many practical situations, a major modeling limitation is that it is difficult to justify the choice of the operator. To overcome these drawbacks, we propose a new flexible approach to build an INAR model:… ▽ More INteger Auto-Regressive (INAR) processes are usually defined by specifying the innovations and the operator, which often leads to difficulties in deriving marginal properties of the process. In many practical situations, a major modeling limitation is that it is difficult to justify the choice of the operator. To overcome these drawbacks, we propose a new flexible approach to build an INAR model: we pre-specify the marginal and innovation distributions. Hence, the operator is a consequence of specifying the desired marginal and innovation distributions. Our new INAR model has both marginal and innovations geometric distributed, being a direct alternative to the classical Poisson INAR model. Our proposed process has interesting stochastic properties such as an MA($\infty$) representation, time-reversibility, and closed-forms for the transition probabilities $h$-steps ahead, allowing for coherent forecasting. We analyze time-series counts of skin lesions using our proposed approach, comparing it with existing INAR and INGARCH models. Our model gives more adherence to the data and better forecasting performance. △ Less

Submitted 18 April, 2020; originally announced April 2020.

arXiv:2004.04362 [pdf, other]

doi 10.1109/TMI.2020.3030047

Detecting Dynamic Community Structure in Functional Brain Networks Across Individuals: A Multilayer Approach

Authors: Chee-Ming Ting, S. Balqis Samdin, Meini Tang, Hernando Ombao

Abstract: We present a unified statistical framework for characterizing community structure of brain functional networks that captures variation across individuals and evolution over time. Existing methods for community detection focus only on single-subject analysis of dynamic networks; while recent extensions to multiple-subjects analysis are limited to static networks. To overcome these limitations, we p… ▽ More We present a unified statistical framework for characterizing community structure of brain functional networks that captures variation across individuals and evolution over time. Existing methods for community detection focus only on single-subject analysis of dynamic networks; while recent extensions to multiple-subjects analysis are limited to static networks. To overcome these limitations, we propose a multi-subject, Markov-switching stochastic block model (MSS-SBM) to identify state-related changes in brain community organization over a group of individuals. We first formulate a multilayer extension of SBM to describe the time-dependent, multi-subject brain networks. We develop a novel procedure for fitting the multilayer SBM that builds on multislice modularity maximization which can uncover a common community partition of all layers (subjects) simultaneously. By augmenting with a dynamic Markov switching process, our proposed method is able to capture a set of distinct, recurring temporal states with respect to inter-community interactions over subjects and the change points between them. Simulation shows accurate community recovery and tracking of dynamic community regimes over multilayer networks by the MSS-SBM. Application to task fMRI reveals meaningful non-assortative brain community motifs, e.g., core-periphery structure at the group level, that are associated with language comprehension and motor functions suggesting their putative role in complex information integration. Our approach detected dynamic reconfiguration of modular connectivity elicited by varying task demands and identified unique profiles of intra and inter-community connectivity across different task conditions. The proposed multilayer network representation provides a principled way of detecting synchronous, dynamic modularity in brain networks across subjects. △ Less

Submitted 16 October, 2020; v1 submitted 9 April, 2020; originally announced April 2020.

Comments: Main paper: 12 pages, 13 figures. Supplemental file: 16 pages. Accepted for IEEE Trans Medical Imaging

Journal ref: IEEE Trans Medical Imaging, vol. 40, no. 2 (2021) 468 - 480

arXiv:2004.02228 [pdf, other]

doi 10.1371/journal.pone.0236673

Probabilistic Projection of the Sex Ratio at Birth and Missing Female Births by State and Union Territory in India

Authors: Fengqing Chao, Christophe Z. Guilmoto, Samir K. C., Hernando Ombao

Abstract: The sex ratio at birth (SRB) in India has been reported imbalanced since the 1970s. Previous studies have shown a great variation in the SRB across geographic locations in India till 2016. As one of the most populous countries and in view of its great regional heterogeneity, it is crucial to produce probabilistic projections for the SRB in India at state level for the purpose of population project… ▽ More The sex ratio at birth (SRB) in India has been reported imbalanced since the 1970s. Previous studies have shown a great variation in the SRB across geographic locations in India till 2016. As one of the most populous countries and in view of its great regional heterogeneity, it is crucial to produce probabilistic projections for the SRB in India at state level for the purpose of population projection and policy planning. In this paper, we implement a Bayesian hierarchical time series model to project SRB in India by state. We generate SRB probabilistic projections from 2017 to 2030 for 29 States and Union Territories (UTs) in India, and present results in 21 States/UTs with data from the Sample Registration System. Our analysis takes into account two state-specific factors that contribute to sex-selective abortion and resulting sex imbalances at birth: intensity of son preference and fertility squeeze. We project that the largest contribution to female births deficits is in Uttar Pradesh, with cumulative number of missing female births projected to be 2.0 (95% credible interval [1.9; 2.2]) million from 2017 to 2030. The total female birth deficits during 2017-2030 for the whole India is projected to be 6.8 [6.6; 7.0] million. △ Less

Submitted 5 April, 2020; originally announced April 2020.

MSC Class: 62P25 (Primary) 91D20 (Secondary)

Journal ref: PLoS ONE 2020, Vol. 15, No. 8, e0236673

Showing 1–50 of 85 results for author: Ombao, H