-
Topological Analysis of Seizure-Induced Changes in Brain Hierarchy Through Effective Connectivity
Authors:
Anass B. El-Yaagoubi,
Moo K. Chung,
Hernando Ombao
Abstract:
Traditional Topological Data Analysis (TDA) methods, such as Persistent Homology (PH), rely on distance measures (e.g., cross-correlation, partial correlation, coherence, and partial coherence) that are symmetric by definition. While useful for studying topological patterns in functional brain connectivity, the main limitation of these methods is their inability to capture the directional dynamics…
▽ More
Traditional Topological Data Analysis (TDA) methods, such as Persistent Homology (PH), rely on distance measures (e.g., cross-correlation, partial correlation, coherence, and partial coherence) that are symmetric by definition. While useful for studying topological patterns in functional brain connectivity, the main limitation of these methods is their inability to capture the directional dynamics - which is crucial for understanding effective brain connectivity. We propose the Causality-Based Topological Ranking (CBTR) method, which integrates Causal Inference (CI) to assess effective brain connectivity with Hodge Decomposition (HD) to rank brain regions based on their mutual influence. Our simulations confirm that the CBTR method accurately and consistently identifies hierarchical structures in multivariate time series data. Moreover, this method effectively identifies brain regions showing the most significant interaction changes with other regions during seizures using electroencephalogram (EEG) data. These results provide novel insights into the brain's hierarchical organization and illuminate the impact of seizures on its dynamics.
△ Less
Submitted 18 July, 2024;
originally announced July 2024.
-
A Practical Approach for Exploring Granger Connectivity in High-Dimensional Networks of Time Series
Authors:
Sipan Aslan,
Hernando Ombao
Abstract:
This manuscript presents a novel method for discovering effective connectivity between specified pairs of nodes in a high-dimensional network of time series. To accurately perform Granger causality analysis from the first node to the second node, it is essential to eliminate the influence of all other nodes within the network. The approach proposed is to create a low-dimensional representation of…
▽ More
This manuscript presents a novel method for discovering effective connectivity between specified pairs of nodes in a high-dimensional network of time series. To accurately perform Granger causality analysis from the first node to the second node, it is essential to eliminate the influence of all other nodes within the network. The approach proposed is to create a low-dimensional representation of all other nodes in the network using frequency-domain-based dynamic principal component analysis (spectral DPCA). The resulting scores are subsequently removed from the first and second nodes of interest, thus eliminating the confounding effect of other nodes within the high-dimensional network. To conduct hypothesis testing on Granger causality, we propose a permutation-based causality test. This test enhances the accuracy of our findings when the error structures are non-Gaussian. The approach has been validated in extensive simulation studies, which demonstrate the efficacy of the methodology as a tool for causality analysis in complex time series networks. The proposed methodology has also been demonstrated to be both expedient and viable on real datasets, with particular success observed on multichannel EEG networks.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
Statistics of Extremes for Neuroscience
Authors:
Paolo V. Redondo,
Matheus B. Guerrero,
Raphaël Huser,
Hernando Ombao
Abstract:
This chapter illustrates how tools from univariate and multivariate statistics of extremes can complement classical methods used to study brain signals and enhance the understanding of brain activity and connectivity during specific cognitive tasks or abnormal episodes, such as an epileptic seizure.
This chapter illustrates how tools from univariate and multivariate statistics of extremes can complement classical methods used to study brain signals and enhance the understanding of brain activity and connectivity during specific cognitive tasks or abnormal episodes, such as an epileptic seizure.
△ Less
Submitted 14 April, 2024;
originally announced April 2024.
-
Dynamic MRI reconstruction using low-rank plus sparse decomposition with smoothness regularization
Authors:
Chee-Ming Ting,
Fuad Noman,
Raphaël C. -W. Phan,
Hernando Ombao
Abstract:
The low-rank plus sparse (L+S) decomposition model has enabled better reconstruction of dynamic magnetic resonance imaging (dMRI) with separation into background (L) and dynamic (S) component. However, use of low-rank prior alone may not fully explain the slow variations or smoothness of the background part at the local scale. In this paper, we propose a smoothness-regularized L+S (SR-L+S) model f…
▽ More
The low-rank plus sparse (L+S) decomposition model has enabled better reconstruction of dynamic magnetic resonance imaging (dMRI) with separation into background (L) and dynamic (S) component. However, use of low-rank prior alone may not fully explain the slow variations or smoothness of the background part at the local scale. In this paper, we propose a smoothness-regularized L+S (SR-L+S) model for dMRI reconstruction from highly undersampled k-t-space data. We exploit joint low-rank and smooth priors on the background component of dMRI to better capture both its global and local temporal correlated structures. Extending the L+S formulation, the low-rank property is encoded by the nuclear norm, while the smoothness by a general \ell_{p}-norm penalty on the local differences of the columns of L. The additional smoothness regularizer can promote piecewise local consistency between neighboring frames. By smoothing out the noise and dynamic activities, it allows accurate recovery of the background part, and subsequently more robust dMRI reconstruction. Extensive experiments on multi-coil cardiac and synthetic data shows that the SR-L+S model outp
△ Less
Submitted 30 January, 2024;
originally announced January 2024.
-
Spectral Topological Data Analysis of Brain Signals
Authors:
Anass B. El-Yaagoubi,
Shuhao Jiao,
Moo K. Chung,
Hernando Ombao
Abstract:
Topological data analysis (TDA) has become a powerful approach over the last twenty years, mainly due to its ability to capture the shape and the geometry inherent in the data. Persistence homology, which is a particular tool in TDA, has been demonstrated to be successful in analyzing functional brain connectivity. One limitation of standard approaches is that they use arbitrarily chosen threshold…
▽ More
Topological data analysis (TDA) has become a powerful approach over the last twenty years, mainly due to its ability to capture the shape and the geometry inherent in the data. Persistence homology, which is a particular tool in TDA, has been demonstrated to be successful in analyzing functional brain connectivity. One limitation of standard approaches is that they use arbitrarily chosen threshold values for analyzing connectivity matrices. To overcome this weakness, TDA provides a filtration of the weighted brain network across a range of threshold values. However, current analyses of the topological structure of functional brain connectivity primarily rely on overly simplistic connectivity measures, such as the Pearson orrelation. These measures do not provide information about the specific oscillators that drive dependence within the brain network. Here, we develop a frequency-specific approach that utilizes coherence, a measure of dependence in the spectral domain, to evaluate the functional connectivity of the brain. Our approach, the spectral TDA (STDA), has the ability to capture more nuanced and detailed information about the underlying brain networks. The proposed STDA method leads to a novel topological summary, the spectral landscape, which is a 2D-generalization of the persistence landscape. Using the novel spectral landscape, we analyze the EEG brain connectivity of patients with attention deficit hyperactivity disorder (ADHD) and shed light on the frequency-specific differences in the topology of brain connectivity between the controls and ADHD patients.
△ Less
Submitted 1 December, 2023;
originally announced January 2024.
-
Statistical Inference for Modulation Index in Phase-Amplitude Coupling
Authors:
Marco Antonio Pinto-Orellana,
Hernando Ombao,
Beth Lopour
Abstract:
Phase-amplitude coupling is a phenomenon observed in several neurological processes, where the phase of one signal modulates the amplitude of another signal with a distinct frequency. The modulation index (MI) is a common technique used to quantify this interaction by assessing the Kullback-Leibler divergence between a uniform distribution and the empirical conditional distribution of amplitudes w…
▽ More
Phase-amplitude coupling is a phenomenon observed in several neurological processes, where the phase of one signal modulates the amplitude of another signal with a distinct frequency. The modulation index (MI) is a common technique used to quantify this interaction by assessing the Kullback-Leibler divergence between a uniform distribution and the empirical conditional distribution of amplitudes with respect to the phases of the observed signals. The uniform distribution is an ideal representation that is expected to appear under the absence of coupling. However, it does not reflect the statistical properties of coupling values caused by random chance. In this paper, we propose a statistical framework for evaluating the significance of an observed MI value based on a null hypothesis that a MI value can be entirely explained by chance. Significance is obtained by comparing the value with a reference distribution derived under the null hypothesis of independence (i.e., no coupling) between signals. We derived a closed-form distribution of this null model, resulting in a scaled beta distribution. To validate the efficacy of our proposed framework, we conducted comprehensive Monte Carlo simulations, assessing the significance of MI values under various experimental scenarios, including amplitude modulation, trains of spikes, and sequences of high-frequency oscillations. Furthermore, we corroborated the reliability of our model by comparing its statistical significance thresholds with reported values from other research studies conducted under different experimental settings. Our method offers several advantages such as meta-analysis reliability, simplicity and computational efficiency, as it provides p-values and significance levels without resorting to generating surrogate data through sampling procedures.
△ Less
Submitted 9 October, 2023;
originally announced October 2023.
-
Stylized Projected GAN: A Novel Architecture for Fast and Realistic Image Generation
Authors:
Md Nurul Muttakin,
Malik Shahid Sultan,
Robert Hoehndorf,
Hernando Ombao
Abstract:
Generative Adversarial Networks are used for generating the data using a generator and a discriminator, GANs usually produce high-quality images, but training GANs in an adversarial setting is a difficult task. GANs require high computation power and hyper-parameter regularization for converging. Projected GANs tackle the training difficulty of GANs by using transfer learning to project the genera…
▽ More
Generative Adversarial Networks are used for generating the data using a generator and a discriminator, GANs usually produce high-quality images, but training GANs in an adversarial setting is a difficult task. GANs require high computation power and hyper-parameter regularization for converging. Projected GANs tackle the training difficulty of GANs by using transfer learning to project the generated and real samples into a pre-trained feature space. Projected GANs improve the training time and convergence but produce artifacts in the generated images which reduce the quality of the generated samples, we propose an optimized architecture called Stylized Projected GANs which integrates the mapping network of the Style GANs with Skip Layer Excitation of Fast GAN. The integrated modules are incorporated within the generator architecture of the Fast GAN to mitigate the problem of artifacts in the generated images.
△ Less
Submitted 30 July, 2023;
originally announced July 2023.
-
Topological Data Analysis for Directed Dependence Networks of Multivariate Time Series Data
Authors:
Anass B. El-Yaagoubi,
Hernando Ombao
Abstract:
Topological data analysis (TDA) approaches are becoming increasingly popular for studying the dependence patterns in multivariate time series data. In particular, various dependence patterns in brain networks may be linked to specific tasks and cognitive processes, which can be altered by various neurological impairments such as epileptic seizures. Existing TDA approaches rely on the notion of dis…
▽ More
Topological data analysis (TDA) approaches are becoming increasingly popular for studying the dependence patterns in multivariate time series data. In particular, various dependence patterns in brain networks may be linked to specific tasks and cognitive processes, which can be altered by various neurological impairments such as epileptic seizures. Existing TDA approaches rely on the notion of distance between data points that is symmetric by definition for building graph filtrations. For brain dependence networks, this is a major limitation that constrains practitioners to using only symmetric dependence measures, such as correlations or coherence. However, it is known that the brain dependence network may be very complex and can contain a directed flow of information from one brain region to another. Such dependence networks are usually captured by more advanced measures of dependence such as partial directed coherence, which is a Granger causality based dependence measure. These dependence measures will result in a non-symmetric distance function, especially during epileptic seizures. In this paper we propose to solve this limitation by decomposing the weighted connectivity network into its symmetric and anti-symmetric components using matrix decomposition and comparing the anti-symmetric component prior to and post seizure. Our analysis of epileptic seizure EEG data shows promising results.
△ Less
Submitted 13 June, 2023;
originally announced June 2023.
-
An MCMC Approach to Bayesian Image Analysis in Fourier Space
Authors:
Konstantinos Bakas,
John Kornak,
Hernando Ombao
Abstract:
Bayesian methods are commonly applied to solve image analysis problems such as noise-reduction, feature enhancement and object detection. A primary limitation of these approaches is the computational complexity due to the interdependence of neighboring pixels which limits the ability to perform full posterior sampling through Markov chain Monte Carlo (MCMC). To alleviate this problem, we develop a…
▽ More
Bayesian methods are commonly applied to solve image analysis problems such as noise-reduction, feature enhancement and object detection. A primary limitation of these approaches is the computational complexity due to the interdependence of neighboring pixels which limits the ability to perform full posterior sampling through Markov chain Monte Carlo (MCMC). To alleviate this problem, we develop a new posterior sampling method that is based on modeling the prior and likelihood in the space of the Fourier transform of the image. One advantage of Fourier-based methods is that many spatially correlated processes in image space can be represented via independent processes over Fourier space. A recent approach known as Bayesian Image Analysis in Fourier Space (or BIFS), has introduced parameter functions to describe prior expectations about image properties in Fourier space. To date BIFS has relied on Maximum a Posteriori (MAP) estimation for generating posterior estimates; providing just a single point estimate. The work presented here develops a posterior sampling approach for BIFS that can explore the full posterior distribution while continuing to take advantage of the independence modeling over Fourier space. As a result computational efficiency is improved over that for conventional Bayesian image analysis and mixing concerns that commonly have to be dealt with in high dimensional Markov chain Monte Carlo sampling problems are avoided. Implementation results and details are provided using simulated data.
△ Less
Submitted 30 May, 2023;
originally announced May 2023.
-
Multi-scale wavelet coherence with its applications
Authors:
Haibo Wu,
MI Knight,
H Ombao
Abstract:
The goal in this paper is to develop a novel statistical approach to characterize functional interactions between channels in a brain network. Wavelets are effective for capturing transient properties of non-stationary signals because they have compact support that can be compressed or stretched according to the dynamic properties of the signal. Wavelets give a multi-scale decomposition of signals…
▽ More
The goal in this paper is to develop a novel statistical approach to characterize functional interactions between channels in a brain network. Wavelets are effective for capturing transient properties of non-stationary signals because they have compact support that can be compressed or stretched according to the dynamic properties of the signal. Wavelets give a multi-scale decomposition of signals and thus can be few for studying potential cross-scale interactions between signals. To achieve this, we develop the scale-specific sub-processes of a multivariate locally stationary wavelet stochastic process. Under this proposed framework, a novel cross-scale dependence measure is developed. This provides a measure for dependence structure of components at different scales of multivariate time series. Extensive simulation studies are conducted to demonstrate that the theoretical properties hold in practice. The proposed cross-scale analysis is applied to the electroencephalogram (EEG) data to study alterations in the functional connectivity structure in children diagnosed with attention deficit hyperactivity disorder (ADHD). Our approach identified novel interesting cross-scale interactions between channels in the brain network. The proposed framework can be applied to other signals, which can also capture the statistical association between the stocks at different time scales.
△ Less
Submitted 18 May, 2023;
originally announced May 2023.
-
Bayesian Nonparametric Multivariate Mixture of Autoregressive Processes: With Application to Brain Signals
Authors:
Guillermo Granados-Garcia,
Raquel Prado,
Hernando Ombao
Abstract:
One of the goals of neuroscience is to study interactions between different brain regions during rest and while performing specific cognitive tasks. The Multivariate Bayesian Autoregressive Decomposition (MBMARD) is proposed as an intuitive and novel Bayesian non-parametric model to represent high-dimensional signals as a low-dimensional mixture of univariate uncorrelated latent oscillations. Each…
▽ More
One of the goals of neuroscience is to study interactions between different brain regions during rest and while performing specific cognitive tasks. The Multivariate Bayesian Autoregressive Decomposition (MBMARD) is proposed as an intuitive and novel Bayesian non-parametric model to represent high-dimensional signals as a low-dimensional mixture of univariate uncorrelated latent oscillations. Each latent oscillation captures a specific underlying oscillatory activity and hence will be modeled as a unique second-order autoregressive process due to a compelling property that its spectral density has a shape characterized by a unique frequency peak and bandwidth, which are parameterized by a location and a scale parameter. The posterior distributions of the parameters of the latent oscillations are computed via a metropolis-within-Gibbs algorithm. One of the advantages of MBMARD is its robustness against misspecification of standard models which is demonstrated in simulation studies. The main scientific questions addressed by MBMARD are the effects of long-term abuse of alcohol consumption on memory by analyzing EEG records of alcoholic and non-alcoholic subjects performing a visual recognition experiment. The MBMARD model exhibited novel interesting findings including identifying subject-specific clusters of low and high-frequency oscillations among different brain regions.
△ Less
Submitted 15 May, 2023;
originally announced May 2023.
-
Measuring Information Transfer Between Nodes in a Brain Network through Spectral Transfer Entropy
Authors:
Paolo Victor Redondo,
Raphael Huser,
Hernando Ombao
Abstract:
Brain connectivity reflects how different regions of the brain interact during performance of a cognitive task. In studying brain signals such as electroencephalograms (EEG), this may be explored via an information-theoretic causal measure, called transfer entropy (TE), which does not impose any distributional assumption on the variables and covers any form of relationship (beyond linear) between…
▽ More
Brain connectivity reflects how different regions of the brain interact during performance of a cognitive task. In studying brain signals such as electroencephalograms (EEG), this may be explored via an information-theoretic causal measure, called transfer entropy (TE), which does not impose any distributional assumption on the variables and covers any form of relationship (beyond linear) between them. To improve utility of TE in brain signal analysis, we propose a novel methodology to capture cross-channel information transfer in the frequency domain. Specifically, we introduce a new causal measure, the spectral transfer entropy (STE), to quantify the magnitude and direction of information flow from a certain frequency-band oscillation of a channel to an oscillation of another channel. In contrast with previous works on TE in the frequency domain, we differentiate our work by considering an extreme value perspective that employs the maximum magnitude of filtered series within time blocks. The main advantages of our proposed approach is that it is robust to the inherent problems of linear filtering and allows adjustments for multiple comparisons to control family-wise error rate (FWER). Another novel contribution is a simple yet efficient estimation method based on the combination vine copulas and extreme value theory that enables estimates to capture zero (boundary point) without the need for bias adjustments. With the vine copula representation, a null copula model, which exhibits zero STE, is defined, making significance testing for STE straightforward through a standard resampling approach. Lastly, we illustrate the advantage of our proposed measure through some numerical experiments and provide interesting and novel findings on the analysis of EEG recordings linked to a visual task.
△ Less
Submitted 25 May, 2023; v1 submitted 11 March, 2023;
originally announced March 2023.
-
An Improved Unbiased Particle Filter
Authors:
Ajay Jasra,
Mohamed Maama,
Hernando Ombao
Abstract:
In this paper we consider the filtering of partially observed multi-dimensional diffusion processes that are observed regularly at discrete times. We assume that, for numerical reasons, one has to time-discretize the diffusion process which typically leads to filtering that is subject to discretization bias. The approach in [16] establishes that when only having access to the time-discretized diff…
▽ More
In this paper we consider the filtering of partially observed multi-dimensional diffusion processes that are observed regularly at discrete times. We assume that, for numerical reasons, one has to time-discretize the diffusion process which typically leads to filtering that is subject to discretization bias. The approach in [16] establishes that when only having access to the time-discretized diffusion it is possible to remove the discretization bias with an estimator of finite variance. We improve on the method in [16] by introducing a modified estimator based on the recent work of [17]. We show that this new estimator is unbiased and has finite variance. Moreover, we conjecture and verify in numerical simulations that substantial gains are obtained. That is, for a given mean square error (MSE) and a particular class of multi-dimensional diffusion, the cost to achieve the said MSE falls.
△ Less
Submitted 20 February, 2023;
originally announced February 2023.
-
Antithetic Multilevel Particle Filters
Authors:
Ajay Jasra,
Mohamed Maama,
Hernando Ombao
Abstract:
In this paper we consider the filtering of partially observed multi-dimensional diffusion processes that are observed regularly at discrete times. This is a challenging problem which requires the use of advanced numerical schemes based upon time-discretization of the diffusion process and then the application of particle filters. Perhaps the state-of-the-art method for moderate dimensional problem…
▽ More
In this paper we consider the filtering of partially observed multi-dimensional diffusion processes that are observed regularly at discrete times. This is a challenging problem which requires the use of advanced numerical schemes based upon time-discretization of the diffusion process and then the application of particle filters. Perhaps the state-of-the-art method for moderate dimensional problems is the multilevel particle filter of \cite{mlpf}. This is a method that combines multilevel Monte Carlo and particle filters. The approach in that article is based intrinsically upon an Euler discretization method. We develop a new particle filter based upon the antithetic truncated Milstein scheme of \cite{ml_anti}. We show that for a class of diffusion problems, for $ε>0$ given, that the cost to produce a mean square error (MSE) in estimation of the filter, of $\mathcal{O}(ε^2)$ is $\mathcal{O}(ε^{-2}\log(ε)^2)$. In the case of multidimensional diffusions with non-constant diffusion coefficient, the method of \cite{mlpf} has a cost of $\mathcal{O}(ε^{-2.5})$ to achieve the same MSE. We support our theory with numerical results in several examples.
△ Less
Submitted 29 January, 2023;
originally announced January 2023.
-
Graph-Regularized Manifold-Aware Conditional Wasserstein GAN for Brain Functional Connectivity Generation
Authors:
Yee-Fan Tan,
Chee-Ming Ting,
Fuad Noman,
Raphaël C. -W. Phan,
Hernando Ombao
Abstract:
Common measures of brain functional connectivity (FC) including covariance and correlation matrices are semi-positive definite (SPD) matrices residing on a cone-shape Riemannian manifold. Despite its remarkable success for Euclidean-valued data generation, use of standard generative adversarial networks (GANs) to generate manifold-valued FC data neglects its inherent SPD structure and hence the in…
▽ More
Common measures of brain functional connectivity (FC) including covariance and correlation matrices are semi-positive definite (SPD) matrices residing on a cone-shape Riemannian manifold. Despite its remarkable success for Euclidean-valued data generation, use of standard generative adversarial networks (GANs) to generate manifold-valued FC data neglects its inherent SPD structure and hence the inter-relatedness of edges in real FC. We propose a novel graph-regularized manifold-aware conditional Wasserstein GAN (GR-SPD-GAN) for FC data generation on the SPD manifold that can preserve the global FC structure. Specifically, we optimize a generalized Wasserstein distance between the real and generated SPD data under an adversarial training, conditioned on the class labels. The resulting generator can synthesize new SPD-valued FC matrices associated with different classes of brain networks, e.g., brain disorder or healthy control. Furthermore, we introduce additional population graph-based regularization terms on both the SPD manifold and its tangent space to encourage the generator to respect the inter-subject similarity of FC patterns in the real data. This also helps in avoiding mode collapse and produces more stable GAN training. Evaluated on resting-state functional magnetic resonance imaging (fMRI) data of major depressive disorder (MDD), qualitative and quantitative results show that the proposed GR-SPD-GAN clearly outperforms several state-of-the-art GANs in generating more realistic fMRI-based FC samples. When applied to FC data augmentation for MDD identification, classification models trained on augmented data generated by our approach achieved the largest margin of improvement in classification accuracy among the competing GANs over baselines without data augmentation.
△ Less
Submitted 10 December, 2022;
originally announced December 2022.
-
Club Exco: clustering brain extreme communities from multi-channel EEG data
Authors:
Matheus B. Guerrero,
Hernando Ombao,
Raphaël Huser
Abstract:
Current methods for clustering nodes over time in a brain network are determined by cross-dependence measures, which are computed from the entire range of values of the electroencephalogram (EEG) signals, from low to high amplitudes. We here developed the Club Exco method for clustering brain communities that exhibit synchronized extreme behaviors. To cluster multi-channel EEG data, Club-Exco uses…
▽ More
Current methods for clustering nodes over time in a brain network are determined by cross-dependence measures, which are computed from the entire range of values of the electroencephalogram (EEG) signals, from low to high amplitudes. We here developed the Club Exco method for clustering brain communities that exhibit synchronized extreme behaviors. To cluster multi-channel EEG data, Club-Exco uses a spherical $k$-means procedure applied to the ``pseudo-angles,'' derived from extreme absolute amplitudes of EEG signals. With this approach, a cluster center is considered an ``extremal prototype,'' revealing a community of EEG nodes sharing the same extreme behavior, a feature that traditional methods fail to identify. Hence, Club Exco serves as an exploratory tool to classify EEG channels into mutually asymptotically dependent or asymptotically independent groups. It provides insights into how the brain network organizes itself during an extreme event (e.g., an epileptic seizure) in contrast to a baseline state. We apply the Club Exco method to investigate temporal differences in EEG brain connectivity networks of a patient diagnosed with epilepsy, a chronic neurological disorder affecting more than 50 million people globally. Our extreme-value method reveals substantial differences in alpha (8--12 Hertz) oscillations across the brain network compared to coherence-based methods.
△ Less
Submitted 8 December, 2022;
originally announced December 2022.
-
Bayesian Parameter Inference for Partially Observed SDEs driven by Fractional Brownian Motion
Authors:
Mohamed Maama,
Ajay Jasra,
Hernando Ombao
Abstract:
In this paper we consider Bayesian parameter inference for partially observed fractional Brownian motion (fBM) models. The approach we follow is to time-discretize the hidden process and then to design Markov chain Monte Carlo (MCMC) algorithms to sample from the posterior density on the parameters given data. We rely on a novel representation of the time discretization, which seeks to sample from…
▽ More
In this paper we consider Bayesian parameter inference for partially observed fractional Brownian motion (fBM) models. The approach we follow is to time-discretize the hidden process and then to design Markov chain Monte Carlo (MCMC) algorithms to sample from the posterior density on the parameters given data. We rely on a novel representation of the time discretization, which seeks to sample from an approximation of the posterior and then corrects via importance sampling; the approximation reduces the time (in terms of total observation time T) by O(T). This method is extended by using a multilevel MCMC method which can reduce the computational cost to achieve a given mean square error (MSE) versus using a single time discretization. Our methods are illustrated on simulated and real data.
△ Less
Submitted 1 November, 2022;
originally announced November 2022.
-
Dynamic Topological Data Analysis of Functional Human Brain Networks
Authors:
Moo K. Chung,
Soumya Das,
Hernando Ombao
Abstract:
Developing reliable methods to discriminate different transient brain states that change over time is a key neuroscientific challenge in brain imaging studies. Topological data analysis (TDA), a novel framework based on algebraic topology, can handle such a challenge. However, existing TDA has been somewhat limited to capturing the static summary of dynamically changing brain networks. We propose…
▽ More
Developing reliable methods to discriminate different transient brain states that change over time is a key neuroscientific challenge in brain imaging studies. Topological data analysis (TDA), a novel framework based on algebraic topology, can handle such a challenge. However, existing TDA has been somewhat limited to capturing the static summary of dynamically changing brain networks. We propose a novel dynamic-TDA framework that builds persistent homology over a time series of brain networks. We construct a Wasserstein distance based inference procedure to discriminate between time series of networks. The method is applied to the resting-state functional magnetic resonance images of human brain. We demonstrate that our proposed dynamic-TDA approach can distinctly discriminate between the topological patterns of male and female brain networks. MATLAB code for implementing this method is available at https://github.com/laplcebeltrami/PH-STAT.
△ Less
Submitted 18 December, 2023; v1 submitted 17 October, 2022;
originally announced October 2022.
-
Modeling and Simulating Dependence in Networks Using Topological Data Analysis
Authors:
Anass El Yaagoubi Bourakna,
Moo K. Chung,
Hernando Ombao
Abstract:
Topological data analysis (TDA) approaches are becoming increasingly popular for studying the dependence patterns in multivariate time series data. In particular, various dependence patterns in brain networks may be linked to specific tasks and cognitive processes, which can be altered by various neurological and cognitive impairments such as Alzheimer's and Parkinson's diseases, as well as attent…
▽ More
Topological data analysis (TDA) approaches are becoming increasingly popular for studying the dependence patterns in multivariate time series data. In particular, various dependence patterns in brain networks may be linked to specific tasks and cognitive processes, which can be altered by various neurological and cognitive impairments such as Alzheimer's and Parkinson's diseases, as well as attention deficit hyperactivity disorder (ADHD). Because there is no ground-truth with known dependence patterns in real brain signals, testing new TDA methods on multivariate time series is still a challenge. Simulations are crucial for evaluating the performance of proposed TDA methods and testing procedures as well as for creating computation-based confidence intervals. To our knowledge, there are no methods that simulate multivariate time series data with specific and manually imposed connectivity patterns. In this paper we present a novel approach to simulate multivariate time series with specific number of cycles/holes in its dependence network. Furthermore, we also provide a procedure for generating higher dimensional topological features.
△ Less
Submitted 21 September, 2022;
originally announced September 2022.
-
Granger Causality using Neural Networks
Authors:
Samuel Horvath,
Malik Shahid Sultan,
Hernando Ombao
Abstract:
The Granger Causality (GC) test is a famous statistical hypothesis test for investigating if the past of one time series affects the future of the other. It helps in answering the question whether one time series is helpful in forecasting. Standard traditional approaches to Granger causality detection commonly assume linear dynamics, but such simplification does not hold in many real-world applica…
▽ More
The Granger Causality (GC) test is a famous statistical hypothesis test for investigating if the past of one time series affects the future of the other. It helps in answering the question whether one time series is helpful in forecasting. Standard traditional approaches to Granger causality detection commonly assume linear dynamics, but such simplification does not hold in many real-world applications, e.g., neuroscience or genomics that are inherently non-linear. In such cases, imposing linear models such as Vector Autoregressive (VAR) models can lead to inconsistent estimation of true Granger Causal interactions. Machine Learning (ML) can learn the hidden patterns in the datasets specifically Deep Learning (DL) has shown tremendous promise in learning the non-linear dynamics of complex systems. Recent work of Tank et al propose to overcome the issue of linear simplification in VAR models by using neural networks combined with sparsity-inducing penalties on the learn-able weights. In this work, we build upon ideas introduced by Tank et al. We propose several new classes of models that can handle underlying non-linearity. Firstly, we present the Learned Kernal VAR(LeKVAR) model-an extension of VAR models that also learns kernel parametrized by a neural net. Secondly, we show one can directly decouple lags and individual time series importance via decoupled penalties. This decoupling provides better scaling and allows us to embed lag selection into RNNs. Lastly, we propose a new training algorithm that supports mini-batching, and it is compatible with commonly used adaptive optimizers such as Adam.he proposed techniques are evaluated on several simulated datasets inspired by real-world applications.We also apply these methods to the Electro-Encephalogram (EEG) data for an epilepsy patient to study the evolution of GC before , during and after seizure across the 19 EEG channels.
△ Less
Submitted 7 August, 2022;
originally announced August 2022.
-
Time-Varying Dispersion Integer-Valued GARCH Models
Authors:
Wagner Barreto-Souza,
Luiza S. C. Piancastelli,
Konstantinos Fokianos,
Hernando Ombao
Abstract:
We propose a general class of INteger-valued Generalized AutoRegressive Conditionally Heteroscedastic (INGARCH) processes by allowing time-varying mean and dispersion parameters, which we call time-varying dispersion INGARCH (tv-DINGARCH) models. More specifically, we consider mixed Poisson INGARCH models and allow for dynamic modeling of the dispersion parameter (as well as the mean), similar to…
▽ More
We propose a general class of INteger-valued Generalized AutoRegressive Conditionally Heteroscedastic (INGARCH) processes by allowing time-varying mean and dispersion parameters, which we call time-varying dispersion INGARCH (tv-DINGARCH) models. More specifically, we consider mixed Poisson INGARCH models and allow for dynamic modeling of the dispersion parameter (as well as the mean), similar to the spirit of the ordinary GARCH models. We derive conditions to obtain first and second-order stationarity, and ergodicity as well. Estimation of the parameters is addressed and their associated asymptotic properties are established as well. A restricted bootstrap procedure is proposed for testing constant dispersion against time-varying dispersion. Monte Carlo simulation studies are presented for checking point estimation, standard errors, and the performance of the restricted bootstrap approach. We apply the tv-DINGARCH process to model the weekly number of reported measles infections in North Rhine-Westphalia, Germany, from January 2001 to May 2013, and compare its performance to the ordinary INGARCH approach.
△ Less
Submitted 30 May, 2024; v1 submitted 3 August, 2022;
originally announced August 2022.
-
Functional-Coefficient Models for Multivariate Time Series in Designed Experiments: with Applications to Brain Signals
Authors:
Paolo Victor Redondo,
Raphaël Huser,
Hernando Ombao
Abstract:
To study the neurophysiological basis of attention deficit hyperactivity disorder (ADHD), clinicians use electroencephalography (EEG) which record neuronal electrical activity on the cortex. The most commonly-used metric in ADHD is the theta-to-beta spectral power ratio (TBR) that is based on a single-channel analysis. However, initial findings for this measure have not been replicated in other st…
▽ More
To study the neurophysiological basis of attention deficit hyperactivity disorder (ADHD), clinicians use electroencephalography (EEG) which record neuronal electrical activity on the cortex. The most commonly-used metric in ADHD is the theta-to-beta spectral power ratio (TBR) that is based on a single-channel analysis. However, initial findings for this measure have not been replicated in other studies. Thus, instead of focusing on single-channel spectral power, a novel model for investigating interactions (dependence) between channels in the entire network is proposed. Although dependence measures such as coherence and partial directed coherence (PDC) are well explored in studying brain connectivity, these measures only capture linear dependence. Moreover, in designed clinical experiments, these dependence measures are observed to vary across subjects even within a homogeneous group. To address these limitations, we propose the mixed-effects functional-coefficient autoregressive (MX-FAR) model which captures between-subject variation by incorporating subject-specific random effects. The advantages of the MX-FAR model are the following: (1.) it captures potential non-linear dependence between channels; (2.) it is nonparametric and hence flexible and robust to model mis-specification; (3.) it can capture differences between groups when they exist; (4.) it accounts for variation across subjects; (5.) the framework easily incorporates well-known inference methods from mixed-effects models; (6.) it can be generalized to accommodate various covariates and factors. Finally, we apply the proposed MX-FAR model to analyze multichannel EEG signals and report novel findings on altered brain functional networks in ADHD.
△ Less
Submitted 8 August, 2022; v1 submitted 30 July, 2022;
originally announced August 2022.
-
Topological Data Analysis for Multivariate Time Series Data
Authors:
Anass El Yaagoubi Bourakna,
Moo K. Chung,
Hernando Ombao
Abstract:
Over the last two decades, topological data analysis (TDA) has emerged as a very powerful data analytic approach which can deal with various data modalities of varying complexities. One of the most commonly used tools in TDA is persistent homology (PH) which can extract topological properties from data at various scales. Our aim in this article is to introduce TDA concepts to a statistical audienc…
▽ More
Over the last two decades, topological data analysis (TDA) has emerged as a very powerful data analytic approach which can deal with various data modalities of varying complexities. One of the most commonly used tools in TDA is persistent homology (PH) which can extract topological properties from data at various scales. Our aim in this article is to introduce TDA concepts to a statistical audience and provide an approach to analyze multivariate time series data. The application focus will be on multivariate brain signals and brain connectivity networks. Finally, the paper concludes with an overview of some open problems and potential application of TDA to modeling directionality in a brain network as well as the casting of TDA in the context of mixed effects models to capture variations in the topological properties of data collected from multiple subjects
△ Less
Submitted 28 April, 2022;
originally announced April 2022.
-
Poisson-Birnbaum-Saunders Regression Model for Clustered Count Data
Authors:
Jussiane Nader Gonçalves,
Wagner Barreto-Souza,
Hernando Ombao
Abstract:
The premise of independence among subjects in the same cluster/group often fails in practice, and models that rely on such untenable assumption can produce misleading results. To overcome this severe deficiency, we introduce a new regression model to handle overdispersed and correlated clustered counts. To account for correlation within clusters, we propose a Poisson regression model where the obs…
▽ More
The premise of independence among subjects in the same cluster/group often fails in practice, and models that rely on such untenable assumption can produce misleading results. To overcome this severe deficiency, we introduce a new regression model to handle overdispersed and correlated clustered counts. To account for correlation within clusters, we propose a Poisson regression model where the observations within the same cluster are driven by the same latent random effect that follows the Birnbaum-Saunders distribution with a parameter that controls the strength of dependence among the individuals. This novel multivariate count model is called Clustered Poisson Birnbaum-Saunders (CPBS) regression. As illustrated in this paper, the CPBS model is analytically tractable, and its moment structure can be explicitly obtained. Estimation of parameters is performed through the maximum likelihood method, and an Expectation-Maximization (EM) algorithm is also developed. Simulation results to evaluate the finite-sample performance of our proposed estimators are presented. We also discuss diagnostic tools for checking model adequacy. An empirical application concerning the number of inpatient admissions by individuals to hospital emergency rooms, from the Medical Expenditure Panel Survey (MEPS) conducted by the United States Agency for Health Research and Quality, illustrates the usefulness of our proposed methodology.
△ Less
Submitted 21 February, 2022;
originally announced February 2022.
-
Graph Autoencoders for Embedding Learning in Brain Networks and Major Depressive Disorder Identification
Authors:
Fuad Noman,
Chee-Ming Ting,
Hakmook Kang,
Raphael C. -W. Phan,
Brian D. Boyd,
Warren D. Taylor,
Hernando Ombao
Abstract:
Brain functional connectivity (FC) reveals biomarkers for identification of various neuropsychiatric disorders. Recent application of deep neural networks (DNNs) to connectome-based classification mostly relies on traditional convolutional neural networks using input connectivity matrices on a regular Euclidean grid. We propose a graph deep learning framework to incorporate the non-Euclidean infor…
▽ More
Brain functional connectivity (FC) reveals biomarkers for identification of various neuropsychiatric disorders. Recent application of deep neural networks (DNNs) to connectome-based classification mostly relies on traditional convolutional neural networks using input connectivity matrices on a regular Euclidean grid. We propose a graph deep learning framework to incorporate the non-Euclidean information about graph structure for classifying functional magnetic resonance imaging (fMRI)-derived brain networks in major depressive disorder (MDD). We design a novel graph autoencoder (GAE) architecture based on the graph convolutional networks (GCNs) to embed the topological structure and node content of large-sized fMRI networks into low-dimensional latent representations. In network construction, we employ the Ledoit-Wolf (LDW) shrinkage method to estimate the high-dimensional FC metrics efficiently from fMRI data. We consider both supervised and unsupervised approaches for the graph embedding learning. The learned embeddings are then used as feature inputs for a deep fully-connected neural network (FCNN) to discriminate MDD from healthy controls. Evaluated on two resting-state fMRI (rs-fMRI) MDD datasets, results show that the proposed GAE-FCNN model significantly outperforms several state-of-the-art methods for brain connectome classification, achieving the best accuracy using the LDW-FC edges as node features. The graph embeddings of fMRI FC networks learned by the GAE also reveal apparent group differences between MDD and HC. Our new framework demonstrates feasibility of learning graph embeddings on brain networks to provide discriminative information for diagnosis of brain disorders.
△ Less
Submitted 2 June, 2022; v1 submitted 27 July, 2021;
originally announced July 2021.
-
BICNet: A Bayesian Approach for Estimating Task Effects on Intrinsic Connectivity Networks in fMRI Data
Authors:
Meini Tang,
Chee-Ming Ting,
Hernando Ombao
Abstract:
Intrinsic connectivity networks (ICNs) are specific dynamic functional brain networks that are consistently found under various conditions including rest and task. Studies have shown that some stimuli actually activate intrinsic connectivity through either suppression, excitation, moderation or modification. Nevertheless, the structure of ICNs and task-related effects on ICNs are not yet fully und…
▽ More
Intrinsic connectivity networks (ICNs) are specific dynamic functional brain networks that are consistently found under various conditions including rest and task. Studies have shown that some stimuli actually activate intrinsic connectivity through either suppression, excitation, moderation or modification. Nevertheless, the structure of ICNs and task-related effects on ICNs are not yet fully understood. In this paper, we propose a Bayesian Intrinsic Connectivity Network (BICNet) model to identify the ICNs and quantify the task-related effects on the ICN dynamics. Using an extended Bayesian dynamic sparse latent factor model, the proposed BICNet has the following advantages: (1) it simultaneously identifies the individual ICNs and group-level ICN spatial maps; (2) it robustly identifies ICNs by jointly modeling resting-state functional magnetic resonance imaging (rfMRI) and task-related functional magnetic resonance imaging (tfMRI); (3) compared to independent component analysis (ICA)-based methods, it can quantify the difference of ICNs amplitudes across different states; (4) it automatically performs feature selection through the sparsity of the ICNs rather than ad-hoc thresholding. The proposed BICNet was applied to the rfMRI and language tfMRI data from the Human Connectome Project (HCP) and the analysis identified several ICNs related to distinct language processing functions.
△ Less
Submitted 19 July, 2021;
originally announced July 2021.
-
Multivariate Conway-Maxwell-Poisson Distribution: Sarmanov Method and Doubly-Intractable Bayesian Inference
Authors:
Luiza S. C. Piancastelli,
Nial Friel,
Wagner Barreto-Souza,
Hernando Ombao
Abstract:
In this paper, a multivariate count distribution with Conway-Maxwell (COM)-Poisson marginals is proposed. To do this, we develop a modification of the Sarmanov method for constructing multivariate distributions. Our multivariate COM-Poisson (MultCOMP) model has desirable features such as (i) it admits a flexible covariance matrix allowing for both negative and positive non-diagonal entries; (ii) i…
▽ More
In this paper, a multivariate count distribution with Conway-Maxwell (COM)-Poisson marginals is proposed. To do this, we develop a modification of the Sarmanov method for constructing multivariate distributions. Our multivariate COM-Poisson (MultCOMP) model has desirable features such as (i) it admits a flexible covariance matrix allowing for both negative and positive non-diagonal entries; (ii) it overcomes the limitation of the existing bivariate COM-Poisson distributions in the literature that do not have COM-Poisson marginals; (iii) it allows for the analysis of multivariate counts and is not just limited to bivariate counts. Inferential challenges are presented by the likelihood specification as it depends on a number of intractable normalizing constants involving the model parameters. These obstacles motivate us to propose a Bayesian inferential approach where the resulting doubly-intractable posterior is dealt with via the exchange algorithm and the Grouped Independence Metropolis-Hastings algorithm. Numerical experiments based on simulations are presented to illustrate the proposed Bayesian approach. We analyze the potential of the MultCOMP model through a real data application on the numbers of goals scored by the home and away teams in the Premier League from 2018 to 2021. Here, our interest is to assess the effect of a lack of crowds during the COVID-19 pandemic on the well-known home team advantage. A MultCOMP model fit shows that there is evidence of a decreased number of goals scored by the home team, not accompanied by a reduced score from the opponent. Hence, our analysis suggests a smaller home team advantage in the absence of crowds, which agrees with the opinion of several football experts.
△ Less
Submitted 15 July, 2021;
originally announced July 2021.
-
Markov-Switching State-Space Models with Applications to Neuroimaging
Authors:
David Degras,
Chee-Ming Ting,
Hernando Ombao
Abstract:
State-space models (SSM) with Markov switching offer a powerful framework for detecting multiple regimes in time series, analyzing mutual dependence and dynamics within regimes, and asserting transitions between regimes. These models however present considerable computational challenges due to the exponential number of possible regime sequences to account for. In addition, high dimensionality of t…
▽ More
State-space models (SSM) with Markov switching offer a powerful framework for detecting multiple regimes in time series, analyzing mutual dependence and dynamics within regimes, and asserting transitions between regimes. These models however present considerable computational challenges due to the exponential number of possible regime sequences to account for. In addition, high dimensionality of time series can hinder likelihood-based inference. This paper proposes novel statistical methods for Markov-switching SSMs using maximum likelihood estimation, Expectation-Maximization (EM), and parametric bootstrap. We develop solutions for initializing the EM algorithm, accelerating convergence, and conducting inference that are ideally suited to massive spatio-temporal data such as brain signals. We evaluate these methods in simulations and present applications to EEG studies of epilepsy and of motor imagery. All proposed methods are implemented in a MATLAB toolbox available at https://github.com/ddegras/switch-ssm.
△ Less
Submitted 9 June, 2021;
originally announced June 2021.
-
Filtrated Common Functional Principal Components for Multivariate Functional data
Authors:
Shuhao Jiao,
Ron D. Frostig,
Hernando Ombao
Abstract:
Local field potentials (LFPs) are signals that measure electrical activity in localized cortical regions from implanted tetrodes in the human or animal brain. The LFP signals are curves observed at multiple tetrodes which are implanted across a patch on the surface of the cortex. Hence, they can be treated as multi-group functional data, where the trajectories collected across temporal epochs from…
▽ More
Local field potentials (LFPs) are signals that measure electrical activity in localized cortical regions from implanted tetrodes in the human or animal brain. The LFP signals are curves observed at multiple tetrodes which are implanted across a patch on the surface of the cortex. Hence, they can be treated as multi-group functional data, where the trajectories collected across temporal epochs from one tetrode are viewed as a group of functions. In many cases, multi-tetrode LFP trajectories contain both global variation patterns (which are shared in common to all groups, due to signal synchrony) and isolated variation patterns (common only to a small subset of groups), and such structure is very informative to the analysis of such data. Therefore, one goal in this paper is to develop an efficient procedure that is able to capture and quantify both global and isolated features. We propose a novel tree-structured functional principal components (filt-fPC) analysis through finite-dimensional functional representation - specifically via filtration. A major advantage of the proposed filt-fPC method is the ability to extract the components that are common to multiple groups (or tetrodes) in a flexible "multi-resolution" manner and simultaneously preserve the idiosyncratic individual components of different tetrodes. The proposed filt-fPC approach is highly data-driven and no "ground-truth" model pre-specification is needed, making it a suitable approach for analyzing multi-group functional data that is complex. In addition, the filt-fPC method is able to produce a parsimonious, interpretable, and efficient low dimensional representation of multi-group functional data with orthonormal basis functions. Here, the proposed filt-fPCA method is employed to study the impact of a shock (induced stroke) on the synchrony structure of the rat brain.
△ Less
Submitted 26 November, 2022; v1 submitted 2 June, 2021;
originally announced June 2021.
-
SCAU: Modeling spectral causality for multivariate time series with applications to electroencephalograms
Authors:
Marco Antonio Pinto-Orellana,
Peyman Mirtaheri,
Hugo L. Hammer,
Hernando Ombao
Abstract:
Electroencephalograms (EEG) are noninvasive measurement signals of electrical neuronal activity in the brain. One of the current major statistical challenges is formally measuring functional dependency between those complex signals. This paper, proposes the spectral causality model (SCAU), a robust linear model, under a causality paradigm, to reflect inter- and intra-frequency modulation effects t…
▽ More
Electroencephalograms (EEG) are noninvasive measurement signals of electrical neuronal activity in the brain. One of the current major statistical challenges is formally measuring functional dependency between those complex signals. This paper, proposes the spectral causality model (SCAU), a robust linear model, under a causality paradigm, to reflect inter- and intra-frequency modulation effects that cannot be identifiable using other methods. SCAU inference is conducted with three main steps: (a) signal decomposition into frequency bins, (b) intermediate spectral band mapping, and (c) dependency modeling through frequency-specific autoregressive models (VAR). We apply SCAU to study complex dependencies during visual and lexical fluency tasks (word generation and visual fixation) in 26 participants' EEGs. We compared the connectivity networks estimated using SCAU with respect to a VAR model. SCAU networks show a clear contrast for both stimuli while the magnitude links also denoted a low variance in comparison with the VAR networks. Furthermore, SCAU dependency connections not only were consistent with findings in the neuroscience literature, but it also provided further evidence on the directionality of the spatio-spectral dependencies such as the delta-originated and theta-induced links in the fronto-temporal brain network.
△ Less
Submitted 13 May, 2021;
originally announced May 2021.
-
Lattice Paths for Persistent Diagrams
Authors:
Moo K. Chung,
Hernando Ombao
Abstract:
Persistent homology has undergone significant development in recent years. However, one outstanding challenge is to build a coherent statistical inference procedure on persistent diagrams. In this paper, we first present a new lattice path representation for persistent diagrams. We then develop a new exact statistical inference procedure for lattice paths via combinatorial enumerations. The lattic…
▽ More
Persistent homology has undergone significant development in recent years. However, one outstanding challenge is to build a coherent statistical inference procedure on persistent diagrams. In this paper, we first present a new lattice path representation for persistent diagrams. We then develop a new exact statistical inference procedure for lattice paths via combinatorial enumerations. The lattice path method is applied to the topological characterization of the protein structures of the COVID-19 virus. We demonstrate that there are topological changes during the conformational change of spike proteins.
△ Less
Submitted 30 July, 2021; v1 submitted 1 May, 2021;
originally announced May 2021.
-
Spectral Dependence
Authors:
Hernando Ombao,
Marco Pinto
Abstract:
This paper presents a general framework for modeling dependence in multivariate time series. Its fundamental approach relies on decomposing each signal in a system into various frequency components and then studying the dependence properties through these oscillatory activities.The unifying theme across the paper is to explore the strength of dependence and possible lead-lag dynamics through filte…
▽ More
This paper presents a general framework for modeling dependence in multivariate time series. Its fundamental approach relies on decomposing each signal in a system into various frequency components and then studying the dependence properties through these oscillatory activities.The unifying theme across the paper is to explore the strength of dependence and possible lead-lag dynamics through filtering. The proposed framework is capable of representing both linear and non-linear dependencies that could occur instantaneously or after some delay(lagged dependence). Examples for studying dependence between oscillations are illustrated through multichannel electroencephalograms. These examples emphasized that some of the most prominent frequency domain measures such as coherence, partial coherence,and dual-frequency coherence can be derived as special cases under this general framework.This paper also introduces related approaches for modeling dependence through phase-amplitude coupling and causality of (one-sided) filtered signals.
△ Less
Submitted 31 March, 2021;
originally announced March 2021.
-
Time-varying $\ell_0$ optimization for Spike Inference from Multi-Trial Calcium Recordings
Authors:
Tong Shen,
Kevin Johnston,
Gyorgy Lur,
Michele Guindani,
Hernando Ombao,
Zhaoxia Yu
Abstract:
Optical imaging of genetically encoded calcium indicators is a powerful tool to record the activity of a large number of neurons simultaneously over a long period of time from freely behaving animals. However, determining the exact time at which a neuron spikes and estimating the underlying firing rate from calcium fluorescence data remains challenging, especially for calcium imaging data obtained…
▽ More
Optical imaging of genetically encoded calcium indicators is a powerful tool to record the activity of a large number of neurons simultaneously over a long period of time from freely behaving animals. However, determining the exact time at which a neuron spikes and estimating the underlying firing rate from calcium fluorescence data remains challenging, especially for calcium imaging data obtained from a longitudinal study. We propose a multi-trial time-varying $\ell_0$ penalized method to jointly detect spikes and estimate firing rates by robustly integrating evolving neural dynamics across trials. Our simulation study shows that the proposed method performs well in both spike detection and firing rate estimation. We demonstrate the usefulness of our method on calcium fluorescence trace data from two studies, with the first study showing differential firing rate functions between two behaviors and the second study showing evolving firing rate function across trials due to learning.
△ Less
Submitted 1 March, 2021;
originally announced March 2021.
-
Ridge-penalized adaptive Mantel test and its application in imaging genetics
Authors:
Dustin Pluta,
Tong Shen,
Gui Xue,
Chuansheng Chen,
Hernando Ombao,
Zhaoxia Yu
Abstract:
We propose a ridge-penalized adaptive Mantel test (AdaMant) for evaluating the association of two high-dimensional sets of features. By introducing a ridge penalty, AdaMant tests the association across many metrics simultaneously. We demonstrate how ridge penalization bridges Euclidean and Mahalanobis distances and their corresponding linear models from the perspective of association measurement a…
▽ More
We propose a ridge-penalized adaptive Mantel test (AdaMant) for evaluating the association of two high-dimensional sets of features. By introducing a ridge penalty, AdaMant tests the association across many metrics simultaneously. We demonstrate how ridge penalization bridges Euclidean and Mahalanobis distances and their corresponding linear models from the perspective of association measurement and testing. This result is not only theoretically interesting but also has important implications in penalized hypothesis testing, especially in high dimensional settings such as imaging genetics. Applying the proposed method to an imaging genetic study of visual working memory in health adults, we identified interesting associations of brain connectivity (measured by EEG coherence) with selected genetic features.
△ Less
Submitted 20 March, 2021; v1 submitted 2 March, 2021;
originally announced March 2021.
-
Statistical Inference for Local Granger Causality
Authors:
Yan Liu,
Masanobu Taniguchi,
Hernando Ombao
Abstract:
Granger causality has been employed to investigate causality relations between components of stationary multiple time series. We generalize this concept by developing statistical inference for local Granger causality for multivariate locally stationary processes. Our proposed local Granger causality approach captures time-evolving causality relationships in nonstationary processes. The proposed lo…
▽ More
Granger causality has been employed to investigate causality relations between components of stationary multiple time series. We generalize this concept by developing statistical inference for local Granger causality for multivariate locally stationary processes. Our proposed local Granger causality approach captures time-evolving causality relationships in nonstationary processes. The proposed local Granger causality is well represented in the frequency domain and estimated based on the parametric time-varying spectral density matrix using the local Whittle likelihood. Under regularity conditions, we demonstrate that the estimators converge to multivariate normal in distribution. Additionally, the test statistic for the local Granger causality is shown to be asymptotically distributed as a quadratic form of a multivariate normal distribution. The finite sample performance is confirmed with several simulation studies for multivariate time-varying autoregressive models. For practical demonstration, the proposed local Granger causality method uncovered new functional connectivity relationships between channels in brain signals. Moreover, the method was able to identify structural changes in financial data.
△ Less
Submitted 4 August, 2021; v1 submitted 27 February, 2021;
originally announced March 2021.
-
Smooth Online Parameter Estimation for time varying VAR models with application to rat's LFP data
Authors:
Anass El Yaagoubi Bourakna,
Marco Pinto,
Norbert Fortin,
Hernando Ombao
Abstract:
Multivariate time series data appear often as realizations of non-stationary processes where the covariance matrix or spectral matrix smoothly evolve over time. Most of the current approaches estimate the time-varying spectral properties only retrospectively - that is, after the entire data has been observed. Retrospective estimation is a major limitation in many adaptive control applications wher…
▽ More
Multivariate time series data appear often as realizations of non-stationary processes where the covariance matrix or spectral matrix smoothly evolve over time. Most of the current approaches estimate the time-varying spectral properties only retrospectively - that is, after the entire data has been observed. Retrospective estimation is a major limitation in many adaptive control applications where it is important to estimate these properties and detect changes in the system as they happen in real-time. One major obstacle in online estimation is the computational cost due to the high-dimensionality of the parameters. Existing methods such as the Kalman filter or local least squares are feasible. However, they are not always suitable because they provide noisy estimates and can become prohibitively costly as the dimension of the time series increases. In our brain signal application, it is critical to develop a robust method that can estimate, in real-time, the properties of the underlying stochastic process, in particular, the spectral brain connectivity measures. For these reasons we propose a new smooth online parameter estimation approach (SOPE) that has the ability to control for the smoothness of the estimates with a reasonable computational complexity. Consequently, the models are fit in real-time even for high dimensional time series. We demonstrate that our proposed SOPE approach is as good as the Kalman filter in terms of mean-squared error for small dimensions. However, unlike the Kalman filter, the SOPE has lower computational cost and hence scalable for higher dimensions. Finally, we apply the SOPE method to a rat's local field potential data during a hippocampus-dependent sequence-memory task. As demonstrated in the video, the proposed SOPE method is able to capture the dynamics of the connectivity as the rat performs the sequence of non-spatial working memory tasks.
△ Less
Submitted 5 March, 2022; v1 submitted 24 February, 2021;
originally announced February 2021.
-
Brain Waves Analysis Via a Non-parametric Bayesian Mixture of Autoregressive Kernels
Authors:
Guillermo Granados-Garcia,
Mark Fiecas,
Babak Shahbaba,
Norbert Fortin,
Hernando Ombao
Abstract:
The standard approach to analyzing brain electrical activity is to examine the spectral density function (SDF) and identify predefined frequency bands that have the most substantial relative contributions to the overall variance of the signal. However, a limitation of this approach is that the precise frequency and bandwidth of oscillations vary with cognitive demands. Thus they should not be arbi…
▽ More
The standard approach to analyzing brain electrical activity is to examine the spectral density function (SDF) and identify predefined frequency bands that have the most substantial relative contributions to the overall variance of the signal. However, a limitation of this approach is that the precise frequency and bandwidth of oscillations vary with cognitive demands. Thus they should not be arbitrarily defined a priori in an experiment. In this paper, we develop a data-driven approach that identifies (i) the number of prominent peaks, (ii) the frequency peak locations, and (iii) their corresponding bandwidths (or spread of power around the peaks). We propose a Bayesian mixture auto-regressive decomposition method (BMARD), which represents the standardized SDFas a Dirichlet process mixture based on a kernel derived from second-order auto-regressive processes which completely characterize the location (peak)and scale (bandwidth) parameters. We present a Metropolis-Hastings within Gibbs algorithm to sample from the posterior distribution of the mixture parameters. Simulation studies demonstrate the robustness and performance of the BMARD method. Finally, we use the proposed BMARD method to analyze local field potential (LFP) activity from the hippocampus of laboratory rats across different conditions in a non-spatial sequence memory experiment to identify the most interesting frequency bands and examine the link between specific patterns of activity and trial-specific cognitive demands.
△ Less
Submitted 25 March, 2021; v1 submitted 23 February, 2021;
originally announced February 2021.
-
Separating Stimulus-Induced and Background Components of Dynamic Functional Connectivity in Naturalistic fMRI
Authors:
Chee-Ming Ting,
Jeremy I. Skipper,
Steven L. Small,
Hernando Ombao
Abstract:
We consider the challenges in extracting stimulus-related neural dynamics from other intrinsic processes and noise in naturalistic functional magnetic resonance imaging (fMRI). Most studies rely on inter-subject correlations (ISC) of low-level regional activity and neglect varying responses in individuals. We propose a novel, data-driven approach based on low-rank plus sparse (L+S) decomposition t…
▽ More
We consider the challenges in extracting stimulus-related neural dynamics from other intrinsic processes and noise in naturalistic functional magnetic resonance imaging (fMRI). Most studies rely on inter-subject correlations (ISC) of low-level regional activity and neglect varying responses in individuals. We propose a novel, data-driven approach based on low-rank plus sparse (L+S) decomposition to isolate stimulus-driven dynamic changes in brain functional connectivity (FC) from the background noise, by exploiting shared network structure among subjects receiving the same naturalistic stimuli. The time-resolved multi-subject FC matrices are modeled as a sum of a low-rank component of correlated FC patterns across subjects, and a sparse component of subject-specific, idiosyncratic background activities. To recover the shared low-rank subspace, we introduce a fused version of principal component pursuit (PCP) by adding a fusion-type penalty on the differences between the rows of the low-rank matrix. The method improves the detection of stimulus-induced group-level homogeneity in the FC profile while capturing inter-subject variability. We develop an efficient algorithm via a linearized alternating direction method of multipliers to solve the fused-PCP. Simulations show accurate recovery by the fused-PCP even when a large fraction of FC edges are severely corrupted. When applied to natural fMRI data, our method reveals FC changes that were time-locked to auditory processing during movie watching, with dynamic engagement of sensorimotor systems for speech-in-noise. It also provides a better mapping to auditory content in the movie than ISC.
△ Less
Submitted 24 January, 2021;
originally announced February 2021.
-
Conex-Connect: Learning Patterns in Extremal Brain Connectivity From Multi-Channel EEG Data
Authors:
Matheus B. Guerrero,
Raphaël Huser,
Hernando Ombao
Abstract:
Epilepsy is a chronic neurological disorder affecting more than 50 million people globally. An epileptic seizure acts like a temporary shock to the neuronal system, disrupting normal electrical activity in the brain. Epilepsy is frequently diagnosed with electroencephalograms (EEGs). Current methods study the time-varying spectra and coherence but do not directly model changes in extreme behavior.…
▽ More
Epilepsy is a chronic neurological disorder affecting more than 50 million people globally. An epileptic seizure acts like a temporary shock to the neuronal system, disrupting normal electrical activity in the brain. Epilepsy is frequently diagnosed with electroencephalograms (EEGs). Current methods study the time-varying spectra and coherence but do not directly model changes in extreme behavior. Thus, we propose a new approach to characterize brain connectivity based on the joint tail behavior of the EEGs. Our proposed method, the conditional extremal dependence for brain connectivity (Conex-Connect), is a pioneering approach that links the association between extreme values of higher oscillations at a reference channel with the other brain network channels. Using the Conex-Connect method, we discover changes in the extremal dependence driven by the activity at the foci of the epileptic seizure. Our model-based approach reveals that, pre-seizure, the dependence is notably stable for all channels when conditioning on extreme values of the focal seizure area. Post-seizure, by contrast, the dependence between channels is weaker, and dependence patterns are more "chaotic". Moreover, in terms of spectral decomposition, we find that high values of the high-frequency Gamma-band are the most relevant features to explain the conditional extremal dependence of brain connectivity.
△ Less
Submitted 3 January, 2021;
originally announced January 2021.
-
Change-point detection using spectral PCA for multivariate time series
Authors:
Shuhao Jiao,
Tong Shen,
Zhaoxia Yu,
Hernando Ombao
Abstract:
We propose a two-stage approach Spec PC-CP to identify change points in multivariate time series. In the first stage, we obtain a low-dimensional summary of the high-dimensional time series by Spectral Principal Component Analysis (Spec-PCA). In the second stage, we apply cumulative sum-type test on the Spectral PCA component using a binary segmentation algorithm. Compared with existing approaches…
▽ More
We propose a two-stage approach Spec PC-CP to identify change points in multivariate time series. In the first stage, we obtain a low-dimensional summary of the high-dimensional time series by Spectral Principal Component Analysis (Spec-PCA). In the second stage, we apply cumulative sum-type test on the Spectral PCA component using a binary segmentation algorithm. Compared with existing approaches, the proposed method is able to capture the lead-lag relationship in time series. Our simulations demonstrate that the Spec PC-CP method performs significantly better than competing methods for detecting change points in high-dimensional time series. The results on epileptic seizure EEG data and stock data also indicate that our new method can efficiently {detect} change points corresponding to the onset of the underlying events.
△ Less
Submitted 12 January, 2021;
originally announced January 2021.
-
Flexible Bivariate INGARCH Process With a Broad Range of Contemporaneous Correlation
Authors:
Luiza S. C. Piancastelli,
Wagner Barreto-Souza,
Hernando Ombao
Abstract:
We propose a novel flexible bivariate conditional Poisson (BCP) INteger-valued Generalized AutoRegressive Conditional Heteroscedastic (INGARCH) model for correlated count time series data. Our proposed BCP-INGARCH model is mathematically tractable and has as the main advantage over existing bivariate INGARCH models its ability to capture a broad range (both negative and positive) of contemporaneou…
▽ More
We propose a novel flexible bivariate conditional Poisson (BCP) INteger-valued Generalized AutoRegressive Conditional Heteroscedastic (INGARCH) model for correlated count time series data. Our proposed BCP-INGARCH model is mathematically tractable and has as the main advantage over existing bivariate INGARCH models its ability to capture a broad range (both negative and positive) of contemporaneous cross-correlation which is a non-trivial advancement. Properties of stationarity and ergodicity for the BCP-INGARCH process are developed. Estimation of the parameters is performed through conditional maximum likelihood (CML) and finite sample behavior of the estimators are investigated through simulation studies. Asymptotic properties of the CML estimators are derived. Additional simulation studies compare and contrast methods of obtaining standard errors of the parameter estimates, where a bootstrap option is demonstrated to be advantageous. Hypothesis testing methods for the presence of contemporaneous correlation between the time series are presented and evaluated. We apply our methodology to monthly counts of hepatitis cases at two nearby Brazilian cities, which are highly cross-correlated. The data analysis demonstrates the importance of considering a bivariate model allowing for a wide range of contemporaneous correlation in real-life applications.
△ Less
Submitted 17 November, 2020;
originally announced November 2020.
-
Structural Brain Asymmetries in Youths with Combined and Inattentive Presentations of Attention Deficit Hyperactivity Disorder
Authors:
Cintya Nirvana Dutta,
Pamela K. Douglas,
Hernando Ombao
Abstract:
Alterations in structural brain laterality are reported in attention-deficit/hyperactivity disorder (ADHD). However, few studies examined differences within presentations of ADHD. We investigate asymmetry index (AI) across 13 subcortical and 33 cortical regions from anatomical metrics of volume, surface area, and thickness. Structural T1-weighted MRI data were obtained from youths with inattentive…
▽ More
Alterations in structural brain laterality are reported in attention-deficit/hyperactivity disorder (ADHD). However, few studies examined differences within presentations of ADHD. We investigate asymmetry index (AI) across 13 subcortical and 33 cortical regions from anatomical metrics of volume, surface area, and thickness. Structural T1-weighted MRI data were obtained from youths with inattentive (n = 64) and combined (n = 51) presentations, and aged-matched controls (n = 298). We used a linear mixed effect model that accounts for data site heterogeneity, while studying associations between AI and covariates of presentation and age. Our paper contributes to the functional results seen among ADHD presentations evidencing disrupted connectivity in motor networks from ADHD-C and cingulo-frontal networks from ADHD-I, as well as new findings in the temporal cortex and default mode subnetworks. Age patterns of structural asymmetries vary with presentation type. Linear mixed effects model is a practical tool for characterizing associations between brain asymmetries, diagnosis, and neurodevelopment.
△ Less
Submitted 26 October, 2020;
originally announced October 2020.
-
Clustering Brain Signals: A Robust Approach Using Functional Data Ranking
Authors:
Tianbo Chen,
Ying Sun,
Carolina Euan,
Hernando Ombao
Abstract:
In this paper, we analyze electroencephalograms (EEG) which are recordings of brain electrical activity. We develop new clustering methods for identifying synchronized brain regions, where the EEGs show similar oscillations or waveforms according to their spectral densities. We treat the estimated spectral densities from many epochs or trials as functional data and develop clustering algorithms ba…
▽ More
In this paper, we analyze electroencephalograms (EEG) which are recordings of brain electrical activity. We develop new clustering methods for identifying synchronized brain regions, where the EEGs show similar oscillations or waveforms according to their spectral densities. We treat the estimated spectral densities from many epochs or trials as functional data and develop clustering algorithms based on functional data ranking. The two proposed clustering algorithms use different dissimilarity measures: distance of the functional medians and the area of the central region. The performance of the proposed algorithms is examined by simulation studies. We show that, when contaminations are present, the proposed methods for clustering spectral densities are more robust than the mean-based methods. The developed methods are applied to two stages of resting state EEG data from a male college student, corresponding to early exploration of functional connectivity in the human brain.
△ Less
Submitted 28 July, 2020;
originally announced July 2020.
-
Levels and trends in the sex ratio at birth in seven provinces of Nepal between 1980 and 2016 with probabilistic projections to 2050: a Bayesian modeling approach
Authors:
Fengqing Chao,
Samir KC,
Hernando Ombao
Abstract:
The sex ratio at birth (SRB; ratio of male to female births) in Nepal has been reported without imbalance on the national level. However, the national SRB could mask the disparity within the country. Given the demographic and cultural heterogeneities in Nepal, it is crucial to model Nepal SRB on the subnational level. Prior studies on subnational SRB in Nepal are mostly based on reporting observed…
▽ More
The sex ratio at birth (SRB; ratio of male to female births) in Nepal has been reported without imbalance on the national level. However, the national SRB could mask the disparity within the country. Given the demographic and cultural heterogeneities in Nepal, it is crucial to model Nepal SRB on the subnational level. Prior studies on subnational SRB in Nepal are mostly based on reporting observed values from surveys and census, and no study has provided probabilistic projections. We aim to estimate and project SRB for the seven provinces of Nepal from 1980 to 2050 using a Bayesian modeling approach. We compiled an extensive database on provincial SRB of Nepal, consisting 2001, 2006, 2011, and 2016 Nepal Demographic and Health Surveys and 2011 Census. We adopted a Bayesian hierarchical time series model to estimate and project the provincial SRB, with a focus on modelling the potential SRB imbalance. In 2016, the highest SRB is estimated in Province 5 at 1.102 with a 95% credible interval (1.044, 1.127) and the lowest SRB is in Province 2 at 1.053 (1.035, 1.109). The SRB imbalance probabilities in all provinces are generally low and vary from 16% in Province 2 to 81% in Province 5. SRB imbalances are estimated to have begun at the earliest in 2001 in Province 5 with a 95% credible interval (1992, 2022) and the latest in 2017 (1998, 2040) in Province 2. We project SRB in all provinces to begin converging back to the national baseline in the mid-2030s. Our findings imply that the majority of provinces in Nepal have low risks of SRB imbalance for the period 1980-2016. However, we identify a few provinces with higher probabilities of having SRB inflation. The projected SRB is an important illustration of potential future prenatal sex discrimination and shows the need to monitor SRB in provinces with higher possibilities of SRB imbalance.
△ Less
Submitted 30 August, 2020; v1 submitted 1 July, 2020;
originally announced July 2020.
-
Break Point Detection for Functional Covariance
Authors:
Shuhao Jiao,
Ron D. Frostig,
Hernando Ombao
Abstract:
Many experiments record sequential trajectories where each trajectory consists of oscillations and fluctuations around zero. Such trajectories can be viewed as zero-mean functional data. When there are structural breaks (on the sequence of trajectories) in higher order moments, it is not always easy to spot these by mere visual inspection. Motivated by this challenging problem in brain signal anal…
▽ More
Many experiments record sequential trajectories where each trajectory consists of oscillations and fluctuations around zero. Such trajectories can be viewed as zero-mean functional data. When there are structural breaks (on the sequence of trajectories) in higher order moments, it is not always easy to spot these by mere visual inspection. Motivated by this challenging problem in brain signal analysis, we propose a detection and testing procedure to find the change point in functional covariance. The detection procedure is based on the cumulative sum statistics (CUSUM). The classical testing procedure for functional data depends on a null distribution which depends on infinitely many unknown parameters, though in practice only a finite number of these can be included for the hypothesis test of the existence of change point. This paper provides some theoretical insights on the influence of the number of parameters. Meanwhile, the asymptotic properties of the estimated change point are developed. The effectiveness of the proposed method is numerically validated in simulation studies and an application to investigate changes in rat brain signals following an experimentally-induced stroke.
△ Less
Submitted 4 February, 2022; v1 submitted 24 June, 2020;
originally announced June 2020.
-
Multiscale modelling of replicated nonstationary time series
Authors:
Jonathan Embleton,
Marina I. Knight,
Hernando Ombao
Abstract:
Within the neurosciences, to observe variability across time in the dynamics of an underlying brain process is neither new nor unexpected. Wavelets are essential in analyzing brain signals because, even within a single trial, brain signals exhibit nonstationary behaviour. However, neurological signals generated within an experiment may also potentially exhibit evolution across trials (replicates).…
▽ More
Within the neurosciences, to observe variability across time in the dynamics of an underlying brain process is neither new nor unexpected. Wavelets are essential in analyzing brain signals because, even within a single trial, brain signals exhibit nonstationary behaviour. However, neurological signals generated within an experiment may also potentially exhibit evolution across trials (replicates). As neurologists consider localised spectra of brain signals to be most informative, here we develop a novel wavelet-based tool capable to formally represent process nonstationarities across both time and replicate dimensions. Specifically, we propose the Replicate Locally Stationary Wavelet (RLSW) process, that captures the potential nonstationary behaviour within and across trials. Estimation using wavelets gives a natural desired time- and replicate-localisation of the process dynamics. We develop the associated spectral estimation framework and establish its asymptotic properties. By means of thorough simulation studies, we demonstrate the theoretical estimator properties hold in practice. A real data investigation into the evolutionary dynamics of the hippocampus and nucleus accumbens during an associative learning experiment, demonstrate the applicability of our proposed methodology, as well as the new insights it provides.
△ Less
Submitted 19 May, 2020;
originally announced May 2020.
-
Semiparametric time series models driven by latent factor
Authors:
Gisele O. Maia,
Wagner Barreto-Souza,
Fernando S. Bastos,
Hernando Ombao
Abstract:
We introduce a class of semiparametric time series models by assuming a quasi-likelihood approach driven by a latent factor process. More specifically, given the latent process, we only specify the conditional mean and variance of the time series and enjoy a quasi-likelihood function for estimating parameters related to the mean. This proposed methodology has three remarkable features: (i) no para…
▽ More
We introduce a class of semiparametric time series models by assuming a quasi-likelihood approach driven by a latent factor process. More specifically, given the latent process, we only specify the conditional mean and variance of the time series and enjoy a quasi-likelihood function for estimating parameters related to the mean. This proposed methodology has three remarkable features: (i) no parametric form is assumed for the conditional distribution of the time series given the latent process; (ii) able for modelling non-negative, count, bounded/binary and real-valued time series; (iii) dispersion parameter is not assumed to be known. Further, we obtain explicit expressions for the marginal moments and for the autocorrelation function of the time series process so that a method of moments can be employed for estimating the dispersion parameter and also parameters related to the latent process. Simulated results aiming to check the proposed estimation procedure are presented. Real data analysis on unemployment rate and precipitation time series illustrate the potencial for practice of our methodology.
△ Less
Submitted 23 April, 2020;
originally announced April 2020.
-
Integer-valued autoregressive process with flexible marginal and innovation distributions
Authors:
Matheus B. Guerrero,
Wagner Barreto-Souza,
Hernando Ombao
Abstract:
INteger Auto-Regressive (INAR) processes are usually defined by specifying the innovations and the operator, which often leads to difficulties in deriving marginal properties of the process. In many practical situations, a major modeling limitation is that it is difficult to justify the choice of the operator. To overcome these drawbacks, we propose a new flexible approach to build an INAR model:…
▽ More
INteger Auto-Regressive (INAR) processes are usually defined by specifying the innovations and the operator, which often leads to difficulties in deriving marginal properties of the process. In many practical situations, a major modeling limitation is that it is difficult to justify the choice of the operator. To overcome these drawbacks, we propose a new flexible approach to build an INAR model: we pre-specify the marginal and innovation distributions. Hence, the operator is a consequence of specifying the desired marginal and innovation distributions. Our new INAR model has both marginal and innovations geometric distributed, being a direct alternative to the classical Poisson INAR model. Our proposed process has interesting stochastic properties such as an MA($\infty$) representation, time-reversibility, and closed-forms for the transition probabilities $h$-steps ahead, allowing for coherent forecasting. We analyze time-series counts of skin lesions using our proposed approach, comparing it with existing INAR and INGARCH models. Our model gives more adherence to the data and better forecasting performance.
△ Less
Submitted 18 April, 2020;
originally announced April 2020.
-
Detecting Dynamic Community Structure in Functional Brain Networks Across Individuals: A Multilayer Approach
Authors:
Chee-Ming Ting,
S. Balqis Samdin,
Meini Tang,
Hernando Ombao
Abstract:
We present a unified statistical framework for characterizing community structure of brain functional networks that captures variation across individuals and evolution over time. Existing methods for community detection focus only on single-subject analysis of dynamic networks; while recent extensions to multiple-subjects analysis are limited to static networks. To overcome these limitations, we p…
▽ More
We present a unified statistical framework for characterizing community structure of brain functional networks that captures variation across individuals and evolution over time. Existing methods for community detection focus only on single-subject analysis of dynamic networks; while recent extensions to multiple-subjects analysis are limited to static networks. To overcome these limitations, we propose a multi-subject, Markov-switching stochastic block model (MSS-SBM) to identify state-related changes in brain community organization over a group of individuals. We first formulate a multilayer extension of SBM to describe the time-dependent, multi-subject brain networks. We develop a novel procedure for fitting the multilayer SBM that builds on multislice modularity maximization which can uncover a common community partition of all layers (subjects) simultaneously. By augmenting with a dynamic Markov switching process, our proposed method is able to capture a set of distinct, recurring temporal states with respect to inter-community interactions over subjects and the change points between them. Simulation shows accurate community recovery and tracking of dynamic community regimes over multilayer networks by the MSS-SBM. Application to task fMRI reveals meaningful non-assortative brain community motifs, e.g., core-periphery structure at the group level, that are associated with language comprehension and motor functions suggesting their putative role in complex information integration. Our approach detected dynamic reconfiguration of modular connectivity elicited by varying task demands and identified unique profiles of intra and inter-community connectivity across different task conditions. The proposed multilayer network representation provides a principled way of detecting synchronous, dynamic modularity in brain networks across subjects.
△ Less
Submitted 16 October, 2020; v1 submitted 9 April, 2020;
originally announced April 2020.
-
Probabilistic Projection of the Sex Ratio at Birth and Missing Female Births by State and Union Territory in India
Authors:
Fengqing Chao,
Christophe Z. Guilmoto,
Samir K. C.,
Hernando Ombao
Abstract:
The sex ratio at birth (SRB) in India has been reported imbalanced since the 1970s. Previous studies have shown a great variation in the SRB across geographic locations in India till 2016. As one of the most populous countries and in view of its great regional heterogeneity, it is crucial to produce probabilistic projections for the SRB in India at state level for the purpose of population project…
▽ More
The sex ratio at birth (SRB) in India has been reported imbalanced since the 1970s. Previous studies have shown a great variation in the SRB across geographic locations in India till 2016. As one of the most populous countries and in view of its great regional heterogeneity, it is crucial to produce probabilistic projections for the SRB in India at state level for the purpose of population projection and policy planning. In this paper, we implement a Bayesian hierarchical time series model to project SRB in India by state. We generate SRB probabilistic projections from 2017 to 2030 for 29 States and Union Territories (UTs) in India, and present results in 21 States/UTs with data from the Sample Registration System. Our analysis takes into account two state-specific factors that contribute to sex-selective abortion and resulting sex imbalances at birth: intensity of son preference and fertility squeeze. We project that the largest contribution to female births deficits is in Uttar Pradesh, with cumulative number of missing female births projected to be 2.0 (95% credible interval [1.9; 2.2]) million from 2017 to 2030. The total female birth deficits during 2017-2030 for the whole India is projected to be 6.8 [6.6; 7.0] million.
△ Less
Submitted 5 April, 2020;
originally announced April 2020.