Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
11institutetext: King Abdullah University of Science and Technology (KAUST), Statistics Program, Thuwal, 23955, Saudi Arabia
11email: anass.bourakna@kaust.edu.sa; hernando.ombao@kaust.edu.sa
22institutetext: University of Wisconsin-Madison, Department of Statistics, Medical Science Center 4725, 1300 University Ave, Madison, WI 53706, USA
22email: mkchung@wisc.edu

Topological Analysis of Seizure-Induced Changes in Brain Hierarchy Through Effective Connectivity

Anass B. El-Yaagoubi 11 0000-0002-8886-0587    Moo K. Chung 22 0000-0003-2852-9670    Hernando  Ombao 11 0000-0001-7020-8091
Abstract

Traditional Topological Data Analysis (TDA) methods, such as Persistent Homology (PH), rely on distance measures (e.g., cross-correlation, partial correlation, coherence, and partial coherence) that are symmetric by definition. While useful for studying topological patterns in functional brain connectivity, the main limitation of these methods is their inability to capture the directional dynamics - which is crucial for understanding effective brain connectivity. We propose the Causality-Based Topological Ranking (CBTR) method, which integrates Causal Inference (CI) to assess effective brain connectivity with Hodge Decomposition (HD) to rank brain regions based on their mutual influence. Our simulations confirm that the CBTR method accurately and consistently identifies hierarchical structures in multivariate time series data. Moreover, this method effectively identifies brain regions showing the most significant interaction changes with other regions during seizures using electroencephalogram (EEG) data. These results provide novel insights into the brain’s hierarchical organization and illuminate the impact of seizures on its dynamics.

Keywords:
Hodge Decomposition Topological Data Analysis Time Series Analysis Causal Inference Effective Brain Connectivity Seizure EEG Data.

1 Introduction

Over the past two decades, Topological Data Analysis (TDA) has become a valuable tool in various fields, providing new insights into complex data structures [1, 5, 6, 8, 9]. In biology, TDA has helped clarify genetic and evolutionary processes [19], while in materials science, it has improved predictions of properties in crystalline compounds [16]. In finance, TDA has offered a novel perspective on the dynamics of financial markets during crises [14]. In neuroscience, the use of Persistent Homology (PH), a key technique within TDA, has revealed new aspects of the topological organization of brain dependence networks. By analyzing multivariate brain signals, researchers have uncovered complex patterns and structures that deepen our understanding of brain connectivity and functionality [2, 10, 17].

However, traditional uses of PH often involve Vietoris-Rips filtrations from undirected brain dependence networks. A significant limitation of this approach is its inability to capture the directionality of connectivity between nodes in a brain network [11]. It is well-established that node dependencies within a brain network exhibit asymmetry, where the influence exerted by node A𝐴Aitalic_A on node B𝐵Bitalic_B can differ significantly from the influence exerted by node B𝐵Bitalic_B on node A𝐴Aitalic_A. This asymmetry is crucial for understanding effective brain connectivity, which involves directional information flow, from one region into another, that often varies markedly in its reverse path, thus delineating specific neural pathways [12, 13, 20, 23].

In the study of epilepsy, understanding changes in brain connectivity during seizures is crucial. Research indicates that epileptic seizures is associated with significant alterations in interactions between brain regions in a network. These changes vary depending on whether the seizure is generalized or originates from a specific focal region and spreads across the brain’s hemispheres [4, 7]. Traditional TDA methods, which primarily assess functional connectivity, often fall short in accurately characterizing these dynamic changes in effective brain connectivity.

The concept of causal inference refers to learning a causal model from observational data. In cognitive neuroscience, this process is fundamental for understanding how different brain regions interact, and how these interactions may be altered due to brain disorders  [25]. Experimental interventions (e.g., optogenetics) are considered the gold standard in causal discovery [18, 25], as they provide a way to experimentally manipulate activity of a neuron (or subpopulation of neurons) and investigate how these manipulations change the interaction of a neuron with another neuron (or another subpopulation of neurons). Through these experimental manipulations, one can gain definitive evidence of causal physiological relationships. While such interventions and manipulations are conducted in non-human animal studies, these are ethically and practically challenging when studying brain connectivity in humans. Consequently, we will only focus on identifying causal associations and quantifying their strength from observational data.

To gain better understanding of the dynamic changes in brain connectivity during epileptic seizures, we have developed the Causality-Based Topological Ranking (CBTR) approach. This novel method integrates Causal Inference (CI) and Hodge Decomposition (HD), allowing for the identification of causal pathways and the hierarchical ranking of brain regions based on their mutual influence over time. CBTR assesses dependencies conditioned on parent nodes, thereby avoiding confounding effects among observed variables. Our analysis of EEG data from subjects who experienced seizures helped us identify the brain regions most significantly impacted by seizures for each subject.

2 Methods

The goal is to develop a method for ranking brain regions (derived from the network of electroencehphalogram data) according to effective connectivity using a two-step process. First, causal inference techniques are employed to map the hierarchical structure of brain connectivity, with a focus on the effects of seizures. Next, Hodge Decomposition is applied to net flow of information between brain regions, quantifying their mutual influence.

2.1 Causal Inference of the Brain’s Hierarchical Structure

Understanding the dynamics of the hierarchical structure of the brain’s effective connectivity is crucial for comprehending the impact of seizures. In an EEG network, a channel is considered to be at the top of the hierarchy if it tends to exert greater influence on other nodes, whereas a brain channel is at the bottom if it tends to be influenced more by other nodes. This hierarchical structure is determined by analyzing the causal relationships and time-delayed dependencies among the channels. This is illustrated in Figure 1. Assessing changes in brain hierarchy allows us to quantify the effects of seizures on each specific EEG channel, providing deeper insights into how seizures alter brain connectivity.

Refer to caption
Figure 1: Potential hierarchy of the brain in normal (Left) and seizure (Right) conditions. In the normal state, the hierarchy is O1 (left occipital), O2 (right occipital) followed by T3 (left temporal), Fp2 (right frontal pole), and then T4 (right temporal), Fp1 (left frontal pole). During a seizure, the order changes to T3, O1, Fp1 followed by T4, O2, Fp2.

To estimate effective brain connectivity, we employ the PCMCI (Peter and Clark Momentary Conditional Independence) method, a robust causal inference technique well-suited for high-dimensional time series data [21, 22]. Following the PCMCI method, we conducted a two-stage process. First, we used the PC algorithm to identify potential causal parents for each variable, reducing dimensionality and producing a preliminary causal graph. Then, we applied the Momentary Conditional Independence (MCI) test to rigorously assess these dependencies’ significance, considering both contemporaneous and time-lagged relationships while controlling for false positives. This combination of dimension reduction and rigorous testing allowed PCMCI to effectively estimate the connectivity between observed time series components, providing a detailed map of directional dependencies in the brain.

The outputs of the PCMCI method include a causal graph that highlights the most significant causal links between brain regions at various lags, with each link annotated by its corresponding statistical significance and strength. This causal graph forms the foundation for our subsequent analyses. We estimate effective connectivity as a weighted directed network, where the weight of each link (𝐖p,qsubscript𝐖𝑝𝑞\mathbf{W}_{p,q}bold_W start_POSTSUBSCRIPT italic_p , italic_q end_POSTSUBSCRIPT) between regions p𝑝pitalic_p and q𝑞qitalic_q is derived from the significance level (p-value) associated with the causal interaction from q𝑞qitalic_q to p𝑝pitalic_p, as defined in Equations 1, 2.

𝐖p,qsubscript𝐖𝑝𝑞\displaystyle\mathbf{W}_{p,q}bold_W start_POSTSUBSCRIPT italic_p , italic_q end_POSTSUBSCRIPT =1p¯p,q,absent1subscript¯𝑝𝑝𝑞\displaystyle=1-\overline{p}_{p,q},= 1 - over¯ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_p , italic_q end_POSTSUBSCRIPT , (1)
p¯p,qsubscript¯𝑝𝑝𝑞\displaystyle\overline{p}_{p,q}over¯ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_p , italic_q end_POSTSUBSCRIPT =1Kk=0Kpp,q(k),absent1𝐾superscriptsubscript𝑘0𝐾subscript𝑝𝑝𝑞𝑘\displaystyle=\frac{1}{K}\sum_{k=0}^{K}p_{p,q}(k),= divide start_ARG 1 end_ARG start_ARG italic_K end_ARG ∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_p , italic_q end_POSTSUBSCRIPT ( italic_k ) , (2)

where pp,q(k)subscript𝑝𝑝𝑞𝑘p_{p,q}(k)italic_p start_POSTSUBSCRIPT italic_p , italic_q end_POSTSUBSCRIPT ( italic_k ) is the p-value corresponding to the MCI test from variable q𝑞qitalic_q to p𝑝pitalic_p at lag k𝑘kitalic_k. The estimated effective brain connectivity network is decomposed into symmetric (𝐖ssubscript𝐖𝑠\mathbf{W}_{s}bold_W start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT) and anti-symmetric (𝐖asubscript𝐖𝑎\mathbf{W}_{a}bold_W start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT) components.

𝐖𝐖\displaystyle\mathbf{W}bold_W =𝐖s+𝐖aabsentsubscript𝐖𝑠subscript𝐖𝑎\displaystyle=\mathbf{W}_{s}+\mathbf{W}_{a}= bold_W start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT + bold_W start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT (3)

where 𝐖s=12(𝐖+𝐖)subscript𝐖𝑠12𝐖superscript𝐖\mathbf{W}_{s}=\frac{1}{2}(\mathbf{W}+\mathbf{W}^{\prime})bold_W start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( bold_W + bold_W start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) represents mutual causal influence and 𝐖a=12(𝐖𝐖)subscript𝐖𝑎12𝐖superscript𝐖\mathbf{W}_{a}=\frac{1}{2}(\mathbf{W}-\mathbf{W}^{\prime})bold_W start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( bold_W - bold_W start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) represents net causal influence. Since brain dependencies can be asymmetric, this decomposition is relevant as it separates the connectivity network into meaningful symmetric and anti-symmetric components. The symmetric component, 𝐖ssubscript𝐖𝑠\mathbf{W}_{s}bold_W start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT, captures bidirectional interactions, reflecting mutual influences critical for cooperative neural processes. The anti-symmetric component, 𝐖asubscript𝐖𝑎\mathbf{W}_{a}bold_W start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT, highlights the predominant direction of causal influence, revealing the hierarchical structure and directional information flow within the brain network. This approach provides a nuanced understanding of neural activities and identifies key regions driving effective connectivity. This decomposition is attractive to neuroscientists - it is an essential tool to deepen our understanding of the hierarchical structure of the brain. The next step involves building the ranking of brain regions using Hodge Decomposition, which will allow us to quantify and rank the hierarchical influence of each region within the network.

2.2 Our Proposed Method for Ranking Brain Regions

Once effective connectivity is estimated and the net influence 𝐖asubscript𝐖𝑎\mathbf{W}_{a}bold_W start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT is derived, our proposed method will rank brain regions by assigning a score to each node or brain channel based on their influence within the network. Our method is based on the HodgeRank approach, as described in [15], which has previously been utilized to rank entities such as movies and football teams. A higher ranking score indicates a region that influences other regions more than it is influenced, while a lower score suggests a region that is more influenced by others. This concept is illustrated in Figure 2.

Refer to caption
Figure 2: Node ranking in a graph involves a gradient operator that maps ranking scores on the left to gradient flow on the right (dependence structures). The adjoint, gradient star, reverses this mapping from right to left.

To achieve the most meaningful ranking (i.e., one that minimizes violations of mutual net causal influence), we apply Hodge Decomposition to the anti-symmetric component of the connectivity network. Since 𝐖p,q=𝐖q,psubscript𝐖𝑝𝑞subscript𝐖𝑞𝑝\mathbf{W}_{p,q}=-\mathbf{W}_{q,p}bold_W start_POSTSUBSCRIPT italic_p , italic_q end_POSTSUBSCRIPT = - bold_W start_POSTSUBSCRIPT italic_q , italic_p end_POSTSUBSCRIPT, 𝐖asubscript𝐖𝑎\mathbf{W}_{a}bold_W start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT is referred to as an alternating function. In the following, we first provide definitions for the gradient (grad) and curl operators, and then introduce the Hodge Decomposition Theorem.

Definition 1 (Gradient Operator)

The ’grad’ operator measures the flow of influence between nodes. For any edge e=(p,q)E𝑒𝑝𝑞𝐸e=(p,q)\in Eitalic_e = ( italic_p , italic_q ) ∈ italic_E, it is defined as:

grad(s)=sqsp,grad𝑠subscript𝑠𝑞subscript𝑠𝑝\displaystyle\text{grad}(s)=s_{q}-s_{p},grad ( italic_s ) = italic_s start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT - italic_s start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , (4)

where spsubscript𝑠𝑝s_{p}italic_s start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT and sqsubscript𝑠𝑞s_{q}italic_s start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT represent the scores or potentials of nodes p𝑝pitalic_p and q𝑞qitalic_q. This operator reflects the change in influence from node p𝑝pitalic_p to node q𝑞qitalic_q.

Definition 2 (Curl Operator)

The ’curl’ operator measures the rotation strength of mutual influence among nodes. For any face t=(p,q,r)T𝑡𝑝𝑞𝑟𝑇t=(p,q,r)\in Titalic_t = ( italic_p , italic_q , italic_r ) ∈ italic_T, it is defined as:

curl(ϕ)=ϕp,q+ϕq,r+ϕr,p,curlitalic-ϕsubscriptitalic-ϕ𝑝𝑞subscriptitalic-ϕ𝑞𝑟subscriptitalic-ϕ𝑟𝑝\displaystyle\text{curl}(\phi)=\phi_{p,q}+\phi_{q,r}+\phi_{r,p},curl ( italic_ϕ ) = italic_ϕ start_POSTSUBSCRIPT italic_p , italic_q end_POSTSUBSCRIPT + italic_ϕ start_POSTSUBSCRIPT italic_q , italic_r end_POSTSUBSCRIPT + italic_ϕ start_POSTSUBSCRIPT italic_r , italic_p end_POSTSUBSCRIPT , (5)

where ϕp,qsubscriptitalic-ϕ𝑝𝑞\phi_{p,q}italic_ϕ start_POSTSUBSCRIPT italic_p , italic_q end_POSTSUBSCRIPT, ϕq,rsubscriptitalic-ϕ𝑞𝑟\phi_{q,r}italic_ϕ start_POSTSUBSCRIPT italic_q , italic_r end_POSTSUBSCRIPT, and ϕr,psubscriptitalic-ϕ𝑟𝑝\phi_{r,p}italic_ϕ start_POSTSUBSCRIPT italic_r , italic_p end_POSTSUBSCRIPT are the edge flows between nodes p,q𝑝𝑞p,qitalic_p , italic_q, and r𝑟ritalic_r. This operator evaluates the total rotation around a triangular face in the graph.

Given the definitions above, we can observe that the image of the gradient operator is included in the kernel of the curl operator. This relationship leads to the Helmholtz-Hodge Decomposition theorem [3].

Theorem 2.1 (Helmholtz-Hodge Decomposition)

The space of anti-symmetric matrices, Esuperscript𝐸\mathbb{R}^{E}blackboard_R start_POSTSUPERSCRIPT italic_E end_POSTSUPERSCRIPT, can be decomposed into three mutually orthogonal subspaces:

E=Im(grad)Im(curl)Ker(grad)Ker(curl).superscript𝐸direct-sumImgradImsuperscriptcurlKersuperscriptgradKercurl\displaystyle\mathbb{R}^{E}=\text{Im}(\text{grad})\oplus\text{Im}(\text{curl}^% {*})\oplus\text{Ker}(\text{grad}^{*})\cap\text{Ker}(\text{curl}).blackboard_R start_POSTSUPERSCRIPT italic_E end_POSTSUPERSCRIPT = Im ( grad ) ⊕ Im ( curl start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ⊕ Ker ( grad start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ∩ Ker ( curl ) .

The anti-symmetric component 𝐖asubscript𝐖𝑎\mathbf{W}_{a}bold_W start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT can exhibit cyclic influence in the network, which indicates that no perfect ranking can be achieved without violating at least one mutual influence relationship within each cycle. According to the Helmholtz-Hodge Decomposition theorem, any edge flow 𝐖asubscript𝐖𝑎\mathbf{W}_{a}bold_W start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT can be uniquely decomposed as:

𝐖a=grad(s)+curl(ϕ)+𝐡,subscript𝐖𝑎grad𝑠superscriptcurlitalic-ϕ𝐡\displaystyle\mathbf{W}_{a}=-\text{grad}(s)+\text{curl}^{*}(\phi)+\mathbf{h},bold_W start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT = - grad ( italic_s ) + curl start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( italic_ϕ ) + bold_h , (6)

where grad(s)grad𝑠\text{grad}(s)grad ( italic_s ) is the gradient component capturing hierarchical relationships, curl(ϕ)superscriptcurlitalic-ϕ\text{curl}^{*}(\phi)curl start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( italic_ϕ ) is the curl component indicating local cyclic influences, and 𝐡𝐡\mathbf{h}bold_h is the harmonic component indicating non-local or global cyclic influences.

To estimate these components, we use the least squares method for the gradient and curl components, and obtain the harmonic component as the residual.

s^^𝑠\displaystyle\hat{s}over^ start_ARG italic_s end_ARG =argmins𝐖a+grad(s)F2,absentsubscript𝑠superscriptsubscriptnormsubscript𝐖𝑎grad𝑠𝐹2\displaystyle=\arg\min_{s}\|\mathbf{W}_{a}+\text{grad}(s)\|_{F}^{2},= roman_arg roman_min start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ∥ bold_W start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT + grad ( italic_s ) ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (7)
ϕ^^italic-ϕ\displaystyle\hat{\phi}over^ start_ARG italic_ϕ end_ARG =argminϕ𝐖acurl(ϕ)F2,absentsubscriptitalic-ϕsuperscriptsubscriptnormsubscript𝐖𝑎superscriptcurlitalic-ϕ𝐹2\displaystyle=\arg\min_{\phi}\|\mathbf{W}_{a}-\text{curl}^{*}(\phi)\|_{F}^{2},= roman_arg roman_min start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ∥ bold_W start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT - curl start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( italic_ϕ ) ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (8)
h^^\displaystyle\hat{h}over^ start_ARG italic_h end_ARG =𝐖agrad(s)curl(ϕ).absentsubscript𝐖𝑎grad𝑠superscriptcurlitalic-ϕ\displaystyle=\mathbf{W}_{a}-\text{grad}(s)-\text{curl}^{*}(\phi).= bold_W start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT - grad ( italic_s ) - curl start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( italic_ϕ ) . (9)

The estimated score vector s^=[s1,s2,,sN]^𝑠superscriptsubscript𝑠1subscript𝑠2subscript𝑠𝑁\widehat{s}=[s_{1},s_{2},\ldots,s_{N}]^{\prime}over^ start_ARG italic_s end_ARG = [ italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_s start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT (where N𝑁Nitalic_N is the number of nodes or time series components) derived from 𝐖asubscript𝐖𝑎\mathbf{W}_{a}bold_W start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT is used to establish the topological ranking of the EEG channels (or brain regions). By minimizing the difference between grad(s^)grad^𝑠-\text{grad}(\widehat{s})- grad ( over^ start_ARG italic_s end_ARG ) and 𝐖asubscript𝐖𝑎\mathbf{W}_{a}bold_W start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT, this ranking accurately reflects net information flow patterns while avoiding inconsistencies, as curl(grad(s^))=0curlgrad^𝑠0\text{curl}(\text{grad}(\widehat{s}))=0curl ( grad ( over^ start_ARG italic_s end_ARG ) ) = 0. If sp>sqsubscript𝑠𝑝subscript𝑠𝑞s_{p}>s_{q}italic_s start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT > italic_s start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT, the p𝑝pitalic_p-th time series tends to lead other time series more than the q𝑞qitalic_q-th time series, indicating that p𝑝pitalic_p is higher in the hierarchy. Thus, s^^𝑠\widehat{s}over^ start_ARG italic_s end_ARG captures the brain hierarchy, as illustrated in Figure 2. This is crucial for understanding brain function and the impact of dysfunctions such as seizures.

Our methodology involves a series of steps designed to rank brain regions based on their mutual influence. Initially, we employ the PCMCI method to estimate effective brain connectivity, utilizing causal inference techniques. Following this, the connectivity network is decomposed into symmetric and anti-symmetric components. We then apply Hodge Decomposition to the anti-symmetric component, extracting the gradient, which is essential for establishing the ranking. This allows us to derive the topological ranking of brain regions based on the gradient component. The entire process is visually represented in Figure 3.

Refer to caption
Figure 3: Hodge decomposition pipeline for topological ranking of brain regions.

3 Experiments

To evaluate our methodology, we propose six distinct simulation scenarios, each representing different hierarchical dependency structures. These scenarios range from linear hierarchies to complex cyclic dependencies, testing the robustness and versatility of our ranking mechanism across various topologies, as illustrated in Figure 4.

  • Scenarios 1 and 4: Exhibit a linear hierarchical dependence structure, allowing for clear node ranking (scenario 4 reverses scenario 1).

  • Scenarios 2 and 5: Feature cyclic dependence structures, challenging meaningful ranking due to non-transitivity (scenario 5 reverses scenario 2).

  • Scenarios 3 and 6: Present partial rankings with localized cycles, suggesting intermediate dependence structures (scenario 6 reverses scenario 3).

Refer to caption
Figure 4: Visual representation of the designed dependency networks for six simulation scenarios. Each arrow indicates a dependency direction at lag one.

We generate multivariate time series data using the mixture model in Equation 10, with P=5𝑃5P=5italic_P = 5 dimensions and T=30,000𝑇30000T=30,000italic_T = 30 , 000 observations, divided into 500-observation epochs to analyze temporal stability and variability in ranking. The first 5,000 observations correspond to scenario one, the next 5,000 to scenario two, and so on.

Y(t)=Φ(t)Z(t)+E(t),Φ(t)=[00.40.500000.40.500000.40.500000.400000],formulae-sequence𝑌𝑡Φ𝑡𝑍𝑡𝐸𝑡Φ𝑡matrix00.40.500000.40.500000.40.500000.400000\displaystyle Y(t)=\Phi(t)Z(t)+E(t),\quad\Phi(t)=\begin{bmatrix}0&0.4&0.5&0&0% \\ 0&0&0.4&0.5&0\\ 0&0&0&0.4&0.5\\ 0&0&0&0&0.4\\ 0&0&0&0&0\\ \end{bmatrix},italic_Y ( italic_t ) = roman_Φ ( italic_t ) italic_Z ( italic_t ) + italic_E ( italic_t ) , roman_Φ ( italic_t ) = [ start_ARG start_ROW start_CELL 0 end_CELL start_CELL 0.4 end_CELL start_CELL 0.5 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0.4 end_CELL start_CELL 0.5 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0.4 end_CELL start_CELL 0.5 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0.4 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW end_ARG ] , (10)

where Y(t)𝑌𝑡Y(t)italic_Y ( italic_t ) is the observed multivariate time series, Z(t)𝑍𝑡Z(t)italic_Z ( italic_t ) is a multivariate standard Gaussian process, and E(t)𝐸𝑡E(t)italic_E ( italic_t ) is random noise, also modeled as a standard Gaussian process. The mixing matrix Φ(t)Φ𝑡\Phi(t)roman_Φ ( italic_t ) reflects each scenario’s dependence structure.

Our method follows an epoch-based approach to dynamically estimate causal relationships within the multivariate time series data in order to capture the directional dependencies and hierarchical variations. Figure 5 showcases the causal connections identified for one epoch from scenario one, highlighting both the causal pathways and the strength of these interactions.

Refer to caption
Figure 5: Visual representation of the causal interactions within the network for scenario one as identified by PCMCI, showcasing the algorithm’s capacity to accurately capture both the direction and magnitude of causal dependencies across time lags.

By applying our proposed ranking procedure, we are able to identify and highlight nodes within the network that exert significant influence over others, providing a clear understanding of the hierarchical patterns and their dynamics, as can be seen in Figure 6. Our methodology effectively ranks the simulated time series, consistently placing node A at the top and node E at the bottom, as indicated by their respective curves, in scenario one, with the inverse pattern observed in scenario four. A node is ranked above another if its curve is consistently above the other’s. Scenarios two and five exhibit fluctuating rankings due to inherent cyclic dependencies, while scenarios three and six show partial rankings with notable positions for node D, aligning with the designed simulation scenarios.

Refer to caption
Figure 6: Aggregate causality-based ranking scores, delineating the hierarchical structure within each scenario. The distinct patterns observed across scenarios validate the robustness of our ranking methodology.

These results validate our approach for ranking multivariate time series data based on net influence, as it robustly identifies the hierarchical structures in scenarios one and three. In contrast, scenarios two and four, characterized by cyclic dependencies, exhibit inconsistent rankings. Through these simulations, we confirm that our CBTR method effectively uncovers the hierarchical structure within dependence networks, distinguishing various dependency patterns.

4 Results

In this section, we focus on applying our methodology to a dataset of neonatal EEG recordings [24]. Neonatal seizures are a critical concern within the neonatal intensive care unit (NICU), raising numerous questions about their temporal and spatial characteristics. The dataset comprises 19-channel EEG recordings from 79 full-term infants at the Helsinki University Hospital NICU, with seizure annotations provided by three expert clinicians for each recording, spanning a median duration of 74 minutes. We concentrate our study on a few subjects, specifically analyzing epochs consistently classified as either seizure or seizure-free by all three experts. Our initial goal is to explore how seizures influence effective brain connectivity in neonates, serving as a foundational step for future detailed research on the nature of seizures and their developmental effects.

In many instances, neonatal EEG recordings reveal distinct seizure patterns, as shown in Figure 7. However, deciphering the interconnections among the 19 channels, particularly the influence of seizures, presents a complex challenge. Our approach quantifies the impact of seizures on the neonatal brain’s effective connectivity at the channel level, aiming to uncover how seizures modify brain connectivity and hierarchical organization in each subject.

Refer to caption
Figure 7: Sample of EEG signals from channels F3, F7, Fz, and C4, showcasing epochs (1 second each) with and without seizure. Original EEG recordings are depicted in black, while the incremental changes between successive EEG readings are highlighted in cyan.

The EEG recordings typically exhibit smooth variations at the millisecond scale, leading to high autocorrelation in lagged observations, which can increase spurious causal relationships. To mitigate this, we analyze effective brain connectivity through successive differences or increments in EEG recordings, reducing the influence of lagged dependencies, as illustrated in Figure 7.

Following the approach presented in Section 2.1, we estimated the causal influences among brain channels for each subject across different time lags during seizure and non-seizure periods. Following the procedure in Section 2.2, we evaluate the changes in causal influence during seizure time by estimating ranking scores for all channels across 50 seizure epochs and compare them with scores from 50 seizure-free epochs. We used a t-test to determine the statistical significance of the ranking score differences. This analysis revealed significant changes in causality-based ranking scores for several channels for each examined subject, evidenced by very small p-values. For instance, in subjects 1 and 5, we highlight the EEG channels most significantly impacted by seizures. These findings, depicted in Figures 8 and 9, illuminate how seizures alter the hierarchical architecture of brain connectivity, offering deeper insights into their impacts and pinpointing the brain regions where seizures exert the greatest influence. These results demonstrate the efficacy of our CBTR approach in uncovering the nuanced dynamics of effective connectivity influenced by seizure events.

Refer to caption
Figure 8: Channels with most significant shifts in the ranking scores between seizure free epochs and seizure epochs. For subject 1.
Refer to caption
Figure 9: Channels with most significant shifts in the ranking scores between seizure free epochs and seizure epochs. For subject 5.

5 Conclusion and Discussion

In this paper, we introduced a novel framework to investigate effective brain connectivity — a domain traditionally explored through the lens of functional connectivity via persistence homology. Our proposed approach is unique in a sense that it integrates causal inference methods with Hodge decomposition, enabling the ranking of brain regions according to their mutual causal influences. This approach offers a novel perspective on the complex dynamics with effective brain connectivity.

Our simulation study demonstrates the robustness and accuracy of our methodology in uncovering and describing various hierarchical structures within multivariate dependence networks. Additionally, the application of our method to a dataset of neonatal EEG recordings highlights its practical value. Our analysis revealed novel insights into the impact of seizures on neonatal brain connectivity, specifically identifying changes in the hierarchical positions of specific brain regions in response to seizure activity.

Our future work entails deeper investigations on epileptic seizure using our proposed method. Given its potential to enhance the localization of seizure origins, this future direction holds promise for making significant contributions to seizure. By refining our understanding of seizure-induced alterations in brain connectivity, we anticipate that our methodology will offer valuable tools for clinical diagnosis and the development of targeted interventions.

References

  • [1] Adams, H., Emerson, T., Kirby, M., Neville, R., Peterson, C., Shipman, P.: Persistence images: A stable vector representation of persistent homology. Journal of Machine Learning Research 18, 1–35 (2017). https://doi.org/https://jmlr.org/papers/volume18/16-337/16-337.pdf
  • [2] Bendich, P., Marron, J.S., Miller, E., Pieloch, A., Skwerer, S.: Persistent homology analysis of brain artery trees. The Annals of Applied Statistics 10, 198–218 (2016). https://doi.org/10.1214/15-AOAS886
  • [3] Bhatia, H., Norgard, G., Pascucci, V., Bremer, P.T.: The helmholtz-hodge decomposition—a survey. IEEE Transactions on Visualization and Computer Graphics 19(8), 1386–1404 (2013). https://doi.org/10.1109/TVCG.2012.316
  • [4] Bromfield, E.B., Cavazos, J.E., Sirven, J.I.: An Introduction to Epilepsy. American Epilepsy Society (2006)
  • [5] Bubenik, P.: Statistical topological data analysis using persistence landscapes. Journal of Machine Learning Research 16, 77–102 (2015). https://doi.org/10.5555/2789272.2789275
  • [6] Carlsson, G., Zomorodian, A., Collins, A., Guibas, L.: Persistence barcodes for shapes. p. 124–135. Association for Computing Machinery (2004). https://doi.org/10.1145/1057432.1057449
  • [7] Cook, C.J., Hwang, G., Mathis, J., Nair, V.A., Conant, L.L., Allen, L., Almane, D.N., Birn, R., DeYoe, E.A., Felton, E., Forseth, C., Humphries, C.J., Kraegel, P., Nencka, A., Nwoke, O., Raghavan, M., Rivera-Bonet, C., Rozman, M., Tellapragada, N., Ustine, C., Ward, B.D., Struck, A., Maganti, R., Hermann, B., Prabhakaran, V., Binder, J.R., Meyerand, M.E.: Effective connectivity within the default mode network in left temporal lobe epilepsy: Findings from the epilepsy connectome project. Brain Connectivity 9 (2019). https://doi.org/10.1089/brain.2018.0600
  • [8] Edelsbrunner, H., Harer, J.: Persistent homology—a survey. Discrete and Computational Geometry 453, 257–282 (2008). https://doi.org/10.1090/conm/453/08802
  • [9] Edelsbrunner, H., Letscher, D., Zomorodian, A.: Topological persistence and simplification 28, 511–533 (2002). https://doi.org/doi.org/10.1007/s00454-002-2885-2
  • [10] El-Yaagoubi, A.B., Jiao, S., Chung, M.K., Ombao, H.: Spectral topological data analysis of brain signals. arXiv:2401.05343 [q-bio.NC] (2024). https://doi.org/10.48550/arXiv.2401.05343
  • [11] El-Yaagoubi, A.B., Ombao, H.: Topological data analysis for directed dependence networks of multivariate time series data. In: Research Papers in Statistical Inference for Time Series and Related Models, chap. 17. Springer (2023)
  • [12] Friston, K., Moran, R., Seth, A.: Analysing connectivity with granger causality and dynamic causal modelling. Current Opinion in Neurobiology 23(2), 172–178 (2013). https://doi.org/https://doi.org/10.1016/j.conb.2012.11.010
  • [13] Friston, K.J.: Functional and effective connectivity: A review. Brain Connectivity 1, 13–36 (2011). https://doi.org/10.1089/brain.2011.0008
  • [14] Gidea, M., Katz, Y.: Topological data analysis of financial time series: Landscapes of crashes. Physica A: Statistical Mechanics and its Applications 491, 820–834 (2018). https://doi.org/10.1016/j.physa.2017.09.028
  • [15] Jiang, X., Lim, L.H., Yao, Y., Ye, Y.: Statistical ranking and combinatorial hodge theory. Mathematical Programming 127(1), 203–244 (March 2011). https://doi.org/10.1007/s10107-010-0419-x
  • [16] Jiang, Y., Chen, D., Chen, X., Li, T., Wei, G.W., Pan, F.: Topological representations of crystalline compounds for the machine-learning prediction of materials properties. npj Computational Materials 7(28) (2021). https://doi.org/10.1038/s41524-021-00493-w
  • [17] Lee, H., Kang, H., Chung, M.K., Kim, B.N., Lee, D.S.: Persistent brain network homology from the perspective of dendrogram. IEEE Transactions on Medical Imaging 31, 2267–2277 (2012). https://doi.org/10.1109/TMI.2012.2219590
  • [18] Pearl, J.: Causality: Models, Reasoning and Inference. Cambridge University Press (2009)
  • [19] Rabadan, R., Blumberg, A.J.: Topological Data Analysis for Genomics and Evolution. Cambridge University Press (2019)
  • [20] Roebroeck, A., Formisano, E., Goebel, R.: Mapping directed influence over the brain using granger causality and fmri. NeuroImage 25(1), 230–242 (2005). https://doi.org/https://doi.org/10.1016/j.neuroimage.2004.11.017
  • [21] Runge, J.: Discovering contemporaneous and lagged causal relations in autocorrelated nonlinear time series datasets. In: Peters, J., Sontag, D. (eds.) Proceedings of the 36th Conference on Uncertainty in Artificial Intelligence (UAI). Proceedings of Machine Learning Research, vol. 124, pp. 1388–1397. PMLR (03–06 Aug 2020), https://proceedings.mlr.press/v124/runge20a.html
  • [22] Runge, J., Nowack, P., Kretschmer, M., Flaxman, S., Sejdinovic, D.: Detecting and quantifying causal associations in large nonlinear time series datasets. Science Advances 5(11), eaau4996 (2019). https://doi.org/10.1126/sciadv.aau4996
  • [23] Seth, A.K., Barrett, A.B., Barnett, L.: Granger causality analysis in neuroscience and neuroimaging. Journal of Neuroscience 35(8), 3293–3297 (2015). https://doi.org/10.1523/JNEUROSCI.4399-14.2015
  • [24] Stevenson, N.J., Tapani, K., Lauronen, L., Vanhatalo, S.: A dataset of neonatal eeg recordings with seizure annotations. Scientific Data 6, 190039 (2019). https://doi.org/10.1038/sdata.2019.39
  • [25] Weichwald, S., Peters, J.: Causality in cognitive neuroscience: Concepts, challenges, and distributional robustness. Journal of Cognitive Neuroscience 33(2), 226–247 (2021). https://doi.org/10.1162/jocn_a_01623