11email: anass.bourakna@kaust.edu.sa; hernando.ombao@kaust.edu.sa 22institutetext: University of Wisconsin-Madison, Department of Statistics, Medical Science Center 4725, 1300 University Ave, Madison, WI 53706, USA
22email: mkchung@wisc.edu
Topological Analysis of Seizure-Induced Changes in Brain Hierarchy Through Effective Connectivity
Abstract
Traditional Topological Data Analysis (TDA) methods, such as Persistent Homology (PH), rely on distance measures (e.g., cross-correlation, partial correlation, coherence, and partial coherence) that are symmetric by definition. While useful for studying topological patterns in functional brain connectivity, the main limitation of these methods is their inability to capture the directional dynamics - which is crucial for understanding effective brain connectivity. We propose the Causality-Based Topological Ranking (CBTR) method, which integrates Causal Inference (CI) to assess effective brain connectivity with Hodge Decomposition (HD) to rank brain regions based on their mutual influence. Our simulations confirm that the CBTR method accurately and consistently identifies hierarchical structures in multivariate time series data. Moreover, this method effectively identifies brain regions showing the most significant interaction changes with other regions during seizures using electroencephalogram (EEG) data. These results provide novel insights into the brain’s hierarchical organization and illuminate the impact of seizures on its dynamics.
Keywords:
Hodge Decomposition Topological Data Analysis Time Series Analysis Causal Inference Effective Brain Connectivity Seizure EEG Data.1 Introduction
Over the past two decades, Topological Data Analysis (TDA) has become a valuable tool in various fields, providing new insights into complex data structures [1, 5, 6, 8, 9]. In biology, TDA has helped clarify genetic and evolutionary processes [19], while in materials science, it has improved predictions of properties in crystalline compounds [16]. In finance, TDA has offered a novel perspective on the dynamics of financial markets during crises [14]. In neuroscience, the use of Persistent Homology (PH), a key technique within TDA, has revealed new aspects of the topological organization of brain dependence networks. By analyzing multivariate brain signals, researchers have uncovered complex patterns and structures that deepen our understanding of brain connectivity and functionality [2, 10, 17].
However, traditional uses of PH often involve Vietoris-Rips filtrations from undirected brain dependence networks. A significant limitation of this approach is its inability to capture the directionality of connectivity between nodes in a brain network [11]. It is well-established that node dependencies within a brain network exhibit asymmetry, where the influence exerted by node on node can differ significantly from the influence exerted by node on node . This asymmetry is crucial for understanding effective brain connectivity, which involves directional information flow, from one region into another, that often varies markedly in its reverse path, thus delineating specific neural pathways [12, 13, 20, 23].
In the study of epilepsy, understanding changes in brain connectivity during seizures is crucial. Research indicates that epileptic seizures is associated with significant alterations in interactions between brain regions in a network. These changes vary depending on whether the seizure is generalized or originates from a specific focal region and spreads across the brain’s hemispheres [4, 7]. Traditional TDA methods, which primarily assess functional connectivity, often fall short in accurately characterizing these dynamic changes in effective brain connectivity.
The concept of causal inference refers to learning a causal model from observational data. In cognitive neuroscience, this process is fundamental for understanding how different brain regions interact, and how these interactions may be altered due to brain disorders [25]. Experimental interventions (e.g., optogenetics) are considered the gold standard in causal discovery [18, 25], as they provide a way to experimentally manipulate activity of a neuron (or subpopulation of neurons) and investigate how these manipulations change the interaction of a neuron with another neuron (or another subpopulation of neurons). Through these experimental manipulations, one can gain definitive evidence of causal physiological relationships. While such interventions and manipulations are conducted in non-human animal studies, these are ethically and practically challenging when studying brain connectivity in humans. Consequently, we will only focus on identifying causal associations and quantifying their strength from observational data.
To gain better understanding of the dynamic changes in brain connectivity during epileptic seizures, we have developed the Causality-Based Topological Ranking (CBTR) approach. This novel method integrates Causal Inference (CI) and Hodge Decomposition (HD), allowing for the identification of causal pathways and the hierarchical ranking of brain regions based on their mutual influence over time. CBTR assesses dependencies conditioned on parent nodes, thereby avoiding confounding effects among observed variables. Our analysis of EEG data from subjects who experienced seizures helped us identify the brain regions most significantly impacted by seizures for each subject.
2 Methods
The goal is to develop a method for ranking brain regions (derived from the network of electroencehphalogram data) according to effective connectivity using a two-step process. First, causal inference techniques are employed to map the hierarchical structure of brain connectivity, with a focus on the effects of seizures. Next, Hodge Decomposition is applied to net flow of information between brain regions, quantifying their mutual influence.
2.1 Causal Inference of the Brain’s Hierarchical Structure
Understanding the dynamics of the hierarchical structure of the brain’s effective connectivity is crucial for comprehending the impact of seizures. In an EEG network, a channel is considered to be at the top of the hierarchy if it tends to exert greater influence on other nodes, whereas a brain channel is at the bottom if it tends to be influenced more by other nodes. This hierarchical structure is determined by analyzing the causal relationships and time-delayed dependencies among the channels. This is illustrated in Figure 1. Assessing changes in brain hierarchy allows us to quantify the effects of seizures on each specific EEG channel, providing deeper insights into how seizures alter brain connectivity.
To estimate effective brain connectivity, we employ the PCMCI (Peter and Clark Momentary Conditional Independence) method, a robust causal inference technique well-suited for high-dimensional time series data [21, 22]. Following the PCMCI method, we conducted a two-stage process. First, we used the PC algorithm to identify potential causal parents for each variable, reducing dimensionality and producing a preliminary causal graph. Then, we applied the Momentary Conditional Independence (MCI) test to rigorously assess these dependencies’ significance, considering both contemporaneous and time-lagged relationships while controlling for false positives. This combination of dimension reduction and rigorous testing allowed PCMCI to effectively estimate the connectivity between observed time series components, providing a detailed map of directional dependencies in the brain.
The outputs of the PCMCI method include a causal graph that highlights the most significant causal links between brain regions at various lags, with each link annotated by its corresponding statistical significance and strength. This causal graph forms the foundation for our subsequent analyses. We estimate effective connectivity as a weighted directed network, where the weight of each link () between regions and is derived from the significance level (p-value) associated with the causal interaction from to , as defined in Equations 1, 2.
(1) | ||||
(2) |
where is the p-value corresponding to the MCI test from variable to at lag . The estimated effective brain connectivity network is decomposed into symmetric () and anti-symmetric () components.
(3) |
where represents mutual causal influence and represents net causal influence. Since brain dependencies can be asymmetric, this decomposition is relevant as it separates the connectivity network into meaningful symmetric and anti-symmetric components. The symmetric component, , captures bidirectional interactions, reflecting mutual influences critical for cooperative neural processes. The anti-symmetric component, , highlights the predominant direction of causal influence, revealing the hierarchical structure and directional information flow within the brain network. This approach provides a nuanced understanding of neural activities and identifies key regions driving effective connectivity. This decomposition is attractive to neuroscientists - it is an essential tool to deepen our understanding of the hierarchical structure of the brain. The next step involves building the ranking of brain regions using Hodge Decomposition, which will allow us to quantify and rank the hierarchical influence of each region within the network.
2.2 Our Proposed Method for Ranking Brain Regions
Once effective connectivity is estimated and the net influence is derived, our proposed method will rank brain regions by assigning a score to each node or brain channel based on their influence within the network. Our method is based on the HodgeRank approach, as described in [15], which has previously been utilized to rank entities such as movies and football teams. A higher ranking score indicates a region that influences other regions more than it is influenced, while a lower score suggests a region that is more influenced by others. This concept is illustrated in Figure 2.
To achieve the most meaningful ranking (i.e., one that minimizes violations of mutual net causal influence), we apply Hodge Decomposition to the anti-symmetric component of the connectivity network. Since , is referred to as an alternating function. In the following, we first provide definitions for the gradient (grad) and curl operators, and then introduce the Hodge Decomposition Theorem.
Definition 1 (Gradient Operator)
The ’grad’ operator measures the flow of influence between nodes. For any edge , it is defined as:
(4) |
where and represent the scores or potentials of nodes and . This operator reflects the change in influence from node to node .
Definition 2 (Curl Operator)
The ’curl’ operator measures the rotation strength of mutual influence among nodes. For any face , it is defined as:
(5) |
where , , and are the edge flows between nodes , and . This operator evaluates the total rotation around a triangular face in the graph.
Given the definitions above, we can observe that the image of the gradient operator is included in the kernel of the curl operator. This relationship leads to the Helmholtz-Hodge Decomposition theorem [3].
Theorem 2.1 (Helmholtz-Hodge Decomposition)
The space of anti-symmetric matrices, , can be decomposed into three mutually orthogonal subspaces:
The anti-symmetric component can exhibit cyclic influence in the network, which indicates that no perfect ranking can be achieved without violating at least one mutual influence relationship within each cycle. According to the Helmholtz-Hodge Decomposition theorem, any edge flow can be uniquely decomposed as:
(6) |
where is the gradient component capturing hierarchical relationships, is the curl component indicating local cyclic influences, and is the harmonic component indicating non-local or global cyclic influences.
To estimate these components, we use the least squares method for the gradient and curl components, and obtain the harmonic component as the residual.
(7) | ||||
(8) | ||||
(9) |
The estimated score vector (where is the number of nodes or time series components) derived from is used to establish the topological ranking of the EEG channels (or brain regions). By minimizing the difference between and , this ranking accurately reflects net information flow patterns while avoiding inconsistencies, as . If , the -th time series tends to lead other time series more than the -th time series, indicating that is higher in the hierarchy. Thus, captures the brain hierarchy, as illustrated in Figure 2. This is crucial for understanding brain function and the impact of dysfunctions such as seizures.
Our methodology involves a series of steps designed to rank brain regions based on their mutual influence. Initially, we employ the PCMCI method to estimate effective brain connectivity, utilizing causal inference techniques. Following this, the connectivity network is decomposed into symmetric and anti-symmetric components. We then apply Hodge Decomposition to the anti-symmetric component, extracting the gradient, which is essential for establishing the ranking. This allows us to derive the topological ranking of brain regions based on the gradient component. The entire process is visually represented in Figure 3.
3 Experiments
To evaluate our methodology, we propose six distinct simulation scenarios, each representing different hierarchical dependency structures. These scenarios range from linear hierarchies to complex cyclic dependencies, testing the robustness and versatility of our ranking mechanism across various topologies, as illustrated in Figure 4.
-
•
Scenarios 1 and 4: Exhibit a linear hierarchical dependence structure, allowing for clear node ranking (scenario 4 reverses scenario 1).
-
•
Scenarios 2 and 5: Feature cyclic dependence structures, challenging meaningful ranking due to non-transitivity (scenario 5 reverses scenario 2).
-
•
Scenarios 3 and 6: Present partial rankings with localized cycles, suggesting intermediate dependence structures (scenario 6 reverses scenario 3).
We generate multivariate time series data using the mixture model in Equation 10, with dimensions and observations, divided into 500-observation epochs to analyze temporal stability and variability in ranking. The first 5,000 observations correspond to scenario one, the next 5,000 to scenario two, and so on.
(10) |
where is the observed multivariate time series, is a multivariate standard Gaussian process, and is random noise, also modeled as a standard Gaussian process. The mixing matrix reflects each scenario’s dependence structure.
Our method follows an epoch-based approach to dynamically estimate causal relationships within the multivariate time series data in order to capture the directional dependencies and hierarchical variations. Figure 5 showcases the causal connections identified for one epoch from scenario one, highlighting both the causal pathways and the strength of these interactions.
By applying our proposed ranking procedure, we are able to identify and highlight nodes within the network that exert significant influence over others, providing a clear understanding of the hierarchical patterns and their dynamics, as can be seen in Figure 6. Our methodology effectively ranks the simulated time series, consistently placing node A at the top and node E at the bottom, as indicated by their respective curves, in scenario one, with the inverse pattern observed in scenario four. A node is ranked above another if its curve is consistently above the other’s. Scenarios two and five exhibit fluctuating rankings due to inherent cyclic dependencies, while scenarios three and six show partial rankings with notable positions for node D, aligning with the designed simulation scenarios.
These results validate our approach for ranking multivariate time series data based on net influence, as it robustly identifies the hierarchical structures in scenarios one and three. In contrast, scenarios two and four, characterized by cyclic dependencies, exhibit inconsistent rankings. Through these simulations, we confirm that our CBTR method effectively uncovers the hierarchical structure within dependence networks, distinguishing various dependency patterns.
4 Results
In this section, we focus on applying our methodology to a dataset of neonatal EEG recordings [24]. Neonatal seizures are a critical concern within the neonatal intensive care unit (NICU), raising numerous questions about their temporal and spatial characteristics. The dataset comprises 19-channel EEG recordings from 79 full-term infants at the Helsinki University Hospital NICU, with seizure annotations provided by three expert clinicians for each recording, spanning a median duration of 74 minutes. We concentrate our study on a few subjects, specifically analyzing epochs consistently classified as either seizure or seizure-free by all three experts. Our initial goal is to explore how seizures influence effective brain connectivity in neonates, serving as a foundational step for future detailed research on the nature of seizures and their developmental effects.
In many instances, neonatal EEG recordings reveal distinct seizure patterns, as shown in Figure 7. However, deciphering the interconnections among the 19 channels, particularly the influence of seizures, presents a complex challenge. Our approach quantifies the impact of seizures on the neonatal brain’s effective connectivity at the channel level, aiming to uncover how seizures modify brain connectivity and hierarchical organization in each subject.
The EEG recordings typically exhibit smooth variations at the millisecond scale, leading to high autocorrelation in lagged observations, which can increase spurious causal relationships. To mitigate this, we analyze effective brain connectivity through successive differences or increments in EEG recordings, reducing the influence of lagged dependencies, as illustrated in Figure 7.
Following the approach presented in Section 2.1, we estimated the causal influences among brain channels for each subject across different time lags during seizure and non-seizure periods. Following the procedure in Section 2.2, we evaluate the changes in causal influence during seizure time by estimating ranking scores for all channels across 50 seizure epochs and compare them with scores from 50 seizure-free epochs. We used a t-test to determine the statistical significance of the ranking score differences. This analysis revealed significant changes in causality-based ranking scores for several channels for each examined subject, evidenced by very small p-values. For instance, in subjects 1 and 5, we highlight the EEG channels most significantly impacted by seizures. These findings, depicted in Figures 8 and 9, illuminate how seizures alter the hierarchical architecture of brain connectivity, offering deeper insights into their impacts and pinpointing the brain regions where seizures exert the greatest influence. These results demonstrate the efficacy of our CBTR approach in uncovering the nuanced dynamics of effective connectivity influenced by seizure events.
5 Conclusion and Discussion
In this paper, we introduced a novel framework to investigate effective brain connectivity — a domain traditionally explored through the lens of functional connectivity via persistence homology. Our proposed approach is unique in a sense that it integrates causal inference methods with Hodge decomposition, enabling the ranking of brain regions according to their mutual causal influences. This approach offers a novel perspective on the complex dynamics with effective brain connectivity.
Our simulation study demonstrates the robustness and accuracy of our methodology in uncovering and describing various hierarchical structures within multivariate dependence networks. Additionally, the application of our method to a dataset of neonatal EEG recordings highlights its practical value. Our analysis revealed novel insights into the impact of seizures on neonatal brain connectivity, specifically identifying changes in the hierarchical positions of specific brain regions in response to seizure activity.
Our future work entails deeper investigations on epileptic seizure using our proposed method. Given its potential to enhance the localization of seizure origins, this future direction holds promise for making significant contributions to seizure. By refining our understanding of seizure-induced alterations in brain connectivity, we anticipate that our methodology will offer valuable tools for clinical diagnosis and the development of targeted interventions.
References
- [1] Adams, H., Emerson, T., Kirby, M., Neville, R., Peterson, C., Shipman, P.: Persistence images: A stable vector representation of persistent homology. Journal of Machine Learning Research 18, 1–35 (2017). https://doi.org/https://jmlr.org/papers/volume18/16-337/16-337.pdf
- [2] Bendich, P., Marron, J.S., Miller, E., Pieloch, A., Skwerer, S.: Persistent homology analysis of brain artery trees. The Annals of Applied Statistics 10, 198–218 (2016). https://doi.org/10.1214/15-AOAS886
- [3] Bhatia, H., Norgard, G., Pascucci, V., Bremer, P.T.: The helmholtz-hodge decomposition—a survey. IEEE Transactions on Visualization and Computer Graphics 19(8), 1386–1404 (2013). https://doi.org/10.1109/TVCG.2012.316
- [4] Bromfield, E.B., Cavazos, J.E., Sirven, J.I.: An Introduction to Epilepsy. American Epilepsy Society (2006)
- [5] Bubenik, P.: Statistical topological data analysis using persistence landscapes. Journal of Machine Learning Research 16, 77–102 (2015). https://doi.org/10.5555/2789272.2789275
- [6] Carlsson, G., Zomorodian, A., Collins, A., Guibas, L.: Persistence barcodes for shapes. p. 124–135. Association for Computing Machinery (2004). https://doi.org/10.1145/1057432.1057449
- [7] Cook, C.J., Hwang, G., Mathis, J., Nair, V.A., Conant, L.L., Allen, L., Almane, D.N., Birn, R., DeYoe, E.A., Felton, E., Forseth, C., Humphries, C.J., Kraegel, P., Nencka, A., Nwoke, O., Raghavan, M., Rivera-Bonet, C., Rozman, M., Tellapragada, N., Ustine, C., Ward, B.D., Struck, A., Maganti, R., Hermann, B., Prabhakaran, V., Binder, J.R., Meyerand, M.E.: Effective connectivity within the default mode network in left temporal lobe epilepsy: Findings from the epilepsy connectome project. Brain Connectivity 9 (2019). https://doi.org/10.1089/brain.2018.0600
- [8] Edelsbrunner, H., Harer, J.: Persistent homology—a survey. Discrete and Computational Geometry 453, 257–282 (2008). https://doi.org/10.1090/conm/453/08802
- [9] Edelsbrunner, H., Letscher, D., Zomorodian, A.: Topological persistence and simplification 28, 511–533 (2002). https://doi.org/doi.org/10.1007/s00454-002-2885-2
- [10] El-Yaagoubi, A.B., Jiao, S., Chung, M.K., Ombao, H.: Spectral topological data analysis of brain signals. arXiv:2401.05343 [q-bio.NC] (2024). https://doi.org/10.48550/arXiv.2401.05343
- [11] El-Yaagoubi, A.B., Ombao, H.: Topological data analysis for directed dependence networks of multivariate time series data. In: Research Papers in Statistical Inference for Time Series and Related Models, chap. 17. Springer (2023)
- [12] Friston, K., Moran, R., Seth, A.: Analysing connectivity with granger causality and dynamic causal modelling. Current Opinion in Neurobiology 23(2), 172–178 (2013). https://doi.org/https://doi.org/10.1016/j.conb.2012.11.010
- [13] Friston, K.J.: Functional and effective connectivity: A review. Brain Connectivity 1, 13–36 (2011). https://doi.org/10.1089/brain.2011.0008
- [14] Gidea, M., Katz, Y.: Topological data analysis of financial time series: Landscapes of crashes. Physica A: Statistical Mechanics and its Applications 491, 820–834 (2018). https://doi.org/10.1016/j.physa.2017.09.028
- [15] Jiang, X., Lim, L.H., Yao, Y., Ye, Y.: Statistical ranking and combinatorial hodge theory. Mathematical Programming 127(1), 203–244 (March 2011). https://doi.org/10.1007/s10107-010-0419-x
- [16] Jiang, Y., Chen, D., Chen, X., Li, T., Wei, G.W., Pan, F.: Topological representations of crystalline compounds for the machine-learning prediction of materials properties. npj Computational Materials 7(28) (2021). https://doi.org/10.1038/s41524-021-00493-w
- [17] Lee, H., Kang, H., Chung, M.K., Kim, B.N., Lee, D.S.: Persistent brain network homology from the perspective of dendrogram. IEEE Transactions on Medical Imaging 31, 2267–2277 (2012). https://doi.org/10.1109/TMI.2012.2219590
- [18] Pearl, J.: Causality: Models, Reasoning and Inference. Cambridge University Press (2009)
- [19] Rabadan, R., Blumberg, A.J.: Topological Data Analysis for Genomics and Evolution. Cambridge University Press (2019)
- [20] Roebroeck, A., Formisano, E., Goebel, R.: Mapping directed influence over the brain using granger causality and fmri. NeuroImage 25(1), 230–242 (2005). https://doi.org/https://doi.org/10.1016/j.neuroimage.2004.11.017
- [21] Runge, J.: Discovering contemporaneous and lagged causal relations in autocorrelated nonlinear time series datasets. In: Peters, J., Sontag, D. (eds.) Proceedings of the 36th Conference on Uncertainty in Artificial Intelligence (UAI). Proceedings of Machine Learning Research, vol. 124, pp. 1388–1397. PMLR (03–06 Aug 2020), https://proceedings.mlr.press/v124/runge20a.html
- [22] Runge, J., Nowack, P., Kretschmer, M., Flaxman, S., Sejdinovic, D.: Detecting and quantifying causal associations in large nonlinear time series datasets. Science Advances 5(11), eaau4996 (2019). https://doi.org/10.1126/sciadv.aau4996
- [23] Seth, A.K., Barrett, A.B., Barnett, L.: Granger causality analysis in neuroscience and neuroimaging. Journal of Neuroscience 35(8), 3293–3297 (2015). https://doi.org/10.1523/JNEUROSCI.4399-14.2015
- [24] Stevenson, N.J., Tapani, K., Lauronen, L., Vanhatalo, S.: A dataset of neonatal eeg recordings with seizure annotations. Scientific Data 6, 190039 (2019). https://doi.org/10.1038/sdata.2019.39
- [25] Weichwald, S., Peters, J.: Causality in cognitive neuroscience: Concepts, challenges, and distributional robustness. Journal of Cognitive Neuroscience 33(2), 226–247 (2021). https://doi.org/10.1162/jocn_a_01623