Abstract
Parkinsonâs disease (PD) exhibits significant clinical heterogeneity, presenting challenges in the identification of reliable electroencephalogram (EEG) biomarkers. Machine learning techniques have been integrated with resting-state EEG for PD diagnosis, but their practicality is constrained by the interpretable features and the stochastic nature of resting-state EEG. The present study proposes a novel and interpretable deep learning model, graph signal processing-graph convolutional networks (GSP-GCNs), using event-related EEG data obtained from a specific task involving vocal pitch regulation for PD diagnosis. By incorporating both local and global information from single-hop and multi-hop networks, our proposed GSP-GCNs models achieved an averaged classification accuracy of 90.2%, exhibiting a significant improvement of 9.5% over other deep learning models. Moreover, the interpretability analysis revealed discriminative distributions of large-scale EEG networks and topographic map of microstate MS5 learned by our models, primarily located in the left ventral premotor cortex, superior temporal gyrus, and Brocaâs area that are implicated in PD-related speech disorders, reflecting our GSP-GCN modelsâ ability to provide interpretable insights identifying distinctive EEG biomarkers from large-scale networks. These findings demonstrate the potential of interpretable deep learning models coupled with voice-related EEG signals for distinguishing PD patients from healthy controls with accuracy and elucidating the underlying neurobiological mechanisms.
Similar content being viewed by others
Introduction
Parkinsonâs disease (PD) is a neurodegenerative disorder that exerts a profound impact on the quality of life for 7â10 million people worldwide1,2. It is characterized by progressive and diverse symptoms that involve both motor and non-motor impairments. However, the pathogenic mechanism of PD remains poorly understood, with only 20% of cases being attributed to specific genetic factors3,4. Therefore, the precise and early diagnosis of PD continues to present considerable challenges, as this holds significance for effective clinical management.
One promising avenue for PD diagnosis lies in the identification of reliable biomarkers across various behavior domains, including handwriting patterns5, motor function6, gait patterns7, and speech characteristics8. Of particular interest, resting-state electroencephalography (EEG) has emerged as a potential diagnostic tool for PD diagnosis due to its noninvasiveness, cost-effectiveness, and ability to capture brain activity with high-temporal resolution9,10,11. Quantitative EEG (QEEG) measures, including power spectral density10,11 and spatiotemporal microstates9, have been extracted as distinctive features to distinguish PD patients from healthy individuals. More recently, an increasing number of studies on PD diagnosis have shifted towards integrating deep learning techniques with large-scale EEG networks12,13. For example, Oh et al. 14 utilized a thirteen-layer convolutional neural network (CNN) for identifying resting-state EEG data from PD patients, achieving a remarkable classification accuracy (ACC) of 88.25%. Chaturvedi et al. 15 integrated resting-state EEG parameters with least absolute shrinkage and selection operator (LASSO) and achieved an area under the curve (AUC) of 0.76 in PD diagnosis. These methods, however, rely heavily on their assumption of stationarity and integrability of the EEG signals9, which may not be valid given the dynamic nature of PD-related changes in brain activity. Therefore, the need to capture the stable and time-varying patterns, which cannot be adequately addressed by the stochastic resting-state EEG signals, arises as a challenge to extract discriminative features for distinguishing PD patients from healthy individuals.
In contrast to resting-state EEG signals, task-related EEG signals exhibit phase- and time-locked responses to motor and non-motor tasks and their functional networks/connectivity. This aspect offers valuable insights into PD-related alterations in neural activity and extraction of distinct EEG features between PD patients and healthy controls. A particular area of interest lies in motor speech disorders, which affect approximately 90% of PD patients and are considered as one of the premotor symptoms16,17. Previous studies have demonstrated that PD patients are impaired in sensorimotor control of vocal production18,19,20,21,22, which is manifested as enhanced event-related potential (ERP) P2 responses to pitch perturbations in voice auditory feedback23,24,25. This observation suggests the potential of machine learning-based extraction of salient features from voice-related EEG signals to obtain robust biomarkers for PD diagnosis. Notably, Shi et al. 26 developed four deep learning architectures, namely the CNN, the Recurrent Neural Network (RNN) and two hybrid models (2D-CNN-RNN and 3D-CNN-RNN), for PD diagnosis based on voice-related EEG signals. Their findings showed that the hybrid models outperformed the conventional models, achieving accuracies of 82.89% (3D-CNN-RNN), 81.13% (2D-CNN-RNN), 80.89% (CNN), and 76.00% (RNN). This study provides preliminary evidence for the feasibility of deep learning models with task-related EEG signals to distinguish PD patients from healthy individuals.
Nevertheless, the complexity of deep learning models has raised concerns regarding their interpretability, particularly in understanding what features they can learn from EEG signals and how they relate to the clinical characteristics of PD patients. Therefore, there is a growing demand for enhancing the interpretability of deep learning methods27,28,29. One promising interpretable model is the multi-category mutual verification30, which integrates graph learning and neurophysiological information models. The graph convolutional network (GCN), a key component of the graph learning model, has achieved strong performance in diagnosing neurodegenerative diseases using neuroimaging data31. The neurophysiological information model involves large-scale neural networks for EEG microstate analysis, which has been successfully used for PD diagnosis with resting-state data9. However, no studies have yet combined these two methods to enhance model interpretability in PD diagnosis using task-related EEG signals.
To this end, the present study proposes a novel knowledge-guided framework (see Fig. 1), graph signal processing-graph convolutional networks (GSP-GCNs), to distinguish PD patients from healthy individuals using large-scale EEG network obtained from a specific task involving the manipulation of auditory feedback during vocal production. This framework consists of four sequential components: GSP, the graph-network module, the classifier, and the interpretable model. Firstly, the GSP module analyzes and processes the large-scale EEG networks to identify dynamic connectivity patterns. Subsequently, the graph-network module captures these connectivity patterns as key features for classification. The classifier component then utilizes these extracted features to discriminate PD patients from healthy individuals. Lastly, the interpretable model is incorporated to enhance the interpretability of the framework by providing a global visualization of essential learned features of the model and aligning them with voice-related EEG microstates characteristics. By adopting this innovative approach, our GSP-GCNs framework aims to provide illustrative information for facilitating the use of the deep learning model in PD diagnosis with task-related EEG data.
The GSP module is responsible for acquiring collaborative graph information of both single-hop and multi-hop networks from the EEG signals, while the GCNs module automatically learns the graph structure by aggregating the information of neighboring nodes. INA incomplete network alignment, GCN graph convolutional network, GSP graph signal processing.
Results
Comparison of classified performance
The present study proposed four graph-network-based models: PCC+GCNs, PLV+GCNs, PCCâ+âGSP-GCNs, and PLVâ+âGSP-GCNs. The PCCâ+âGSP-GCNs model constructs the brain network using PCC features and employs the GSP-GCNs model as the classifier. The PLVâ+âGSP-GCNs model is similar to the PCCâ+âGSP-GCNs model, except that it uses PLV features to construct brain networks. In contrast, the PCC+GCNs and PLV+GCNs models do not include the GSP module based on the aforementioned methods.
Table 1 provides the classification performance achieved by our proposed models and baseline models based on voice-related EEG signals, and Fig. 2 shows the results of non-parametric Mann-Whitney tests with Bonferroni correction that compare the classification performance (ACC and AUC) of these models in 5-fold cross validation. Our PCCâ+âGSP-GCNs model outperformed the EEGNet model significantly, with an accuracy increase of 7.9% (82.3% vs. 90.2%), an AUC increase of 7.9% (81.2% vs. 89.1%), a sensitivity boost of 2.5% (81.5% vs. 84.0%), and a specificity enhancement of 8.8% (79.6% vs. 88.4%). As well, our proposed graph-based models exhibit superior performance compared to previous methods that did not use graph learning26. Specifically, the PLVâ+âGSP-GCNs model improved those metrics by 8.2%, 7.7%, 3.6%, and 10.2%, compared to the 3D-CNN-RNN model. In addition, Table 1 shows the computational complexity of our proposed models and baseline models. The computational complexity of the GCNs model is 2*O(nlogn)+O(n2)32,33 while the computational complexity of the baseline models ranges from O(n·d2)+O(n2) to 3*O(k·n·d2)+2*O(n·d2)+2*O(n2)26,34,35,36, where k denotes the convolution kernel size (greater than 3), d denotes the time series length (dâ=â700), and n denotes the number of channels (nâ=â64). Accordingly, our GCNs model shows a considerably lower computational complexity than all baseline models.
The left and right panels show the resutls of non-parametric Mann-Whitney tests comparing the GCNs model and conventional deep learning models in 5-fold cross validation for classification performance corresponding to ACC and AUC, respectively. Each cell in the heatmap represents the statistical significance of differences between the models, with light blue indicating no significance, red indicating a significant difference, and dark red denoting a higher level of significance.
Notably, Table 1 and Fig. 2 also show a comparison of classification performance across our proposed models. The GSP-GCNs models exhibited significantly superior performance compared to the GCNs models, regardless of the feature type. For example, the PCCâ+âGSP-GCNs model achieved higher ACC, AUC, sensitivity, and specificity than the PCC+GCNs model by 6.1% (84.1% vs. 90.2%), 6.0% (83.1% vs. 89.1%), 5.8% (78.2% vs. 84%), and 2.4% (86.0% vs. 88.4%), respectively. Similarly, the PLVâ+âGSP-GCNs model improved these metrics by 4.1% (84.8% vs. 88.9%), 5.3% (82.2% vs. 87.5%), 7.0% (79.l% vs. 86.l%), and 1.9% (84.3% vs. 86.2%) compared to the PLV+GCNs model. Furthermore, we employed a cross-validation approach to verify the out-of-distribution detection capability and stability of the GCNs model. Figure 3 indicates a consistently low variance of receiver operating characteristic (ROC) values, all remaining below 0.08 for our proposed models. These results highlight the potential of GSP processing in enhancing the classification performance by balancing the local and global information of single-hop and multi-hop networks through graph aggregation.
Panels aâd show the ROC curves for the PCC+GCNs, PLV+GCNs, PCCâ+âGSP-GCNs, and PLVâ+âGSP-GCNs models, respectively. The solid blue line represents the average performance across different test sets using 5-fold cross-validation, while the other colored lines show the performance of individual test set. The gray shaded area indicates the variance range of the average ROC curve.
Interpretable GSP-GCN model
The interpretability of the GSP-GCNs framework was assessed in the present study, aiming to capture global frequency-spatial-temporal dependencies in the large-scale EEG network and extract essential information for decoding tasks from the time series data. We used a modified CAM method to visualize the global representation learned by our models and generate their saliency maps in the context of vocal pitch regulation. Figure 4 illustrates the discriminative distributions of large-scale EEG networks obtained from the FAF task and learned by the PCC+GCNs and the PCCâ+âGSP-GCNs models. The most significant discriminative distributions were located in the left ventral premotor cortex (vPMC), superior temporal gyrus (STG), and Brocaâs area. These results suggest that the GSP-GCNs models have the capability to capture the intrinsic representation of the major brain activity difference during vocal motor control between PD patients and healthy controls.
The numbers ranging from 1 to 64 correspond to the electrodes according to the international EEG 10-20 system. Panels aâd show the distribution of the contribution network for the PCC+GCNs model, the PCCâ+âGSP-GCNs model, the PLV+GCNs model, and the PLVâ+âGSP-GCNs model, respectively. Nodes with a redder hue indicate greater contributions to the model, and edges with a blacker hue indicate higher contributions.
Moreover, we performed microstate analysis on the voice-related EEG signals from PD patients and healthy controls and identified five distinct microstates (MS1-MS5) with high energy fluctuation in the time frame of 0â300âms (see Fig. 5). The most prominent microstate was the MS5, located in the electrodes near the left vPMC and Brocaâs area in the time frame of 205â315âms. This temporal correspondence aligns with the large-scale EEG networks associated with the P2 component. The microstate transitioned from MS5 to MS2 during the 260â300âms period of P2, with MS2 located near the right STG. The network identified in the MS5 microstate resembles the discriminative distributions learned by the GSP-GCNs model, further strengthening the interpretability of our proposed models.
Different colors represent different microstates. Five microstates clustered by k-means algorithm are shown in the first row, average time potential fluctuations with emphasis on P2 component are shown in the second row, global field power is shown in the third row, and functional activity distributions within microstates and EEG networks associated with P2 component are shown in the bottom. GFP global field power, MS microstate.
Discussion
The present study proposed a novel, interpretable deep learning framework based on GSP-GCNs to distinguish from PD patients from healthy controls using voice-related EEG signals. By incorporating both local and global information from single-hop and multi-hop networks, our GSP-GCNs model achieved an averaged classification accuracy of 90.2% and exhibited a performance improvement of 9.5% compared to other deep learning models. Moreover, the interpretability analysis of our GSP-GCNs models revealed discriminative distributions of large-scale EEG networks and topographic map of the microstate MS5 in the P2 time window, primarily located in the left vPMC, STG, and Brocaâs area that have been implicated in PD-related motor speech disorders. Overall, our proposed GSP-GCNs models offer a valuable tool for PD diagnosis based on the interpretable results derived from large-scale voice-related EEG networks.
A growing body of studies has concentrated on the development of PD diagnosis by integrating resting-state EEG with machine learning techniques12,13,14,37,38,39. Recently, several deep learning models have been proposed for PD diagnosis using different resting-state EEG datasets (e.g. the public UNM dataset, public SanDiego dataset). For example, Oh et al. 14 proposed a thirteen-layer CNN model while Lee et al. 39 proposed a hybrid CNN-RNN model, achieving classification accuracies of 88.25% and 99.2% in distinguishing PD patients form healthy controls, respectively. In addition, Shah et al. 38 developed a dynamical system generated hybrid network (DGHNet) for successful categorization of on-medication vs. off-medication PD patients with a classification accuracy of 99.2%. Nevertheless, the practicality of these models is constrained by the prolonged training period, the large datasets requirement, and the lack of interpretability of specific EEG features in PD diagnosis. Furthermore, the stochastic nature of resting-state EEG poses challenges in capturing the distinct patterns of PD-related brain activity.
In attempts to address these limitations, Shi et al. 26 developed four deep learning architectures (i.e. CNN, RNN, 2D-CNN-RNN, 3D-CNN-RNN) using voice-related EEG signals during vocal pitch regulation for PD diagnosis and achieved classification accuracies ranging from 76.00% to 82.89%. Hassin-Baer et al. 40 applied machine learning models for the diagnosis of early-stage PD using the event-related EEG signals during visual Go/No-Go and auditory Oddball cognitive tasks, achieving an AUC of 0.79 and identifying a total of 15 EEG features. In the present study, our proposed the GSP-GCNs models used voice-related EEG signals similar to Shi et al. 26 and achieved classification accuracies ranging from 84.1% to 90.2%, indicating a remarkable performance superiority over those models proposed by Shi et al. 26. Furthermore, our PCCâ+âGSP-GCNs model also outperformed other models (see Table 1 and Fig. 2), including the EEGNet model, that do not use graph learning. Taken together, these findings demonstrate the effectiveness of deep learning models with graph learning and voice-related EEG signals for PD diagnosis.
Our proposed GSP-GCNs models, integrated with voice-related EEG, present a significant advancement over other deep learning models for PD diagnosis in several aspects. A significant strength lies in their capacity to combine the global and local properties of brain functional networks using single-hop and higher-order networks, enabling the aggregation of information from the nodes of the functional brain networks and the extraction of features for graph learning. Another strength is the use of GCN models for graph data, which have advantages over other convolutional neural networks, such as capturing complex node relationships, adapting to different graph properties, and handling large-scale network data. In contrast, the CNN-based approaches only focus on local features due to their limited perceptual field but ignore the global correlations among EEG signals. Similarly, the CNN-RNN approaches only capture the sequential relationships of EEG signals while overlooking the relevance of the EEG connectivity network. On the other hand, resting-state EEG signals are stochastic in nature and fail to capture the functional specificity of brain activity related to PD when compared to task-related EEG signals. The present study used a task that involves auditory-motor integration for vocal pitch regulation, which reveals the neural processes involved in the coordination of sensory and motor systems for speech production. These processes have been demonstrated to be impaired in PD21,22,23,24,25. Therefore, our GSP-GCNs models provide a novel way for PD diagnosis using voice-related EEG signals and deep learning techniques, which is more reliable and feasible than previous methods.
More importantly, the analysis that revealed comprehensive global frequency-spatial-temporal dependencies within the extensive EEG network demonstrated the interpretability of our proposed models, providing insights into how our models learn from voice-related EEG signals and how they relate to the neural processes underlying impaired auditory-vocal integration in PD. Upon visualizing the overall representation acquired by our models and their corresponding saliency maps during vocal pitch regulation, the most prominent discriminatory distributions were located in the left vPMC, STG, and Brocaâs area. These regions have been well-established as integral to speech motor control41,42,43,44,45,46. This consistency suggests that our GSP-GCNs models capture distinct patterns of brain activity during vocal motor control between PD patients and healthy controls.
EEG microstate analysis has been proven valuable in the diagnosis of Alzheimerâs disease47 and schizophrenia13 by revealing the spatiotemporal dynamics of brain activity. In the present study, the microstate analysis of voice-related EEG signals identified five distinct microstates (MS1-MS5), with MS5 being of particular significance. Our observation of distinct MS5 microstate between PD patients and healthy controls is in line with previous studies that have shown associations of specific EEG microstates with vocal tract muscles and motor cortex activity48,49 as well as speech fluency in PD patients50. MS5 corresponds to the large-scale EEG networks that align with the P2 component (205â315âms). This ERP component has been thought to reflect a complex stage of auditory-motor transformation for controlling vocal production that demands higher-level cognitive processing51,52. In particular, enhanced P2 responses to vocal pitch perturbations have been linked to impaired auditory-vocal integration in PD patients when compared to healthy controls23,24,25. Notably, the spatial localization of MS5 coincided with the left vPMC and Brocaâs area (see Fig. 4), regions that have been recognized as significant contributors to enhanced P2 responses during vocal pitch regulation in PD patients compared to healthy controls23. More interestingly, this network representation of MS5 microstate also showed a resemblance to the discriminative distributions learned by the GSP-GCNs model, lending further support to the interpretability of our proposed models to unravel specific brain patterns during speech tasks relevant to PD. Therefore, this interpretability analysis highlights the novelty and significance of our approach in facilitating a more profound understanding of the neural mechanisms underlying PD. Such interpretability is crucial27,28, ensuring that the models are not just âblack boxesâ but provide meaningful insights into the neural dynamics underpinning PD.
The proposed GSP-GCNs models have important clinical implications for PD diagnosis. They offer the potential to improve the diagnostic accuracy by extracting interpretable features from large-scale voice-related EEG networks. They also contribute to reducing the subjective bias and variability across patients, thereby promoting a more objective and consistent assessment of PD. Moreover, they hold the promise to facilitate the treatment of speech impairment in PD by modulating activity of the brain regions observed in microstate analysis through the use of non-invasive brain stimulation techniques such as transcranial magnetic or electrical stimulation25,53,54,55. Therefore, our GSP-GCNs models not only represent an advancement in PD diagnosis but also may pave the way for effective treatment approaches.
Online detection of clinical diseases is a challenging task that requires high accuracy and low latency for brain signals. However, most existing EEG systems for PD diagnosis are based on offline analysis of resting-state data, which suffers from high noise and low stability in real-time EEG signals due to factors such as subject concentration and device inconsistencies. Previous studies have demonstrated the effectiveness and adaptability of GCN models in real-world scenarios across various domains, such as online recommender systems56, traffic flow prediction57, and online animal tracking58. The present study proposed a hybrid approach that combined the computing capabilities of offline systems with the real-time responsiveness of online systems based on the voice-related EEG signals. In this framework, the offline system constructed the large-scale EEG networks to extract the features of specific ERP components, while the online system applied the GSP-GCNs model to enable real-time evaluation. This strategy achieved a balance between high accuracy and low latency in classification, allowing us to detect unique patterns within task-related EEG signals for online PD diagnosis.
This claim is supported by a posteriori verification of the proposed models (see Supplementary Fig. 1), which compares the performance of different deep learning models based on the random selection of trials from a pool of 100 trials for a voting selection process. Remarkably, the post-validation performance of our models exhibits only slight declines (4.1â4.9%) when compared to the results presented in Table 1. In contrast, previous studies59,60 have reported that online detection systems based on the brain signals typically experience an approximate 10% reduction in accuracy compared to their offline counterparts. Moreover, the GSP-CNSs models outperform both conventional deep learning models and previous EEG-based online detection systems59,60 in stochastic posterior performance and computational complexity (see Table 1 and Fig. 2). Therefore, our proposed models have high potential for online detection scenarios due to their high accuracy, low computational complexity, and objective evaluation.
There are several limitations in the present study that warrant further investigation. First, our sample size was relatively small due to difficulties in obtaining task-related EEG data under controlled experimental conditions and specialized equipment requirements, which may limit the generalizability and robustness of our models. Second, our models performed the connectivity analysis of large-scale EEG networks at the electrode level, which cannot provide precise anatomical sources that generate the EEG signals. Incorporating source localization analysis for constructing EEG connectivity network is therefore needed in future studies to provide a more accurate representation of functional interactions within the brain. Also, the exclusive use of GCN models in the present study may not be suitable for all types of graph data. Future studies should consider alternative methodologies (e.g. graph attention networks, graph neural ordinary differential equations) for graph learning. Lastly, our models focused on a specific task that involves vocal motor control for PD diagnosis. Other tasks, such as cognitive control, may reveal different brain activity patterns and network dynamics that can be informative for PD diagnosis.
In conclusion, the present study proposed a novel deep learning framework based on GSP-GCNs for PD diagnosis using voice-related EEG signals. Our models can capture the global frequency-spatial-temporal dependencies among large-scale EEG networks, achieving a remarkable 90.2% classification accuracy and outperforming other deep learning models by 9.5%. Moreover, our models revealed the discriminative distributions of large-scale EEG networks and topography of microstate MS5 for PD diagnosis in terms of interpretability. These findings highlight the promise of interpretable deep learning models with task-related EEG signals in advancing PD diagnosis.
Methods
Task-related EEG Dataset
Fifty-two patients diagnosed with idiopathic PD (24 females and 28 males; mean ageâ=â64.23â±â5.30 years) according to the diagnostic criteria of the UK Parkinsonâs disease Society Brain Bank61 and forty-eight sex- and age-matched healthy controls (HC) (23 females and 25 males; mean ageâ=â63.37â±â5.41 years) participated in this study. All of them were right-handed, native Mandarin speakers. PD patients in the present study met the following inclusion criteria: no more than mild dementia (Mini-Mental State Examination [MMSE]â>â26), no other neurological disease, no history of neurosurgical treatment, laryngeal surgery or pathology, swallowing disorders. PD patients were kept on their antiparkinsonian medication, but they were tested during their off-medication state (i.e. 12âh off anti-PD medication). All participants provided informed consent, and the research protocol was approved by the Institutional Review Board of The First Affiliated Hospital at Sun Yat-sen University in accordance with the Code of Ethics of the World Medical Association (Declaration of Helsinki).
Task-related EEG data were acquired during a task based on the frequency altered feedback (FAF) paradigm62. In brief, participants were instructed to produce a sustained vowel sound (/u/) for a duration of 5â6âs while hearing their voice unexpectedly pitch-shifted downwards by 200 cents (100 cents = one semitone) for a duration of 200âms. Each vocalization consisted of 4â5 perturbations that were presented in a pseudorandomized manner. Participants produced 20â25 consecutive vocalizations, resulting in a total of 100 trials. More details regarding experimental designs for the FAF tasks can be found in previous studies23,25.
While participants performed the vocal production experiment, the EEG signals were scalp-recorded using a 64-electrode Geodesic Sensor Net connected to a Net Amps 300 amplifier (Electrical Geodesics Inc.) at a sampling frequency of 1âkHz using NetStation software (v.4.5, Electrical Geodesics Inc.). During the offline analysis, the EEG signals were band-filtered (1â20âHz) and segmented with a window of â200âms before and 500âms after the perturbation onset. An artifact detection procedure was applied to the segmented epochs to exclude those bad trials from further analysis. And artifact-free trials were re-referenced to the average of the electrodes on each mastoid, averaged, and baseline-corrected to generate an overall cortical ERP response.
System framework
Figure 1 shows the overall framework of knowledge-guide graph convolutional networks, which consists of four modules: EEG network, GSP, GCN, and interpretable module. Firstly, the EEG network module constructs a graph of dynamic brain activity using electrodes as nodes and functional connections between electrodes as edges63,64. Subsequently, the GSP module applies a strategy based on incomplete network alignment (iNEAT)65 and Sparse Graph66 to reorganize the local and global information in the EEG network. Next, the GCNs module learns graph representations for personalized diagnosis by aggregating information from neighboring nodes, capturing intrinsic features from the complex EEG network for PD diagnosis. In addition, the interpretable module uses a modified saliency map derived from the backpropagation algorithm67 to visualize the prominent EEG network in each individual pattern and then compares it with the large-scale EEG network obtained from microstate analysis. The details of each module are described below.
EEG network definition
The EEG network is modeled by a graph G(V,E), where vi represents the ith channel of EEG in a node set V while eij represents the strength of functional connection between nodes vi and vj in an edge set E. Specifically, a single-channel EEG data records a sequence of time-series riâ[1,n]=[si1,si2,â¦,sik]âRk, where kâ=âT xf denotes the number of time points (f represents the frequency range, and T represents the consecutive order of time series). The EEG signal can be represented as a tensor XT =[r1;r2;â¦;rn]âRnÏk, and \({\boldsymbol{G}}\)(V,E) can be represented as an adjacency matrix. Functional connectivity matrices (V) are calculated using Pearson Correlation Coefficient (PCC) or Phase Locking Value (PLV). The PCC between brain signals in different channels is defined by Eq. (1), where \({{\boldsymbol{r}}}_{{\boldsymbol{i}}}^{{{\boldsymbol{f}}}_{{\boldsymbol{p}}}}({\boldsymbol{t}})\) corresponds to the low energy fluctuation within the frequency band fp (4â12âHz) from the ith electrode of the EEG signals.
PLV, defined in Eq. (2), measures the synchronization between phases of brain regions.
\(\Delta {\varnothing }_{{\boldsymbol{r}}}\left({\boldsymbol{t}}\right)\) is the instantaneous phase calculated by Hilbert transform from the original signal \({{\boldsymbol{r}}}^{{\boldsymbol{f}}}({\boldsymbol{t}})\).
The traditional GCNs focus on the local network structure rather than the global network distribution. To overcome this limitation, the present study incorporated a GSP method, including the iNEAT algorithm and Sparse Graph, into the GCNs. This integration allows us to capture both the local and global information within the EEG network. The iNEAT component selects edge features based on the hub and link properties of nodes, while the sparse graph component adjusts node weights according to edge weights. These methods enable a reorganization of the EEG network to obtain a new network that contains important nodes and edges while reducing or eliminating noisy ones. This reorganization is an important step in the GSP process for enhancing the data quality, as evidenced in previous studies on graph learning68,69. Although the iNEAT algorithm integrates both the local and global information within the large-scale EEG network, it may result in an issue of over completeness in fusion information. To overcome this limitation, the present study introduced the Sparse Graph operation, which performs sparse decomposition and dictionary generation on the graph signal, to remove redundant information.
Two graph networks, G1 from a single-hop network (kâ=â1) and G2 from a multi-hop network (kâ=â8), were generated for each individual using the k-nearest neighbor method65. The adjacency matrices A1 and A2 were calculated to obtain the permutation matrix S. To optimize the global graph information of the single-hop network, a graph matching-based method was proposed to solve a Non-deterministic Polynomial problem according to the principle of topology consistency70,71.
where \({{||}\cdot {||}}_{F}\) is the Frobenius norm of the corresponding matrix.
The random walk-based method was used to capture collaborative graph information from both single- and multi-hop networks based on the Kronecker product graph72,73.
where \({\boldsymbol{h}}\) is the vectorization of the prior similarity matrix H via the sine function between mean adjacency matrices in different groups, and \(\otimes\) is the operation of Kronecker product graph.
The iNEAT algorithm, which combines the strengths of the graph matching-based method and the random walk-based method65, was used in the present study. This algorithm effectively integrates the graph structure information from both single- and multi-hop networks within large-scale EEG networks. The optimization objective function is defined as follows:
where \({\boldsymbol{s}}\) represents the vectorization of the similarity matrix S. \({\boldsymbol{D}}={{\boldsymbol{D}}}_{{\bf{1}}}\otimes {{\boldsymbol{D}}}_{{\bf{2}}}\) and \({{\boldsymbol{D}}}_{{\bf{1}}}\) and \({{\boldsymbol{D}}}_{{\bf{2}}}\) are the diagonal degree matrix corresponding to \({{\boldsymbol{A}}}_{{\bf{1}}}\), and \({{\boldsymbol{A}}}_{{\bf{2}}}\). In addition, a permutation matrix was used to reorganize the EEG signals through channel-wise operations at the individual level.
The Sparse Graph operation, defined in Eq. (6), aims to enhance the discriminative ability between groups and reduce the standard deviation across the trials. In Eq. (6), the parameter of \({{\boldsymbol{x}}}_{i}^{e}\) represents the EEG signal of eth electrode from the ith subject, while the graph matrix \({{\boldsymbol{w}}}_{i}^{e}\) is calculated through the iNEAT algorithm to learn the collaborative graph information.
Where \({{\boldsymbol{B}}}_{g}^{e}=[{{\boldsymbol{b}}}_{g1}^{e},\ldots ,{{\boldsymbol{b}}}_{{gi}}^{e},\ldots ,{{\boldsymbol{b}}}_{{gn}}^{e}]\) is a weighting matrix with elements being \({{\boldsymbol{b}}}_{{gi}}^{e}=[{b}_{{gi}}^{e,1},\ldots ,{b}_{{gi}}^{e,e-1},{b}_{{gi}}^{e,e+1},\ldots ,{b}_{{gi}}^{e,E}]\). Similarly, the matrix \({{\boldsymbol{B}}}_{t}^{e}\) is defined. \({s}_{{gij}}^{e}\) and \({s}_{{tij}}^{e}\) denote the similarity between ith and jth subject from different groups.
Graph Convolutional Networks
In contrast to the spectrum-based approach based on the CNNs, the GCNs use the graph structure to intelligently aggregate information from neighboring nodes. Spectral analysis of graph signals decomposes EEG signals into multi-frequency graph modes and identifies the distribution of EEG signals using Laplacian maps from the spatial domain to the spectral domain. Laplacian operator, denoted as L, is defined as follows:
where \({\boldsymbol{D}}\) is the diagonal degree matrix, and \({{\bf{I}}}_{{\boldsymbol{N}}}\) is an identity matrix of size N.
The graph signal can be defined through the Fourier transform74.
where \({\boldsymbol{U}}=\left[{{\boldsymbol{u}}}_{{\bf{0}}},\ldots ,{{\boldsymbol{u}}}_{{\boldsymbol{N}}-{\bf{1}}}\right]\in {{\mathbb{R}}}^{{\boldsymbol{N}}\times {\boldsymbol{N}}}\) is calculated through the eigenvector decomposition of L.
In contrast to traditional methods that calculate a weighted sum of spatial neighbors in the Euclidean space, the ChebNet employed in the present study applied graph filters to generate a linear combination of graph Fourier modes across different frequencies. The convolution operation between two graph signals x and y can be expressed through the graph \({\mathscr{* }}{\mathcal{g}}\):
where \({\odot}\) represents the element-wise Hadamard product.
Since \({\bf{L}}={\boldsymbol{U}}{{\wedge }}{{\boldsymbol{U}}}^{{\boldsymbol{T}}}\) and \({{\wedge }}={\rm{diag}}\left(\left[{ \leftthreetimes }_{0},\ldots ,{ \leftthreetimes }_{N-1}\right]\right)\), we defined \({{\rm{g}}}_{\theta }\) as the filter with parameter θ. The filtering process of signal x can be expressed as:
To avoid calculating the spectral decomposition of the graph Laplacian, the ChebNet model used a truncated expansion of the Chebychev polynomials74.
where \({{\bf{L}}}^{\sim }\) is scaled Laplacian: \({{\bf{L}}}^{\sim }={\bf{2}}{\bf{L}}/{ \leftthreetimes }_{{\boldsymbol{max }}}-{{\boldsymbol{I}}}_{{\boldsymbol{N}}}\).
Interpretable GSP-GCNs model
The present study used two methods to interpret the essential features derived from the GSP-GCNs model, including the examination of the features learned by the GSP-GCNs model and an analysis of the features of the large-scale EEG network through microstate analysis. The GSP-GCNs model was designed to capture global temporal dependencies within EEG data, enabling the identification of crucial information for decoding tasks from time series. While topography and Gradient-weighted Class Activation Mapping (CAM)75 have been used to reveal the global representation learned by deep learning models in motor imagery dataset12, the visualization of the gradients using the CAM often suffers from high levels of noise. To address this issue, the present study used a deconvolution approach that suppresses the flow of gradients through neurons30. Specifically, for a given layer l in the graph signal \({{\rm{\chi }}}^{l}\) and its gradient \({R}^{l}\), the overwritten gradient \(\nabla {{\rm{\chi }}}^{l}{R}^{l}\) can be calculated as follows:
To generate the saliency map, the present study started from the output layer of a pre-trained model and propagated the gradients at each layer using the chain rule until reaching the input layer. This process allows us to visualize the salient regions that significantly contribute to the modelâs predictions, providing valuable insights into the interpreted features.
The microstate analysis of ERPs aims to investigate whether the GSP-GCNs model can capture distinct neural representations between PD patients and healthy controls when they produce vocal compensations for pitch feedback perturbations. Microstate analysis is a method that assesses the functional dynamics of large-scale brain networks by identifying the stable topographic patterns of the EEG signals47,50. This analysis was performed using the MNE-python toolbox (https://mne.tools/stable/index.html), which involves calculating the global field power (GFP) of each participantâs ERP followed by clustering the topographic map underlying the GFP using the k-means algorithm from the sklearn toolbox. The Krzanowski-Lai criterion76,77 was used to determine the optimal number of microstate classes due to its suitability for selecting topographic map classifications based on quality indicators and global explained variance.
Experimental design
The GSP-GCNs model was implemented using the Pytorch toolkit with a 5-fold cross-validation strategy (https://github.com/ShuzhiZhao/ERP_GCN). The model parameters were optimized using the Adam optimizer with gradient descent and the cross-entropy loss function. The network had three GCN layers (two hidden layers and one fully connected layer) and a learning rate of the network was 10â5. Table 2 shows the training parameters of the GCN model according to the Chebyshev polynomial order of each layer. A dropout rate of 0.35 was applied to prevent overfitting. The final output of the GSP-GCNs model was an M-dimensional vector obtained through the Softmax function, where M represents the number of EEG categories. The cross-entropy loss function, as defined in Eq. (15), was used to evaluate the model performance, where y and yË represent the ground truth and predicted label, respectively. Nb denotes the number of trials in a batch.
To evaluate our proposed method, we compared it with several state-of-the-art approaches and performed ablation studies to show the impact of GSP. These baseline models used for comparison were Hybrid Convolutional Recurrent Neural Networks (CNN, RNN, 2D CNN-RNN, and 3D-CNN-RNN)26, EEGNet34, and CRNN (Cascade and Parallel model)35. The performance was evaluated using ACC, AUC, sensitivity, and specificity (1-specificity).
Posteriori verification
A posteriori verification was performed to compare the classification performance of different deep learning models based on the random selection of trials from a pool of 100 trials for a voting selection process. The procedure consists the following steps:
Step 1: For each subject, 10 trialsâ classification labels are randomly selected from a set of 100 trialsâ classification labels Yâ=â{y1, y2, â¦, y10}, where \({{\rm{y}}}_{i}\in \left\{0,1\right\}\) denotes the label of the i-th trial (0 represents PD patients, 1 represents healthy controls).
Step 2: A hard-voting strategy is used, where the total number of 0 labels N0 and the total number of 1 labels N1 in each subjectâs 10 trials are computed. If N0â>âN1, the subject is classified as having PD and the procedure progresses to Step 4. If N0â<âN1, the subject is classified as healthy, bypassing to Step 4. If N0â=âN1, the procedure advances to Step 3.
Step 3: The BoyerâMoore algorithm is applied for majority voting. For example, if Yâ=â{0,1,1,0,1,0,0,1,1,0}, thus N0â=âN1 occurs. The Boyer-Moore algorithmâs criterion is: the first trial vote is 0, the second trial vote is 1, the two votes are different and cancel each other; the second trial vote is 1, the third trial vote is 1, the two votes are the same, and the vote of BM1 increases by 1. We get BM0â=â1 and BM1â=â2, and classify the subject as healthy.
Step 4: Steps 1â3 are reiterated 100 times, and the mean and variance of the outcomes from these repetitions are calculated to provide a statistical overview of the classification performance.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
Anonymized data may be shared on request to the corresponding author for non-commercial use, subject to restrictions according to participant consent and data protection legislation.
Code availability
Code for all data analysis is available from the corresponding author upon request and establishment of data sharing agreement between institutions.
References
Selvaraj, S. & Piramanayagam, S. Impact of gene mutation in the development of Parkinsonâs disease. Genes Dis. 6, 120â128 (2019).
Armstrong, M. J. & Okun, M. S. Diagnosis and treatment of parkinson disease: a review. JAMA 323, 548â560 (2020).
Blauwendraat, C., Nalls, M. A. & Singleton, A. B. The genetic architecture of Parkinsonâs disease. Lancet Neurol. 19, 170â178 (2020).
Bloem, B. R., Okun, M. S. & Klein, C. Parkinsonâs disease. Lancet 397, 2284â2303 (2021).
Drotar, P. et al. Evaluation of handwriting kinematics and pressure for differential diagnosis of Parkinsonâs disease. Artif. Intell. Med. 67, 39â46 (2016).
Miller, D. B. & OâCallaghan, J. P. Biomarkers of Parkinsonâs disease: present and future. Metabolism. 64, S40âS46 (2015).
Horak, F. B. & Mancini, M. Objective biomarkers of balance and gait for Parkinsonâs disease using body-worn sensors. Mov. Disord. 28, 1544â1551 (2013).
Wroge, T. J. et al. Parkinsonâs disease diagnosis using machine learning and voice. In: 2018 IEEE signal processing in medicine and biology symposium (SPMB)). IEEE (2018).
Chu, C. et al. Spatiotemporal EEG microstate analysis in drug-free patients with Parkinsonâs disease. NeuroImage. Clin. 25, 102132 (2020).
Yi, G. S., Wang, J., Deng, B. & Wei, X. L. Complexity of resting-state EEG activity in the patients with early-stage Parkinsonâs disease. Cogn. Neurodyn. 11, 147â160 (2017).
Moazami-Goudarzi, M., Sarnthein, J., Michels, L., Moukhtieva, R. & Jeanmonod, D. Enhanced frontal low and high frequency power and synchronization in the resting EEG of parkinsonian patients. NeuroImage 41, 985â997 (2008).
Song, Y., Zheng, Q., Liu, B. & Gao, X. EEG conformer: Convolutional transformer for EEG decoding and visualization. IEEE Trans. Neural Syst. Rehabil. Eng. (2022).
Lillo, E., Mora, M. & Lucero, B. Automated diagnosis of schizophrenia using EEG microstates and deep convolutional neural network. Expert. Syst. Appl. 209, 118236 (2022).
Oh, S. L. et al. A deep learning approach for Parkinsonâs disease diagnosis from EEG signals. Neural. Comput. Appl. 32, 10927â10933 (2020).
Chaturvedi, M. et al. Quantitative EEG (QEEG) measures differentiate Parkinsonâs disease (PD) patients from healthy controls (HC). Front. Aging Neurosci. 9, 3 (2017).
Siderowf, A. & Lang, A. E. Premotor Parkinsonâs disease: concepts and definitions. Mov. Disord. 27, 608â616 (2012).
Gaig, C. & Tolosa, E. When does Parkinsonâs disease begin? Mov. Disord. 24, S656âS664 (2009).
Liu, H., Wang, E. Q., Verhagen Metman, L. & Larson, C. R. Vocal responses to perturbations in voice auditory feedback in individuals with Parkinsonâs disease. PLoS ONE 7, e33629 (2012).
Chen, X. et al. Sensorimotor control of vocal pitch production in Parkinsonâs disease. Brain Res. 1527, 99â107 (2013).
Mollaei, F., Shiller, D. M. & Gracco, V. L. Sensorimotor adaptation of speech in Parkinsonâs disease. Mov. Disord. 28, 1668â1674 (2013).
Mollaei, F., Shiller, D. M., Baum, S. R. & Gracco, V. L. Sensorimotor control of vocal pitch and formant frequencies in Parkinsonâs disease. Brain Res. 1646, 269â277 (2016).
Sapir, S. Multiple factors are involved in the dysarthria associated with Parkinsonâs disease: a review with implications for clinical practice and research. J. Speech Lang. Hear. Res. 57, 1330â1343 (2014).
Huang, X. et al. The impact of Parkinsonâs disease on the cortical mechanisms that support auditory-motor integration for voice control. Hum. Brain Mapp. 37, 4248â4261 (2016).
Li, Y. et al. Neurobehavioral effects of LSVT® LOUD on auditory-vocal Integration in Parkinsonâs disease: a preliminary study. Front. Neurosci. 15, 624801 (2021).
Dai, G. et al. Continuous theta burst stimulation over left supplementary motor area facilitates auditory-vocal integration in individuals with Parkinsonâs disease. Front. Aging Neurosci. 14, 948696 (2022).
Shi, X., Wang, T., Wang, L., Liu, H. & Yan, N. Hybrid convolutional recurrent neural networks outperform CNN and RNN in task-state EEG detection for Parkinsonâs disease. In: 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)). IEEE (2019).
van der Velden, B. H. M., Kuijf, H. J., Gilhuijs, K. G. A. & Viergever, M. A. Explainable artificial intelligence (XAI) in deep learning-based medical image analysis. Med. Image Anal. 79, 102470 (2022).
Gunning, D. et al. XAI-Explainable artificial intelligence. Sci. Robot 4, eaay7120 (2019).
Zhang, L., Wang, M., Liu, M. & Zhang, D. A survey on deep learning for neuroimaging-based brain disorder analysis. Front. Neurosci. 14, 779 (2020).
Springenberg, J. T., Dosovitskiy, A., Brox, T. & Riedmiller, M. Striving for simplicity: The all convolutional net. arXiv arXiv, 1412.6806 (2014).
Kipf, T. N. & Welling, M. Semi-supervised classification with graph convolutional networks. arXiv arXiv, 1609.02907 (2016).
Wu, C., Wu. X.-J. & Kittler, J. Spatial residual layer and dense connection block enhanced spatial temporal graph convolutional network for skeleton-based action recognition. In: Proceedings of the IEEE/CVF international conference on computer vision workshops) (2019).
Wu, B., Zhong, L., Li, H. & Ye, Y. Efficient complementary graph convolutional network without negative sampling for item recommendation. Knowledge-Based Syst. 256, 109758 (2022).
Lawhern, V. J. et al. EEGNet: a compact convolutional neural network for EEG-based brain-computer interfaces. J. Neural Eng. 15, 056013 (2018).
Yue, L. et al. Intention recognition from spatio-temporal representation of EEG signals. In: Australasian Database Conference). Springer (2021).
Vaswani, A. et al. Attention is all you need. Adv. Neur. Inf. Process. Syst. 30 (2017).
Yuvaraj, R., Acharya, U. R. & Hagiwara, Y. A novel Parkinsonâs Disease Diagnosis Index using higher-order spectra features in EEG signals. Neural. Comput. Appl. 30, 1225â1235 (2018).
Shah, S. A. A., Zhang, L. & Bais, A. Dynamical system based compact deep hybrid network for classification of Parkinson disease related EEG signals. Neural Netw. 130, 75â84 (2020).
Lee, S., Hussein, R., Ward, R., Jane Wang, Z. & McKeown, M. J. A convolutional-recurrent neural network approach to resting-state EEG classification in Parkinsonâs disease. J. Neurosci. Methods 361, 109282 (2021).
Hassin-Baer, S. et al. Identification of an early-stage Parkinsonâs disease neuromarker using event-related potentials, brain network analytics and machine-learning. PLoS ONE 17, e0261947 (2022).
Guenther, F. H., Ghosh, S. S. & Tourville, J. A. Neural modeling and imaging of the cortical interactions underlying syllable production. Brain Lang. 96, 280â301 (2006).
Bohland, J. W., Bullock, D. & Guenther, F. H. Neural representations and mechanisms for the performance of simple speech sequences. J. Cogn. Neurosci. 22, 1504â1529 (2010).
Chang, E. F., Niziolek, C. A., Knight, R. T., Nagarajan, S. S. & Houde, J. F. Human cortical sensorimotor network underlying feedback control of vocal pitch. Proc. Natl Acad. Sci. USA 110, 2653â2658 (2013).
Kort, N. S., Cuesta, P., Houde, J. F. & Nagarajan, S. S. Bihemispheric network dynamics coordinating vocal feedback control. Hum. Brain Mapp. 37, 1474â1485 (2016).
Parkinson, A. L. et al. Understanding the neural mechanisms involved in sensory control of voice production. NeuroImage 61, 314â322 (2012).
Behroozmand, R. et al. Sensory-motor networks involved in speech production and motor control: an fMRI study. NeuroImage 109, 418â428 (2015).
Tait, L. et al. EEG microstate complexity for aiding early diagnosis of Alzheimerâs disease. Sci. Rep. 10, 17627 (2020).
Jouen, A. L., Lancheros, M. & Laganaro, M. Microstate ERP analyses to pinpoint the articulatory onset in speech production. Brain Topogr. 34, 29â40 (2021).
Kindler, J., Hubl, D., Strik, W. K., Dierks, T. & Koenig, T. Resting-state EEG in schizophrenia: auditory verbal hallucinations are related to shortening of specific microstates. Clin. Neurophysiol. 122, 1179â1182 (2011).
Costa, T. D. C. et al. Are the EEG microstates correlated with motor and non-motor parameters in patients with Parkinsonâs disease? Neurophysiol. Clin. 53, 102839 (2023).
Guo, Z. et al. Top-down modulation of auditory-motor integration during speech production: the role of working memory. J. Neurosci. 37, 10323â10333 (2017).
Liu, D. et al. Top-down inhibitory mechanisms underlying auditory-motor integration for voice control: evidence by TMS. Cereb. Cortex 30, 4515â4527 (2020).
Li, T. et al. The left inferior frontal gyrus is causally linked to vocal feedback control: evidence from high-definition transcranial alternating current stimulation. Cereb. Cortex 33, 5625â5635 (2023).
Brabenec, L. et al. Non-invasive brain stimulation for speech in Parkinsonâs disease: a randomized controlled trial. Brain Stimul. 14, 571â578 (2021).
Brabenec, L. et al. Non-invasive stimulation of the auditory feedback area for improved articulation in Parkinsonâs disease. Parkinsonism. Relat. Disord. 61, 187â192 (2019).
Wang, S., Zhang, P., Wang, H., Yu, H. & Zhang, F. Detecting shilling groups in online recommender systems based on graph convolutional network. Inf. Process. Manag. 59, 103031 (2022).
Wang, X. et al. Traffic flow prediction via spatial temporal graph neural network. In: Proceedings of the web conference 2020) (2020).
Parmiggiani, A., Liu, D., Psota, E., Fitzgerald, R. & Norton, T. Donât get lost in the crowd: graph convolutional network for online animal tracking in dense groups. Comput. Electron. Agric. 212, 108038 (2023).
Zhang, X. et al. A survey on deep learning-based non-invasive brain signals: recent advances and new frontiers. J. Neural Eng. 18, 031002 (2021).
Aliakbaryhosseinabadi, S., Kamavuako, E. N., Jiang, N., Farina, D. & Mrachacz-Kersting, N. Online adaptive synchronous BCI system with attention variations. Brain-Computer Interface Research: A State-of-the-Art Summary 7, 31â41 (2019).
Hughes, A. J., Daniel, S. E., Kilford, L. & Lees, A. J. Accuracy of clinical diagnosis of idiopathic Parkinsonâs disease: a clinico-pathological study of 100 cases. J. Neurol. Neurosurg. Psychiatr. 55, 181â184 (1992).
Chen, S. H., Liu, H., Xu, Y. & Larson, C. R. Voice F0 responses to pitch-shifted voice feedback during English speech. J. Acoust. Soc. Am. 121, 1157â1163 (2007).
Lai, M., Demuru, M., Hillebrand, A. & Fraschini, M. A comparison between scalp- and source-reconstructed EEG networks. Sci. Rep. 8, 12269 (2018).
Toth, B. et al. EEG network connectivity changes in mild cognitive impairment - Preliminary results. Int. J. Psychophysiol. 92, 1â7 (2014).
Zhang, S., Tong, H. H., Tang, J., Xu, J. J. & Fan, W. Incomplete Network Alignment: Problem Definitions and Fast Solutions. ACM Trans. Knowl. Discov. Data 14, 1â26 (2020).
Kuramochi, M. & Karypis, G. Finding frequent patterns in a large sparse graph. Data. Min. Knowl. Disc. 11, 243â271 (2005).
Simonyan, K., Vedaldi, A. & Zisserman, A. Deep inside convolutional networks: Visualising image classification models and saliency maps. arXiv arXiv, 1312.6034 (2013).
Xia, F. et al. Graph learning: a survey. IEEE Trans. Artif. Intell. 2, 109â127 (2021).
Liu, Y. et al. Graph self-supervised learning: a survey. IEEE Trans. Knowl. Data Eng. 35, 5879â5900 (2022).
Koutra, D., Tong, H. & Lubensky, D. Big-align: Fast bipartite graph alignment. In: 2013 IEEE 13th international conference on data mining). IEEE (2013).
Zhang, J. & Philip, S. Y. Multiple anonymized social networks alignment. In: 2015 IEEE International Conference on Data Mining). IEEE (2015).
Singh, R., Xu, J. & Berger, B. Pairwise global alignment of protein interaction networks by matching neighborhood topology. In: Annual international conference on research in computational molecular biology). Springer (2007).
Liao, C. S., Lu, K., Baym, M., Singh, R. & Berger, B. IsoRankN: spectral methods for global alignment of multiple protein networks. Bioinformatics 25, i253âi258 (2009).
Defferrard, M., Bresson, X. & Vandergheynst, P. Convolutional neural networks on graphs with fast localized spectral filtering. Adv. Neural Inf. Process. Syst. 29, (2016).
Selvaraju, R. R. et al. Grad-cam: visual explanations from deep networks via gradient-based localization. In: Proceedings of the IEEE international conference on computer vision). IEEE (2017).
Michel, C. M. & Koenig, T. EEG microstates as a tool for studying the temporal dynamics of whole-brain neuronal networks: a review. Neuroimage 180, 577â593 (2018).
Pascual-Marqui, R. D., Michel, C. M. & Lehmann, D. Segmentation of brain electrical activity into microstates: model estimation and validation. IEEE Trans. Biomed. Eng. 42, 658â665 (1995).
Acknowledgements
This study was funded by grants from the National Key R&D Program of China (2018YFA0701405), National Natural Science Foundation of China (82172528, 81972147, 62271477, 82372556, 82302859, 82102648), Guangdong Basic and Applied Basic Research Foundation (2022A1515011203), Guangdong Province Science and Technology Planning Project (2017A050501014), Guangzhou Science and Technology Programme (201604020115), Shenzhen Science and Technology Program (KQTD20200820113106007), Shenzhen Science and Technology Program Grant Award (JCYJ20210324115810030, JCYJ20220818101217037, JCYJ20220818101411025), and The Science and Technology Planning Project of Guangdong Province (2023B1212060018).
Author information
Authors and Affiliations
Contributions
X.C., N.Y. and H.L. conceived, designed, and coordinated the study. G.D., J.L., X.Z., X.H., Y.L., M.T. and X.C. recruited patients and completed data extraction. S.Z., G.D., J.L., X.Z. and X.H. contributed to performing the data analysis. S.Z., L.W., P.F., N.Y. and H.L. contributed to interpreting the results. S.Z., N.Y. and H.L. drafted and revised the manuscript. All authors approved the final version of the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisherâs note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the articleâs Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the articleâs Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zhao, S., Dai, G., Li, J. et al. An interpretable model based on graph learning for diagnosis of Parkinsonâs disease with voice-related EEG. npj Digit. Med. 7, 3 (2024). https://doi.org/10.1038/s41746-023-00983-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41746-023-00983-9