Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

HTML conversions sometimes display errors due to content that did not convert correctly from the source. This paper uses the following packages that are not yet supported by the HTML conversion tool. Feedback on these issues are not necessary; they are known and are being worked on.

  • failed: multibib

Authors: achieve the best HTML results from your LaTeX submissions by following these best practices.

License: CC BY-NC-SA 4.0
arXiv:2404.05781v1 [q-bio.NC] 08 Apr 2024
\newcites

suppReferences

Group-specific discriminant analysis reveals statistically validated sex differences in lateralization of brain functional network

Shuo Zhou Department of Computer Science, University of Sheffield, Sheffield, UK Centre for Machine Intelligence, University of Sheffield, Sheffield, UK These authors contributed equally to this work Junhao Luo State Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, China These authors contributed equally to this work Yaya Jiang State Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, China Haolin Wang Department of Computer Science, University of Sheffield, Sheffield, UK Centre for Machine Intelligence, University of Sheffield, Sheffield, UK Haiping Lu Department of Computer Science, University of Sheffield, Sheffield, UK Centre for Machine Intelligence, University of Sheffield, Sheffield, UK h.lu@sheffield.ac.uk; gaolang.gong@bnu.edu.cn Gaolang Gong State Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, China Beijing Key Laboratory of Brain Imaging and Connectomics, Beijing Normal University, Beijing, China Chinese Institute for Brain Research, Beijing, China h.lu@sheffield.ac.uk; gaolang.gong@bnu.edu.cn
Abstract

Lateralization is a fundamental feature of the human brain, where sex differences have been observed. Conventional studies in neuroscience on sex-specific lateralization are typically conducted on univariate statistical comparisons between male and female groups. However, these analyses often lack effective validation of group specificity. Here, we formulate modeling sex differences in lateralization of functional networks as a dual-classification problem, consisting of first-order classification for left vs. right functional networks and second-order classification for male vs. female models. To capture sex-specific patterns, we develop the Group-Specific Discriminant Analysis (GSDA) for first-order classification. The evaluation on two public neuroimaging datasets demonstrates the efficacy of GSDA in learning sex-specific models from functional networks, achieving a significant improvement in group specificity over baseline methods. The major sex differences are in the strength of lateralization and the interactions within and between lobes. The GSDA-based method is generic in nature and can be adapted to other group-specific analyses such as handedness-specific or disease-specific analyses.

Introduction

Human brains are functionally lateralized [1, 2]. The asymmetries between left and right brain hemispheres are believed to reflect a complex interplay of evolutionary, hereditary, developmental, experiential, and pathological influences [3]. Researchers are developing insights on lateralization through psychological, pharmacological, and neuroscience investigations [4, 5]. One important understanding is that multiple factors influence human brain lateralization [5], with sex being one of the most representative [6, 7, 8, 9, 10, 11, 12, 13]. A popular viewpoint is that males have a more asymmetric brain organization while females have a more “bilateral” brain organization, which may result in the males’ superior spatial skills and the females’ superior verbal skills [14, 15].

Measurement of functional brain lateralization is valuable but challenging [16]. Direct approaches such as selectively modulating or suppressing cortical activities and circuits in a single hemisphere [17] often pose a risk of inflicting harm on the human brains [18]. Over the last two decades, functional neuroimaging techniques have been widely used in neuroscience, offering a powerful and non-invasive approach to studying human brain lateralization [19, 20]. One popular technique is analyzing functional connectivity (FC) of the brain’s resting-state, which is also known as the brain network or connectome [21]. This is usually derived from functional magnetic resonance imaging (fMRI) time series and considered an intrinsic “fingerprint” of the human brain [4, 22, 23]. A previous study [8] reported sex differences in lateralization of resting-state networks, with more right-lateralized visual and default-mode network components for males and females, respectively. Several other networks also showed differences between males and females [8]. Additionally, males and females have also demonstrated significant differences in homotopic functional connectivity of numerous regions [24].

Studies on brain lateralization have largely focused on modeling asymmetry effects region-by-region [5]. These lateralized brain regions are usually measured by the laterality index (LI) [25, 7], or identified through statistical univariate analysis comparing homologous regions [26, 27]. However, using these conventional methods to search for sex-specific lateralization patterns is mostly limited to within-group analysis [28, 29, 30, 31, 32, 33]. For example, to understand male-specific lateralization, analyses are performed on male and female data separately to label features significantly differing from female data as “male-specific”. Moreover, these analyses often do not validate the models on unseen samples, and therefore the resulting patterns may not be truly group-specific. In addition, given the high similarity between male and female brains, the small size of statistical effects makes detecting sex differences in lateralization more difficult [34]. As a result, the true specificity of lateralization may be overwhelmed by the similarities. Hence, effectively modeling and validating sex-specific lateralization remains challenging.

Refer to caption
Figure 1: a, The proposed classification workflow for identifying sex-specific brain lateralization. ①–③: Hemispheric features are extracted from the intrahemispheric brain functional network, which is constructed using resting-state functional MRI time series. ④⑤: First-order classification learns the differences between the two hemispheres, where a group-specific discriminant analysis (GSDA) classifier is trained to classify left vs. right hemispheres for a target group. ⑥: Second-order classification trains a standard logistic regression for classifying the male- vs. female-specific models obtained from the first-order classification, to identify the weights that significantly contribute to the sex-specific predictions. ⑦: Evaluate the predictions and interpret the model weights. b, Group-specific discriminant analysis with the logistic loss (GSDA-Logit) for the first-order classification of left vs. right brain hemispheres. This model jointly maximizes the likelihood of labels for the target group (with non-target-group labels masked out) and the grouping factor dependence for both the target and non-target groups, where top{}^{\top}start_FLOATSUPERSCRIPT ⊤ end_FLOATSUPERSCRIPT denotes the transpose of vectors, 𝐱𝐱\mathbf{x}bold_x denotes the input training samples, 𝐱tsubscript𝐱normal-t\mathbf{x}_{\mathrm{t}}bold_x start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT denotes target group training samples, 𝐱tsubscript𝐱normal-t\mathbf{x}_{\setminus\mathrm{t}}bold_x start_POSTSUBSCRIPT ∖ roman_t end_POSTSUBSCRIPT denotes non-target group samples, and a hyperparameter λ0𝜆0\lambda\geq 0italic_λ ≥ 0 controls the grouping factor dependence.

Here, we formulate the identification of sex differences in brain lateralization as a machine learning classification problem to address the aforementioned challenges. First, to model sex differences in brain lateralization, we propose a dual classification workflow. This consists of a first-order classification of left vs. right brain hemispheres and a second-order classification of male- vs. female-specific models. The obtained first- and second-order model weights can be interpreted as lateralization strength and sex differences, respectively. The whole workflow is presented in Fig. 1a. Second, to learn group (sex)-specific models in the first-order classification, we propose a novel group-specific discriminant analysis (GSDA) algorithm (Fig. 1b). Third, to validate the learned models, we leverage the cross-validation method in machine learning for a statistical evaluation. The models’ classification performance will be assessed by their accuracy on male and female test samples via cross-validation. Fourth, we propose a new metric, the Group Specificity Index (GSI), for evaluating the group specificity of the learned models.

Our final contribution involves conducting classification experiments using intrahemispheric connections extracted from resting-state fMRI (rs-fMRI) data of two public neuroimaging repositories, the Human Connectome Project (HCP) [35] and the Brain Genomics Superstruct Project (GSP) [36]. The results demonstrate a significant improvement in GSI obtained by GSDA over the baselines. Further second-order classification reveals consistent sex differences in lateralization across datasets: 1) about half of the sex-specific lateralized connections are shared between male and female brain functional network, with differences in the strength of lateralization, 2) stronger positive inter-lobe interactions are more left-lateralized in the male brain network, while stronger positive intra-lobe interactions are more right-lateralized in the female brain network.

Results

Refer to caption
Figure 2: Performance of left vs. right brain classification using Group-Specific Discriminant Analysis with the logistic loss (GSDA-Logit) on Human Connectome Project (HCP) [35] data with respect to the hyperparameter λ𝜆\lambdaitalic_λ, which controls the grouping factor (sex) dependence. A larger λ𝜆\lambdaitalic_λ corresponds to a higher dependence. When λ=0𝜆0\lambda=0italic_λ = 0, GSDA-Logit degenerates to a standard logistic regression for the target-group data. There were 1,000 random training-testing partitions for the experiment, where each subject randomly contributed one hemisphere for training and the other for testing, resulting in 50% brain hemispheres being selected as training samples. a, The test accuracy on male and female sets increasingly diverges with the increase of dependence on sex (λ𝜆\lambdaitalic_λ). The average test accuracy is represented by solid or dashed lines, with standard deviations shown as error bands, computed across 1,000 random training-testing partitions. b, The Group Specificity Index (GSI) calculated from the test results in Fig. 2a increases with λ𝜆\lambdaitalic_λ. The horizontal lines in each box represent the 25th percentile, median, and 75th percentile of the GSI over the 1,000 test sets, respectively, from bottom to top, and the green triangles represent the mean. The average GSI approaches 0 when λ=0𝜆0\lambda=0italic_λ = 0, indicating that the learned logistic regression models without regularization of group dependence are not sex-specific, despite being trained on male (or female) data only.

Diverged test accuracy on male and female sets

Figure 2 depicts the performance of group-specific discriminant analysis (GSDA) in classifying left vs. right brain hemispheres on the HCP data [35], across a varied range of values for hyperparameter λ𝜆\lambdaitalic_λ. A larger λ𝜆\lambdaitalic_λ indicates a higher grouping factor (sex) dependence. When the target group is male (the left of Fig. 2a), the labels for the left and right hemispheres of the female training data were masked. Therefore, the training female samples were only involved in the grouping factor dependence regularization. In this scenario, the average accuracy obtained on the male test samples (the blue solid line) stays higher than that on the female test samples (the orange dashed line). The increase of λ𝜆\lambdaitalic_λ leads to an increased gap between the test accuracy on target and non-target test sets. In particular, this discrepancy widens significantly within the range 0<λ50𝜆50<\lambda\leq 50 < italic_λ ≤ 5 and stabilizes to a 20% gap for λ>5𝜆5\lambda>5italic_λ > 5 (Fig. 2a). These observations remain consistent in results with different cross-validation strategies for the HCP data (Extended Data Fig. 1a,c) and the GSP data (Extended Data Fig. 2a,c).

The group specificity of models obtained by GSDA increases with a larger λ𝜆\lambdaitalic_λ, as reflected by our proposed metric, the Group Specificity Index (GSI), which is presented as a box plot in Fig. 2b. When 0<λ50𝜆50<\lambda\leq 50 < italic_λ ≤ 5, the GSI for both male- and female-specific GSDA models increases with the increase of λ𝜆\lambdaitalic_λ. When λ5𝜆5\lambda\geq 5italic_λ ≥ 5, the GSI maintains at around 0.4. Based on both accuracy and GSI results, λ=5𝜆5\lambda=5italic_λ = 5 is an “elbow” point in the experiment across different datasets and cross-validation strategies, which can be considered an optimal value for the trade-off between classification accuracy, group specificity, and model complexity (the hyperparameter for 2subscript2\ell_{2}roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT regularization was fixed to 0.1, so the larger λ𝜆\lambdaitalic_λ, the lower relative importance of 2subscript2\ell_{2}roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT regularization). Hence, in the rest of this article, we will use λ=5𝜆5\lambda=5italic_λ = 5 for GSDA as the main sex-specific model to present the results and findings.

In contrast, the GSI steadily approaches zero without the grouping factor dependence regularization. At λ=0𝜆0\lambda=0italic_λ = 0, where GSDA degenerates to a standard logistic regression trained only on the target-group hemispheres, the accuracy is nearly 100% for both male and female test samples (Fig. 2a, Table 1, Extended Data Fig. 1a,c, and Extended Data Fig. 2a,c). This performance is similar to the multivariate control baseline, which uses standard logistic regression trained on mixed male and female hemispheres. From Table 1, the control models achieved an accuracy of 99.99 ±plus-or-minus\pm± 0.04% for male and 99.92 ±plus-or-minus\pm± 0.13% for female HCP test samples, and 99.94 ±plus-or-minus\pm± 0.07% for male and 99.99 ±plus-or-minus\pm± 0.01% for female GSP test samples. Because of the same property and similar performance compared to the standard logistic regression (multivariate control baselines), we will view GSDA with λ=0𝜆0\lambda=0italic_λ = 0 as an additional multivariate baseline.

GSDA-based models learned distinct weights

Beyond classification performance similarity, the weights of multivariate baselines (control and GSDA with λ=0𝜆0\lambda=0italic_λ = 0) are also highly correlated. As shown in Fig. 3a, the average Pearson correlation coefficients between multivariate baselines are 0.99 for analyses conducted within either the HCP or GSP datasets. Similarly, in univariate analyses based on the t𝑡titalic_t-test of paired left and right connections, the t𝑡titalic_t-values of within-group analysis showed a 0.99 correlation with the t𝑡titalic_t-values derived from mixed male and female samples (univariate control). Among these multivariate and univariate baselines, the correlation for any arbitrary pair exceeds 0.910.910.910.91 for within-dataset results and 0.70.70.70.7 for cross-dataset results. This high correlation suggests that the lateralization modeled by multivariate or univariate baselines is common to both males and females, regardless of whether the analysis is conducted with exclusively male or female data, or with mixed data. This corresponds to the top red triangular cluster in Fig. 3a.

In contrast, our sex-specific models (with a higher GSI) show lower correlations with the univariate and multivariate baseline models. This corresponds to the blue rectangular cluster at the bottom of Fig. 3a, where a majority of coefficients fall within the range of 0.35 to 0.5. Increasing the value of λ𝜆\lambdaitalic_λ leads to a decreasing correlation between the control and GSDA models (λ>0𝜆0\lambda>0italic_λ > 0), for both results from HCP (Fig. 3b, and first columns of Fig. 3c,d) and GSP (Extended Data Fig. 3a,b, and first columns of Extended Data Fig. 3c,d). Moreover, the weights of sex-specific models are stable. As shown in Fig. 3c,d, the average correlation of any pair for GSDA with λ2𝜆2\lambda\geq 2italic_λ ≥ 2 is 0.99 or above.

Table 1: First-order classification (left vs. right brain hemispheres) accuracy on male and female test sets from the HCP [35] & Brain Genomics Superstruct Project (GSP) [36]. Group-specific models (GSDA with λ=5𝜆5\lambda=5italic_λ = 5) are compared with three multivariate baselines: 1) standard logistic regression trained on a mixture of male and female training data, 2) GSDA with λ=0𝜆0\lambda=0italic_λ = 0 (equivalent to standard logistic regression) trained on male data only, and 3) GSDA with λ=0𝜆0\lambda=0italic_λ = 0 trained on female data only. λ=5𝜆5\lambda=5italic_λ = 5 is an optimal value for GSDA on the data as determined by the accuracy and GSI in Fig. 2. The baselines achieved similar accuracy on both male and female test sets, indicating a lack of group specificity. Conversely, the group-specific models maintained accuracy on the target test set but showed a significant gap with the lower accuracy on the non-target test set.
Average test accuracy (%) and gap (target -- non-target)
Classification method (target group) HCP male HCP female HCP gap GSP male GSP female GSP gap
Logistic regression (male + female) 99.99 ±plus-or-minus\pm± 0.04 99.92 ±plus-or-minus\pm± 0.13 0.07 99.94 ±plus-or-minus\pm± 0.07 99.99 ±plus-or-minus\pm± 0.01 0.05
GSDA (λ=0𝜆0\lambda=0italic_λ = 0, male) 99.87 ±plus-or-minus\pm± 0.16 99.85 ±plus-or-minus\pm± 0.17 0.02 99.93 ±plus-or-minus\pm± 0.08 99.99 ±plus-or-minus\pm± 0.01 0.06
GSDA (λ=0𝜆0\lambda=0italic_λ = 0, female) 99.93 ±plus-or-minus\pm± 0.12 99.99 ±plus-or-minus\pm± 0.04 0.06 99.97 ±plus-or-minus\pm± 0.05 99.95 ±plus-or-minus\pm± 0.07 0.02
\rowcolorgray!20 GSDA (λ=5𝜆5\lambda=5italic_λ = 5, male) 92.75 ±plus-or-minus\pm± 1.83 68.52 ±plus-or-minus\pm± 2.88 24.23 91.85 ±plus-or-minus\pm± 1.77 71.28 ±plus-or-minus\pm± 2.13 20.57
\rowcolorgray!20 GSDA (λ=5𝜆5\lambda=5italic_λ = 5, female) 70.76 ±plus-or-minus\pm± 2.56 93.16 ±plus-or-minus\pm± 1.89 22.40 74.70 ±plus-or-minus\pm± 2.22 92.81 ±plus-or-minus\pm± 1.35 18.11
Refer to caption
Figure 3: Pearson correlation between model weights. a, Pairwise correlation between weights of 16 models, including multivariate models from Table 1 and univariate models, labeled along the x𝑥xitalic_x- and y𝑦yitalic_y-axis. Two clusters can be observed here: multivariate and univariate baselines versus GSDA with λ=5𝜆5\lambda=5italic_λ = 5. b, Correlation between GSDA and multivariate control models (trained on mixed male and female data) on the HCP data. As λ𝜆\lambdaitalic_λ increases, the GSDA models become less correlated with the control models. c, d, Average pairwise correlation for c. male-specific and d. female-specific GSDA models trained on HCP data, with respect to λ𝜆\lambdaitalic_λ. The weights of sex-specific models remain stable (correlation 0.99absent0.99\geq 0.99≥ 0.99) for λ2𝜆2\lambda\geq 2italic_λ ≥ 2.

Identifying sex-specific lateralized connections with dual-classification weights

Refer to caption
Figure 4: Lateralized functional connections identified by sex-specific models (GSDA λ=5𝜆5\lambda=5italic_λ = 5). a, A mask for identifying sex-specific lateralized connections is derived in two steps: 1) averaging weights across 1,000 second-order models from different random splits for HCP and GSP data, respectively; 2) identifying overlaps between the top 5% largest average weights from HCP and those from GSP. The circle represents a brain hemisphere, and each cell on the rim represents a region of interest (ROI) within the half brain. The seven colors indicate seven functional parcellations defined in Brainnetome atlas (BNA) [37]. The numbers on the rim are the start and end ROI numbers of the lobe in the BNA atlas, where the 123 ROIs are labeled from 0 to 122. b, The average degree [38] over ROIs in each of the seven BNA lobes, where the frontal lobe shows the largest degree. c, 30 unique sex-specific lateralized connections learned from HCP and GSP. Each connection is represented in the form of {ROI}-{ROI}, and each ROI is represented in the form of {Lobe}_{Gyrus}_{ROI Number}, where the lobe and gyrus are defined in BNA. The full names of the gyrus are SFG: superior frontal gyrus, MFG: middle frontal gyrus, IFG: inferior frontal gyrus, OrG: orbital gyrus, PrG: precentral gyrus, PCL: paracentral lobule, STG: superior temporal gyrus, MTG: middle temporal gyrus, ITG: inferior temporal gyrus, IPL: inferior parietal lobe, Pcun: precuneus, PoG, postcentral gyrus, INS: insular gyrus, CG: cingulate gyrus, MVOcC: medioVentral occipital cortex, LOcC: lateral occipital cortex. 22 out of 30 connections are associated with the frontal. These connections are either shared between males and females (middle column) or “exclusive” to one group (left and right column). For each sex-specific model, e.g. male-specific model on HCP, about half of the identified connections are shared. Nine connections are shared in total and their sex differences are in the strength of lateralization (Fig. 5a-d). 21 connections are “exclusive” and their sex differences are in the patterns of inter-/intra-lobe interactions (Fig. 5e-f, 6).
Refer to caption
Figure 5: Sex-specific lateralized connections with first-order model weights (GSDA λ=5𝜆5\lambda=5italic_λ = 5). The connections were identified by applying the mask in Fig. 4a to the top 5% weights (by magnitude) from four models specific to males of HCP (a and e), females of HCP (b and f), males of GSP (c and g), and females of GSP (d and h). Each of the four models was obtained by averaging the corresponding 1,000 first-order models. The sex-specific lateralized connections consist of shared connections between male and female models (a-d), and the group (sex) “exclusive” connections (e-h). The weights of these shared connections show consistent sex differences. In female-specific models, the weights for connections involving the frontal lobe tend to be larger than those in male-specific models, especially for positive weights. Conversely, in male-specific models, the weights for connections to other lobes are generally larger than those in female-specific models. The “exclusive” connections in male-specific models are mostly inter-lobe and negative, whereas in female-specific models, they are mostly intra-lobe and positive. Statistics about these connections can be found in Fig. 6.
Refer to caption
Figure 6: Count of the group “exclusive” lateralized connections for HCP and GSP (Fig. 5e-h), categorized by associated lobes, inter- or intra-lobe, and signs of the first-order weights. The connections were identified by a, the male-specific models with positive first-order weights; b, the female-specific models with positive first-order weights; c, the male-specific models with negative first-order weights; d, the female-specific models with negative first-order weights. In male brain networks, 7 out of 10 “exclusive” connection counts are inter-lobe, with 71.4% of the inter-lobe connections having negative first-order weights. In female brain networks, 11 out of 13 “exclusive” connection counts are intra-lobe, with 91.7% of the intra-lobe connections having positive first-order weights.

To identify lateralized connections related to sex differences among the 7,503 intrahemispheric connections, we performed a second-order classification. This involved training standard logistic regression models to distinguish between male- and female-specific models learned from the first-order classification, using 80% of the first-order models for training and 20% for testing. The test accuracy for second-order classification consistently achieved nearly 100% over 1,000 random splits. This indicates that the sex differences in the first-order model weights are generalizable.

Based on the weights from these second-order classification models, we derived a mask that characterizes sex differences in the lateralized connections. We first averaged the weights across 1,000 second-order classification models from different random splits for the HCP and GSP datasets, respectively. Then, we identified the overlap between the top 5% of the largest average weights (by magnitude) from HCP and those from GSP. The resulting map is represented by the figure of chords in Fig. 4a. The threshold of 5% was chosen because the second-order logistic regression classifiers were trained with 2subscript2\ell_{2}roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT regularization, which can be interpreted as a Gaussian prior (normal distribution) on model weights, with 5% being a commonly used statistical significance level for a Gaussian distribution. By calculating the average degree [38] (Fig. 4b) of connections for each lobe within this mask, we can learn that sex differences in first-order weights are associated with frontal, parietal, and occipital lobes, where the average degrees exceed 1. The frontal lobe shows the largest average degree, indicating significant sex differences.

We then applied this mask to the top 5% weights of four first-order classification models: HCP male-specific, HCP female-specific, GSP male-specific, and GSP female-specific. The obtained lateralized connections with sex differences are shown in Fig. 4c, Fig. 5a-h, and Extended Data Fig. 4a-d. The weights of these four models were obtained by taking the average of the corresponding 1,000 models learned from different random splits. In total, 47 lateralized connections with repetition were identified, of which 30 connections are unique. Among these 47 connections, the middle frontal gyrus (MFG) was the most frequently involved region, suggesting it may serve as a hub. Specifically, 17 out of the 47 connections were associated with the MFG in both male and female samples across both datasets.

Sex-specific lateralization: shared and “exclusive” connections

For each of the four sex-specific models, half of the identified lateralized connections are shared between male and female brain networks (Fig. 4c) on average: on HCP, 6 out of 12 for the male-specific and 6 out of 13 for the female-specific model; on GSP, 6 out of 10 for the male-specific and 6 out of 12 for the female-specific model. Among the 30 unique sex-specific lateralized connections identified across datasets, 9 (nearly one-third) are shared between males and females. To illustrate these findings, we have separated the shared and “exclusive” connections, as depicted in Fig. 5a-h and Fig. 5e-h, respectively.

For the shared lateralized connections (Fig. 5a,b for HCP and Fig. 5c,d for GSP), we observed sex differences in the magnitude of first-order weights, i.e., strength of lateralization. Specifically, for the female-specific models, the magnitudes of first-order weights corresponding to the connections associated with the frontal lobe are generally larger compared to those for male-specific models, particularly those of the positive weights. In male-specific models, the magnitudes of first-order weights for connections related to other lobes are larger than those in female-specific models.

For the “exclusive” connections in the four models (Fig. 5e-h), male-specific models contain more inter-lobe lateralized connections (Fig. 6a), with more than 70% of corresponding weights being negative, as shown by the blue chords in Fig. 5e,g. Female-specific models, on the other hand, contain more intra-lobe lateralized connections (Fig. 6b), with more than 90% of weights being positive, indicated by the red chords in Fig. 5f,h. Notably, these patterns of inter- and intra-lobe lateralization for males and females are consistent across joint or separate analyses of both HCP and GSP data (Fig. Extended Data Fig. 5), demonstrating the stability and reliability of these findings.

Discussion

Cross-validation challenges conventional statistical approach for investigating lateralization

Traditional neuroscience studies commonly assume that results from within-group analyses are specific to the group being studied [29, 30, 33, 28, 32]. This methodology of exclusively using male or female data to explore sex-specific characteristics is intuitively logical. However, our experimental results obtained via cross-validation challenge this assumption. For example, as depicted in Fig. 2a, when standard logistic regression models are exclusively trained on data from one target group (male or female), the performance on test sets for both the target and non-target groups is nearly identical. Hence, the performance of these baseline models is not sensitive to sex-based sampling. Based on the definition of generalization in statistical learning theory [39], the similarity in generalization errors suggests that these models are general to both males and females, not sex-specific. This finding implies that statistical methods can learn common patterns even when trained on data from a specific group, contradicting the conventional assumption in group-specific analysis. This conclusion holds at least in our study on left vs. right brain classification using the HCP and GSP datasets.

The performance of our sex-specific models (GSDA with λ>0𝜆0\lambda>0italic_λ > 0) is sensitive to sex-based sampling. The classification results (Fig. 2a, Extended Data Fig. 1a,c, and Extended Data Fig. 2a,c) reveal that the generalization error for the target-group test sets is significantly lower than that for the non-target-group test sets. This indicates a stronger specificity to sex compared to the multivariate baseline models, as reflected by our group-specificity index (GSI) results. The differences observed in test performance highlight the importance of cross-validation in validating the group specificity of statistical analysis results.

While univariate analysis results are not directly applicable to unseen samples for testing, the strong correlation between univariate and multivariate baselines offers valuable insights for pattern identification. For example, the correlation of the within-group univariate t𝑡titalic_t-test results with univariate control models (mixed) and multivariate baselines exceeds 0.99 and 0.91, respectively. This suggests that the outcomes of our within-group univariate analyses are likely common to both males and females. Consequently, previous conclusions from such within-group analyses should be revisited and validated. Moreover, although multivariate methods are theoretically superior in capturing interactions between features compared to univariate methods, the observed similarity between multivariate and univariate results suggests that multivariate methods might not always identify patterns distinct from those found by univariate methods.

Regions and connections identified across datasets for sex-specific lateralization

The mask resulting from the second-order classification revealed sex differences in connections across lobes, including the frontal, temporal, parietal, insular, limbic, and occipital lobes, where functional differences between males and females were observed in previous studies [8, 40, 9, 41, 42, 43]. Among the sex-specific lateralized connections, MTG-IPL, MFG-IFG, and MFG-INS are shared in both male- and female-specific models across the two datasets.

From the perspective of gyrus, which engages in various cognitive functions, the lateralized regions include the MTG[44] (sound recognition and language processing), MFG [45] (literacy and numeracy), IPL [46, 47] (spatial attention, multimodal sensory integration, and oculomotor control), IFG[48] (speech and language processing), and INS [49] (various sensorimotor processing and risk-reward behavior). These regions show lateralization and sex differences in certain functions including speech processing, language, and spatial attention [50, 8, 40]. The MFG, a hub region in this study, is a core component of the multiple demand system [51], and presents hemispheric specialization, with the left MFG primarily supporting literacy development, while the right MFG is vital for numeracy [45]. The MTG showed lateralization in activated volumes for both males and females during language tasks, while the lateralization of IFG was only observed in males [52]. Our study reports different weights of connections related to these two regions. This suggests that the lateralization of a region’s external connections can reflect the lateralization of its functional activation. The “activity flow” theory in neuroscience has linked the connections and functional activation [53], proposing that the seed-based connection-weighted sums of the activation of other regions can predict functional activation of the seed region. Our results suggest a correlation between the lateralization of functional connectivity and activation, although further quantitative analysis is required to investigate the specifics of this relationship.

From the perspective of connections, the MTG-IPL connection is associated with picture naming and displays notable plasticity [54]. Laws (2004) [55] assessed sex differences in picture naming speed, and Ala-Salomäki et al. [56] found that picture naming shows reliable left-lateralized evoked activation. These studies indicate the potential of sex and hemispheric differences for MTG-IPL connection and picture-naming cognition. Our GSDA framework successfully captured these effects, suggesting that the underlying mechanism for picture naming lateralization could be the lateralization of related connections, such as the MTG-IPL connection. As for the MFG-IFG connection, proficiency in artificial grammar rules was found to be positively linked to the functional connectivity between the left IFG and left MFG [57, 52], and this function also identified sex differences [58]. In consistency with these studies, our framework has captured both the lateralization and sex effects on the MFG-IFG connection. In the case of the MFG-INS connection, associated with mild traumatic brain injury (mTBI)[59], our study reveals sex-specific hemispheric differences, suggesting that personalized intervention and diagnostic approaches considering brain hemispheric and sex differences may be necessary for more effective mTBI treatment.

Sex differences in lateralization: strength and inter-/intra-lobe interaction patterns

Sex differences in strengths of shared lateralized connections: As reported in the results section, the first-order weights corresponding to these shared connections show consistent sex differences (Fig. 4c-f). In our labeling strategy, “left” was labeled as 0 and “right” as 1. Therefore, a positive first-order weight indicates that stronger positive interactions (FC value approaching 1) between two ROIs suggest a higher probability of right lateralization. Conversely, a more negative interaction (FC value approaching -1) indicates that stronger negative interactions between two ROIs suggest a higher probability of left lateralization. The opposite interpretation applies to negative first-order weights. Therefore, we can interpret the sex differences in first-order weights for the shared connections as follows: positive interactions involving the frontal lobes are more right-lateralized in females than in males, whereas positive interactions involving the temporal, parietal, insular, and limbic lobes are more left-lateralized in males than in females. This observation of difference in strength of lateralization aligns with the findings on the lateralization patterns of right- and left-handed individuals [60], and supports neuroscience discoveries of shared functional network mechanisms across males and females [61, 32].

Sex differences in inter-/intra-lobe interaction patterns are identified by the “exclusive” lateralized connections, particularly within the frontal lobe (Fig. 4g-j), a key region for language processing [62]. Using the same approach as above for interpreting first-order weights, we can summarize that males tend to have a stronger left lateralization in positive inter-lobe interactions, while females tend to have a stronger right lateralization in positive intra-lobe interactions. This divergence may result from the evolutionary pressure for lateralization, which optimizes functional organization and reduces redundancy among brain regions [63, 64]. Inter-lobe connections, characterized by long-range wiring, are metabolically costly [65, 66, 67, 68], while the shorter-range intra-lobe connections are more energy-efficient. In females, these intra-frontal connections may enhance language abilities. Conversely, male inter-lobe connections may be driven by the need to engage more extensive functional areas for complex visuospatial tasks. This divergence can be a factor in sex and lateralized differences in cognitive abilities, with males typically outperforming in rightward visuospatial tasks, and females in leftward verbal tasks [14, 15].

Potential of GSDA for more general applications

Our study focused on sex as a grouping factor and employed brain hemisphere labels to identify sex-specific lateralized patterns for human brain functional networks. The results demonstrate efficacy and stability in identifying and validating sex differences in lateralization. Importantly, the scope of this general predictive framework extends beyond its current application. For instance, employing different grouping factors, such as handedness or disease vs. control, can advance the analysis of distinct characteristics. Moreover, the GSDA algorithm can be extended to grouping factor combinations, such as sex and handedness. While this work focused on classification and a discrete group factor, our method can be adapted for regression tasks (e.g. predicting behavioral/cognitive scores) and continuous grouping factors (e.g. age or IQ score).

References

  • [1] Ojemann, G., Ojemann, J., Lettich, E. & Berger, M. Cortical language localization in left, dominant hemisphere: an electrical stimulation mapping investigation in 117 patients. \JournalTitleJournal of Neurosurgery 71, 316–326 (1989).
  • [2] Gazzaniga, M. S. Cerebral specialization and interhemispheric communication: does the corpus callosum enable the human condition? \JournalTitleBrain 123, 1293–1326 (2000).
  • [3] Corballis, M. C. The evolution of lateralized brain circuits. \JournalTitleFrontiers in Psychology 8, 1021 (2017).
  • [4] Passingham, R. E., Stephan, K. E. & Kötter, R. The anatomical basis of functional localization in the cortex. \JournalTitleNature Reviews Neuroscience 3, 606–616 (2002).
  • [5] Toga, A. W. & Thompson, P. M. Mapping brain asymmetry. \JournalTitleNature Reviews Neuroscience 4, 37–48 (2003).
  • [6] Clements, A. et al. Sex differences in cerebral laterality of language and visuospatial processing. \JournalTitleBrain and Language 98, 150–158 (2006).
  • [7] Tomasi, D. & Volkow, N. D. Laterality patterns of brain functional connectivity: gender effects. \JournalTitleCerebral Cortex 22, 1455–1462 (2012).
  • [8] Agcaoglu, O., Miller, R., Mayer, A. R., Hugdahl, K. & Calhoun, V. D. Lateralization of resting state networks and relationship to age and gender. \JournalTitleNeuroImage 104, 310–325 (2015).
  • [9] Reber, J. & Tranel, D. Sex differences in the functional lateralization of emotion and decision making in the human brain. \JournalTitleJournal of Neuroscience Research 95, 270–278 (2017).
  • [10] Guadalupe, T. et al. Asymmetry within and around the human planum temporale is sexually dimorphic and influenced by genes involved in steroid hormone receptor activity. \JournalTitleCortex 62, 41–55 (2015).
  • [11] Hirnstein, M., Westerhausen, R., Korsnes, M. S. & Hugdahl, K. Sex differences in language asymmetry are age-dependent and small: A large-scale, consonant–vowel dichotic listening study with behavioral and fMRI data. \JournalTitleCortex 49, 1910–1921 (2013).
  • [12] Plessen, K. J., Hugdahl, K., Bansal, R., Hao, X. & Peterson, B. S. Sex, age, and cognitive correlates of asymmetries in thickness of the cortical mantle across the life span. \JournalTitleJournal of Neuroscience 34, 6294–6302 (2014).
  • [13] Sommer, I. E., Aleman, A., Somers, M., Boks, M. P. & Kahn, R. S. Sex differences in handedness, asymmetry of the planum temporale and functional language lateralization. \JournalTitleBrain Research 1206, 76–88 (2008).
  • [14] Levy, J. Lateral specialization of the human brain, behavioral manifestations and possible evolutionary basis. \JournalTitleThe Biology of Behavior (1972).
  • [15] Levy, J. Lateral differences in the human brain in cognition and behavioral control. \JournalTitleCerebral Correlates of Conscious Experience (1978).
  • [16] Jansen, A. et al. The assessment of hemispheric lateralization in functional MRI—robustness and reproducibility. \JournalTitleNeuroImage 33, 204–217 (2006).
  • [17] Kinsbourne, M. Hemineglect and hemisphere rivalry. \JournalTitleAdv. Neurol. 18, 41–49 (1977).
  • [18] Kolb, B. & Whishaw, I. Q. Fundamentals of Human Neuropsychology (Macmillan, 2009).
  • [19] Fox, M. D. & Raichle, M. E. Spontaneous fluctuations in brain activity observed with functional magnetic resonance imaging. \JournalTitleNature Reviews Neuroscience 8, 700–711 (2007).
  • [20] Logothetis, N. K. What we can do and what we cannot do with fMRI. \JournalTitleNature 453, 869–878 (2008).
  • [21] Sporns, O., Tononi, G. & Kötter, R. The human connectome: a structural description of the human brain. \JournalTitlePLoS Computational Biology 1, e42 (2005).
  • [22] Smith, S. M. et al. Temporally-independent functional modes of spontaneous brain activity. \JournalTitleProceedings of the National Academy of Sciences 109, 3131–3136 (2012).
  • [23] Finn, E. S. et al. Functional connectome fingerprinting: identifying individuals using patterns of brain connectivity. \JournalTitleNature Neuroscience 18, 1664–1671 (2015).
  • [24] Zuo, X.-N. et al. Growing together and growing apart: regional and sex differences in the lifespan developmental trajectories of functional homotopy. \JournalTitleJournal of Neuroscience 30, 15034–15043 (2010).
  • [25] Seghier, M. L. Laterality index in functional MRI: methodological issues. \JournalTitleMagnetic Resonance Imaging 26, 594–601 (2008).
  • [26] Liégeois, F. et al. A direct test for lateralization of language activation using fMRI: comparison with invasive assessments in children with epilepsy. \JournalTitleNeuroImage 17, 1861–1867 (2002).
  • [27] Friston, K. J. Functional and effective connectivity: a review. \JournalTitleBrain Connectivity 1, 13–36 (2011).
  • [28] Jacobsen, L. K., Slotkin, T. A., Mencl, W. E., Frost, S. J. & Pugh, K. R. Gender-specific effects of prenatal and adolescent exposure to tobacco smoke on auditory and visual attention. \JournalTitleNeuropsychopharmacology 32, 2453–2464 (2007).
  • [29] Schwarz, E. et al. Sex-specific serum biomarker patterns in adults with asperger’s syndrome. \JournalTitleMolecular Psychiatry 16, 1213–1220 (2011).
  • [30] Kret, M. E., Pichon, S., Grèzes, J. & Gelder, B. d. Men fear other men most: gender specific brain activations in perceiving threat from dynamic faces and bodies–an fMRI study. \JournalTitleFrontiers in Psychology 2, 3 (2011).
  • [31] Ingalhalikar, M. et al. Sex differences in the structural connectome of the human brain. \JournalTitleProceedings of the National Academy of Sciences 111, 823–828 (2014).
  • [32] Cui, Z., Su, M., Li, L., Shu, H. & Gong, G. Individualized prediction of reading comprehension ability using gray matter volume. \JournalTitleCerebral Cortex 28, 1656–1672 (2018).
  • [33] Zullo, S. & Caenazzo, L. Gene editing and gender-specific medicine: a challenge for dementia research. \JournalTitlePalgrave Communications 6, 1–6 (2020).
  • [34] Good, C. D. et al. Cerebral asymmetry and the effects of sex and handedness on brain structure: a voxel-based morphometric analysis of 465 normal adult human brains. \JournalTitleNeuroImage 14, 685–700 (2001).
  • [35] Smith, S. M. et al. Resting-state fMRI in the human connectome project. \JournalTitleNeuroImage 80, 144–168 (2013).
  • [36] Holmes, A. J. et al. Brain genomics superstruct project initial data release with structural, functional, and behavioral measures. \JournalTitleScientific Data 2, 1–16 (2015).
  • [37] Fan, L. et al. The human brainnetome atlas: a new brain atlas based on connectional architecture. \JournalTitleCerebral Cortex 26, 3508–3526 (2016).
  • [38] Diestel, R. Graph Theory (New York: Springer, 2000).
  • [39] Vapnik, V. The Nature of Statistical Learning Theory (Springer science & business media, 1999).
  • [40] Smith, D. V. et al. Characterizing individual differences in functional connectivity using dual-regression and seed-based approaches. \JournalTitleNeuroImage 95, 1–12 (2014).
  • [41] Zhang, C., Dougherty, C. C., Baum, S. A., White, T. & Michael, A. M. Functional connectivity predicts gender: Evidence for gender differences in resting brain connectivity. \JournalTitleHuman Brain Mapping 39, 1765–1776 (2018).
  • [42] Sen, B. & Parhi, K. K. Predicting biological gender and intelligence from fMRI via dynamic functional connectivity. \JournalTitleIEEE Trans. on Biomedical Engineering 68, 815–825 (2020).
  • [43] Kong, R. et al. Spatial topography of individual-specific cortical networks predicts human cognition, personality, and emotion. \JournalTitleCerebral Cortex 29, 2533–2551 (2019).
  • [44] Onitsuka, T. et al. Middle and inferior temporal gyrus gray matter volume abnormalities in chronic schizophrenia: an MRI study. \JournalTitleAmerican Journal of Psychiatry 161, 1603–1611 (2004).
  • [45] Koyama, M. S., O’Connor, D., Shehzad, Z. & Milham, M. P. Differential contributions of the middle frontal gyrus functional connectivity to literacy and numeracy. \JournalTitleScientific Reports 7, 17548 (2017).
  • [46] Lynch, J. C. The functional organization of posterior parietal association cortex. \JournalTitleBehavioral and Brain Sciences 3, 485–499 (1980).
  • [47] Hyva, J. et al. Regional distribution of functions in parietal association area 7 of the monkey. \JournalTitleBrain Research 206, 287–303 (1981).
  • [48] Greenlee, J. D. et al. Functional connections within the human inferior frontal gyrus. \JournalTitleJournal of Comparative Neurology 503, 550–559 (2007).
  • [49] Kortz, M. W. & Lillehei, K. O. Insular cortex. \JournalTitleStatPearls (2021).
  • [50] Hirnstein, M., Hugdahl, K. & Hausmann, M. Cognitive sex differences and hemispheric asymmetry: A critical review of 40 years of research. \JournalTitleLaterality: Asymmetries of Body, Brain and Cognition 24, 204–252 (2019).
  • [51] Duncan, J. The multiple-demand (md) system of the primate brain: mental programs for intelligent behaviour. \JournalTitleTrends in Cognitive Sciences 14, 172–179 (2010).
  • [52] Kansaku, K., Yamaura, A. & Kitazawa, S. Sex differences in lateralization revealed in the posterior language areas. \JournalTitleCerebral Cortex 10, 866–872 (2000).
  • [53] Cole, M. W., Ito, T., Bassett, D. S. & Schultz, D. H. Activity flow over resting-state networks shapes cognitive task activations. \JournalTitleNature Neuroscience 19, 1718–1726 (2016).
  • [54] van Dokkum, L. E. et al. Resting state network plasticity related to picture naming in low-grade glioma patients before and after resection. \JournalTitleNeuroImage: Clinical 24, 102010 (2019).
  • [55] Laws, K. R. Sex differences in lexical size across semantic categories. \JournalTitlePersonality and Individual Differences 36, 23–32 (2004).
  • [56] Ala-Salomäki, H., Kujala, J., Liljeström, M. & Salmelin, R. Picture naming yields highly consistent cortical activation patterns: Test–retest reliability of magnetoencephalography recordings. \JournalTitleNeuroImage 227, 117651 (2021).
  • [57] Kepinska, O., de Rover, M., Caspers, J. & Schiller, N. O. Connectivity of the hippocampus and broca’s area during acquisition of a novel grammar. \JournalTitleNeuroImage 165, 1–10 (2018).
  • [58] Lotz, A. & Kinder, A. Rapid communication: Sex differences in artificial grammar learning: Evidence for different strategies in men and women. \JournalTitleQuarterly Journal of Experimental Psychology 64, 417–424 (2011).
  • [59] Li, F. et al. Causal interactions with an insular-cortical network in mild traumatic brain injury. \JournalTitleEuropean Journal of Radiology 157, 110594 (2022).
  • [60] Vingerhoets, G. et al. Cerebral lateralization of praxis in right-and left-handedness: Same pattern, different strength. \JournalTitleHuman Brain Mapping 33, 763–777 (2012).
  • [61] Linnman, C., Beucke, J.-C., Jensen, K. B., Gollub, R. L. & Kong, J. Sex similarities and differences in pain-related periaqueductal gray connectivity. \JournalTitlePain 153, 444–454 (2012).
  • [62] Tuckute, G. et al. Frontal language areas do not emerge in the absence of temporal language areas: A case study of an individual born without a left temporal lobe. \JournalTitleNeuropsychologia 169, 108184 (2022).
  • [63] Corballis, M. C. Evolution of cerebral asymmetry. \JournalTitleProgress in Brain Research 250, 153–178 (2019).
  • [64] Vallortigara, G. & Rogers, L. Survival with an asymmetrical brain: advantages and disadvantages of cerebral lateralization. \JournalTitleBehavioral and Brain Sciences (2005).
  • [65] Laughlin, S. B. & Sejnowski, T. J. Communication in neuronal networks. \JournalTitleScience 301, 1870–1874 (2003).
  • [66] Lennie, P. The cost of cortical computation. \JournalTitleCurrent Biology 13, 493–497 (2003).
  • [67] Levy, W. B. & Baxter, R. A. Energy efficient neural codes. \JournalTitleNeural Computation 8, 531–543 (1996).
  • [68] Ringo, J. L., Doty, R. W., Demeter, S. & Simard, P. Y. Time is of the essence: a conjecture that hemispheric specialization arises from interhemispheric conduction delay. \JournalTitleCerebral Cortex 4, 331–343 (1994).
\nocitesupp

*

Methods

Dual classification with group-specific discriminant analysis (GSDA)

We propose a dual classification framework with two primary objectives: learning group-specific models and identifying group-specific discriminant weights. For the first objective, specifically the classification of left vs. right brain hemispheres, we train a linear classifier on the training data and then validate its performance on the test data. We refer to this process as the first-order classification. The weights derived from the model are called the first-order weights. Then we perform a second round of classification to identify the weights that show significant differences between group-specific models. Here, we train a linear classifier to differentiate between male- and female-specific models. This stage is called the second-order classification, and the associated weights are referred to as the second-order weights. This process is illustrated in stages ⑤ and ⑥ of Fig. 1a.

The first-order classification builds a (group-specific) prediction function. This function predicts whether an unseen brain hemisphere is left or right, based on a feature vector. These vectors represent the left or right human brain hemispheres and are extracted from the training neuroimaging data. The resulting prediction accuracy serves as a quantitative measure of the extent to which the learned lateralization patterns are generalized among the brain networks within the test set. The learned model weights can be interpreted as indicators of the significance or extent of differences between the corresponding connections of the left and right brain hemispheres.

The second-order classification is designed to identify weights that show significant differences between the male- and female-specific first-order models. In this stage, a linear classification model is trained on the first-order model weights to predict whether an unseen model is male- or female-specific. The features with larger weights in the second-order classification are considered to represent the stronger sex differences. We propose a group-specific discriminant analysis algorithm to learn group-specific models for the first-order classification.

Problem formulation of GSDA

Let (𝐱i,yi,𝐠i)subscript𝐱𝑖subscript𝑦𝑖subscript𝐠𝑖(\mathbf{x}_{i},y_{i},\mathbf{g}_{i})( bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) represent the i𝑖iitalic_ith sample, where 𝐱i𝒳psubscript𝐱𝑖𝒳superscript𝑝\mathbf{x}_{i}\in\mathcal{X}\subseteq\mathbb{R}^{p}bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_X ⊆ blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT denotes an input data vector, yi𝒴subscript𝑦𝑖𝒴y_{i}\in\mathcal{Y}italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_Y denotes an output variable (label), and 𝐠i𝒢qsubscript𝐠𝑖𝒢superscript𝑞\mathbf{g}_{i}\in\mathcal{G}\subseteq\mathbb{R}^{q}bold_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_G ⊆ blackboard_R start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT represents a covariate vector for the grouping factor(s). Here, i[1,m]𝑖1𝑚i\in[1,m]italic_i ∈ [ 1 , italic_m ], with m𝑚mitalic_m being the total number of samples. 𝒳𝒳\mathcal{X}caligraphic_X, 𝒴𝒴\mathcal{Y}caligraphic_Y and 𝒢𝒢\mathcal{G}caligraphic_G are the feature spaces of the input data, output label, and grouping factor, respectively, with p𝑝pitalic_p and q𝑞qitalic_q as the corresponding feature dimensions for the input data 𝐱isubscript𝐱𝑖\mathbf{x}_{i}bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and grouping factor 𝐠isubscript𝐠𝑖\mathbf{g}_{i}bold_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. In the context of this article, 𝐱isubscript𝐱𝑖\mathbf{x}_{i}bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is a feature vector that represents a brain hemisphere, yisubscript𝑦𝑖y_{i}italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT indicates whether 𝐱isubscript𝐱𝑖\mathbf{x}_{i}bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the left or right hemisphere, and 𝐠isubscript𝐠𝑖\mathbf{g}_{i}bold_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is a binary (zero and one) indicator representing whether 𝐱isubscript𝐱𝑖\mathbf{x}_{i}bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is from a male or female subject (e.g. gi=0subscript𝑔𝑖0g_{i}=0italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 0 for male and gi=1subscript𝑔𝑖1g_{i}=1italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1 for female). Assuming 𝐱0=1subscript𝐱01\mathbf{x}_{0}=1bold_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 1, considering 𝐰0subscript𝐰0\mathbf{w}_{0}bold_w start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT as the bias term, and denoting 𝐰p+1𝐰superscript𝑝1\mathbf{w}\in\mathbb{R}^{p+1}bold_w ∈ blackboard_R start_POSTSUPERSCRIPT italic_p + 1 end_POSTSUPERSCRIPT as the vector of weights (coefficients) to be learned, with the target group represented as subscript tt{}_{\mathrm{t}}start_FLOATSUBSCRIPT roman_t end_FLOATSUBSCRIPT, we formulate the objective of learning group-specific models as follows:

argmax𝐰1mti=1mt(yi|𝐱i,𝐰)+λmj=1m(𝐠j|𝐱j,𝐰),subscriptarg𝐰1subscript𝑚tsuperscriptsubscript𝑖1subscript𝑚tconditionalsubscript𝑦𝑖subscript𝐱𝑖𝐰𝜆𝑚superscriptsubscript𝑗1𝑚conditionalsubscript𝐠𝑗subscript𝐱𝑗𝐰\operatorname*{arg\max}_{\mathbf{w}}\frac{1}{m_{\mathrm{t}}}\sum_{i=1}^{m_{% \mathrm{t}}}\mathbb{P}(y_{i}|\mathbf{x}_{i},\mathbf{w})+\frac{\lambda}{m}\sum_% {j=1}^{m}\mathbb{P}(\mathbf{g}_{j}|\mathbf{x}_{j},\mathbf{w}),start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT bold_w end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_m start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT blackboard_P ( italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_w ) + divide start_ARG italic_λ end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT blackboard_P ( bold_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | bold_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , bold_w ) , (1)

where mtsubscript𝑚tm_{\mathrm{t}}italic_m start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT denotes the number of training samples from the target group, and λ0𝜆0\lambda\geq 0italic_λ ≥ 0 is the hyperparameter that quantifies the importance of grouping factor(s) dependence. Based on Eq. (1), we formulate a general group-specific discriminant analysis (GSDA) framework as:

argmin𝐰L(𝐗t𝐰,𝐲t)+α𝐰K2λρ(𝐗𝐰,𝐆)Group dependence,subscriptarg𝐰𝐿superscriptsubscript𝐗ttop𝐰subscript𝐲t𝛼superscriptsubscriptnorm𝐰𝐾2𝜆subscript𝜌superscript𝐗top𝐰𝐆Group dependence\operatorname*{arg\min}_{\mathbf{w}}L(\mathbf{X}_{\mathrm{t}}^{\top}\mathbf{w}% ,\mathbf{y}_{\mathrm{t}})+\alpha\|\mathbf{w}\|_{K}^{2}-\lambda\underbrace{\rho% (\mathbf{X}^{\top}\mathbf{w},\mathbf{G})}_{\text{Group dependence}},start_OPERATOR roman_arg roman_min end_OPERATOR start_POSTSUBSCRIPT bold_w end_POSTSUBSCRIPT italic_L ( bold_X start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_w , bold_y start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT ) + italic_α ∥ bold_w ∥ start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_λ under⏟ start_ARG italic_ρ ( bold_X start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_w , bold_G ) end_ARG start_POSTSUBSCRIPT Group dependence end_POSTSUBSCRIPT , (2)

where L(,)𝐿L(\cdot,\cdot)italic_L ( ⋅ , ⋅ ) denotes a classification or regression loss function, such as least square, logistic, or hinge, α0𝛼0\alpha\geq 0italic_α ≥ 0 is the hyperparameter used for weight regularization, K2\|\cdot\|_{K}^{2}∥ ⋅ ∥ start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT denotes either an 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT or 2subscript2\ell_{2}roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT regularization, with K=1𝐾1K=1italic_K = 1 or 2222 respectively, 𝐗tsubscript𝐗t\mathbf{X}_{\mathrm{t}}bold_X start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT denotes the target group’s training samples, 𝐗𝐗\mathbf{X}bold_X denotes all training samples that consist of both target and non-target group samples, and ρ(,)𝜌\rho(\cdot,\cdot)italic_ρ ( ⋅ , ⋅ ) is a statistical dependence measure. In this work, we employed Hilbert-Schmidt Independence Criterion (HSIC) \citesuppgretton2005measuring. Given two sets 𝐗={𝐱1,𝐱2,,𝐱m}𝐗subscript𝐱1subscript𝐱2subscript𝐱𝑚\mathbf{X}=\{\mathbf{x}_{1},\mathbf{x}_{2},\dots,\mathbf{x}_{m}\}bold_X = { bold_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , bold_x start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } and 𝐘={𝐲1,𝐲2,,𝐲m}𝐘subscript𝐲1subscript𝐲2subscript𝐲𝑚\mathbf{Y}=\{\mathbf{y}_{1},\mathbf{y}_{2},\dots,\mathbf{y}_{m}\}bold_Y = { bold_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , bold_y start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT }, both with size m𝑚mitalic_m, HSIC compute the statistical dependence between tests whether 𝐗𝐗\mathbf{X}bold_X and 𝐘𝐘\mathbf{Y}bold_Y via

ρh(𝐗,𝐘)=1(m1)2tr(𝐊𝐇𝐋𝐇),subscript𝜌𝐗𝐘1superscript𝑚12tr𝐊𝐇𝐋𝐇\rho_{h}(\mathbf{X,Y})=\frac{1}{(m-1)^{2}}\text{tr}(\mathbf{KHLH}),italic_ρ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( bold_X , bold_Y ) = divide start_ARG 1 end_ARG start_ARG ( italic_m - 1 ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG tr ( bold_KHLH ) , (3)

where 𝐊,𝐇,𝐋m×m𝐊𝐇𝐋superscript𝑚𝑚\mathbf{K,H,L}\in\mathbb{R}^{m\times m}bold_K , bold_H , bold_L ∈ blackboard_R start_POSTSUPERSCRIPT italic_m × italic_m end_POSTSUPERSCRIPT, 𝐊i,j:=kx(𝐱i,𝐱j)assignsubscript𝐊𝑖𝑗subscript𝑘𝑥subscript𝐱𝑖subscript𝐱𝑗\mathbf{K}_{i,j}:=k_{x}(\mathbf{x}_{i},\mathbf{x}_{j})bold_K start_POSTSUBSCRIPT italic_i , italic_j end_POSTSUBSCRIPT := italic_k start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ( bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ), 𝐋i,j:=ky(𝐲i,𝐲j)assignsubscript𝐋𝑖𝑗subscript𝑘𝑦subscript𝐲𝑖subscript𝐲𝑗\mathbf{L}_{i,j}:=k_{y}(\mathbf{y}_{i},\mathbf{y}_{j})bold_L start_POSTSUBSCRIPT italic_i , italic_j end_POSTSUBSCRIPT := italic_k start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT ( bold_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ), kx(,)subscript𝑘𝑥k_{x}(\cdot,\cdot)italic_k start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ( ⋅ , ⋅ ) and ky(,)subscript𝑘𝑦k_{y}(\cdot,\cdot)italic_k start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT ( ⋅ , ⋅ ) are two kernel functions, such as linear, polynomial, or radial basis function (RBF), 𝐇=𝐈1m𝟏𝟏𝐇𝐈1𝑚superscript11top\mathbf{H}=\mathbf{I}-\frac{1}{m}\mathbf{11^{\top}}bold_H = bold_I - divide start_ARG 1 end_ARG start_ARG italic_m end_ARG bold_11 start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT is the centering matrix, 𝐈𝐈\mathbf{I}bold_I is an identity matrix, and tr()tr\text{tr}\left(\cdot\right)tr ( ⋅ ) is the trace function. HSIC ρ(𝐗,𝐘)0𝜌𝐗𝐘0\rho(\mathbf{X,Y})\geq 0italic_ρ ( bold_X , bold_Y ) ≥ 0, and it is zero if and only if the two sets of variables 𝐗𝐗\mathbf{X}bold_X and 𝐘𝐘\mathbf{Y}bold_Y are independent, i.e., (𝐱,𝐲)=(𝐱)(𝐲)𝐱𝐲𝐱𝐲\mathbb{P}(\mathbf{x,y})=\mathbb{P}(\mathbf{x})\mathbb{P}(\mathbf{y})blackboard_P ( bold_x , bold_y ) = blackboard_P ( bold_x ) blackboard_P ( bold_y ). A higher HSIC value suggests stronger statistical dependence.

GSDA with logistic loss and maximum likelihood estimation

To maximize the likelihood of the target-group labels and the grouping factor(s) dependence as specified in Eq. (1), we adopt maximum likelihood estimation for optimizing the model weights 𝐰𝐰\mathbf{w}bold_w. Here, we develop a novel algorithm, Group-Specific Discriminant Analysis with logistic loss (GSDA-Logit), as a variant of logistic regression for group-dependent learning. The overall likelihood to be maximized is as follows:

(𝐲t|𝐗t,𝐰)(𝐰)(𝐆|𝐗,𝐰)=(i=1mtS(𝐰𝐱i)yi(1S(𝐰𝐱i))(1yi))12πσe(𝐰𝐰2σ2)S(ρh(𝐰𝐗,𝐆)),conditionalsubscript𝐲tsubscript𝐗t𝐰𝐰conditional𝐆𝐗𝐰superscriptsubscriptproduct𝑖1subscript𝑚t𝑆superscriptsuperscript𝐰topsubscript𝐱𝑖subscript𝑦𝑖superscript1𝑆superscript𝐰topsubscript𝐱𝑖1subscript𝑦𝑖12𝜋𝜎𝑒superscript𝐰top𝐰2superscript𝜎2𝑆subscript𝜌superscript𝐰top𝐗𝐆\displaystyle\begin{aligned} \mathbb{P}(\mathbf{y}_{\mathrm{t}}|\mathbf{X}_{% \mathrm{t}},\mathbf{w})\mathbb{P}(\mathbf{w})\mathbb{P}(\mathbf{G}|\mathbf{X},% \mathbf{w})=\big{(}\prod_{i=1}^{m_{\mathrm{t}}}S(\mathbf{w^{\top}x}_{i})^{y_{i% }}(1-S(\mathbf{w^{\top}x}_{i}))^{(1-y_{i})}\big{)}\frac{1}{\sqrt{2\pi\sigma}}e% \big{(}-\frac{\mathbf{w^{\top}w}}{2\sigma^{2}}\big{)}S(\rho_{h}(\mathbf{w^{% \top}X},\mathbf{G})),\end{aligned}start_ROW start_CELL blackboard_P ( bold_y start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT | bold_X start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT , bold_w ) blackboard_P ( bold_w ) blackboard_P ( bold_G | bold_X , bold_w ) = ( ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_S ( bold_w start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( 1 - italic_S ( bold_w start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT ( 1 - italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ) divide start_ARG 1 end_ARG start_ARG square-root start_ARG 2 italic_π italic_σ end_ARG end_ARG italic_e ( - divide start_ARG bold_w start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_w end_ARG start_ARG 2 italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) italic_S ( italic_ρ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( bold_w start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_X , bold_G ) ) , end_CELL end_ROW (4)

where (𝐲t|𝐗t,𝐰)conditionalsubscript𝐲tsubscript𝐗t𝐰\mathbb{P}(\mathbf{y}_{\mathrm{t}}|\mathbf{X}_{\mathrm{t}},\mathbf{w})blackboard_P ( bold_y start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT | bold_X start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT , bold_w ) is the likelihood of target labels 𝐲tsubscript𝐲t\mathbf{y}_{\mathrm{t}}bold_y start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT given the model and target-group data 𝐗tsubscript𝐗t\mathbf{X}_{\mathrm{t}}bold_X start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT, (𝐰)𝐰\mathbb{P}(\mathbf{w})blackboard_P ( bold_w ) is the prior probability of weights in 𝐰𝐰\mathbf{w}bold_w, (𝐆|𝐗,𝐰)conditional𝐆𝐗𝐰\mathbb{P}(\mathbf{G}|\mathbf{X},\mathbf{w})blackboard_P ( bold_G | bold_X , bold_w ) is the likelihood of grouping factor dependence, S()𝑆S(\cdot)italic_S ( ⋅ ) denotes the logistic (or sigmoid) function, and (𝐰)𝐰\mathbb{P}(\mathbf{w})blackboard_P ( bold_w ) can be interpreted as the 2subscript2\ell_{2}roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT regularization for 𝐰𝐰\mathbf{w}bold_w, assuming that the weights in 𝐰𝐰\mathbf{w}bold_w follow a normal distribution with a mean of zero and a standard deviation of σ𝜎\sigmaitalic_σ. Given that 𝐰𝐗superscript𝐰top𝐗\mathbf{w^{\top}X}bold_w start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_X produces a row vector, Equation (3) can be reformulated in the form of simplified HSIC \citesuppzhou2020side:

ρsh(𝐰𝐗,𝐆)=tr((𝐰𝐗)(𝐰𝐗)𝐇𝐋𝐇)=𝐰𝐗𝐇𝐋𝐇𝐗𝐰,subscript𝜌𝑠superscript𝐰top𝐗𝐆absenttrsuperscriptsuperscript𝐰top𝐗topsuperscript𝐰top𝐗𝐇𝐋𝐇missing-subexpressionabsentsuperscript𝐰topsuperscript𝐗𝐇𝐋𝐇𝐗top𝐰\displaystyle\begin{aligned} \rho_{sh}(\mathbf{w^{\top}X},\mathbf{G})&=\text{% tr}((\mathbf{w^{\top}X})^{\top}(\mathbf{w^{\top}X})\mathbf{HL}\mathbf{H})\\ &=\mathbf{w}^{\top}\mathbf{XHLHX}^{\top}\mathbf{w},\end{aligned}start_ROW start_CELL italic_ρ start_POSTSUBSCRIPT italic_s italic_h end_POSTSUBSCRIPT ( bold_w start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_X , bold_G ) end_CELL start_CELL = tr ( ( bold_w start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_X ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( bold_w start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_X ) bold_HLH ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = bold_w start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_XHLHX start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_w , end_CELL end_ROW (5)

where 𝐋=𝐆𝐆𝐋superscript𝐆top𝐆\mathbf{L}=\mathbf{G^{\top}G}bold_L = bold_G start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_G. By replacing ρh(𝐰𝐗,𝐆)subscript𝜌superscript𝐰top𝐗𝐆\rho_{h}(\mathbf{w^{\top}X},\mathbf{G})italic_ρ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( bold_w start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_X , bold_G ) with the simplified HSIC ρsh(𝐰𝐗,𝐆)subscript𝜌𝑠superscript𝐰top𝐗𝐆\rho_{sh}(\mathbf{w^{\top}X},\mathbf{G})italic_ρ start_POSTSUBSCRIPT italic_s italic_h end_POSTSUBSCRIPT ( bold_w start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_X , bold_G ), the likelihood can be rewritten as

(𝐲t|𝐗t,𝐰)(𝐰)(𝐆|𝐗,𝐰)=(i=1mtS(𝐰𝐱i)yi(1S(𝐰𝐱i))(1yi))12πσe(𝐰𝐰2σ2)S(𝐰𝐗𝐇𝐋𝐇𝐗𝐰).conditionalsubscript𝐲tsubscript𝐗t𝐰𝐰conditional𝐆𝐗𝐰superscriptsubscriptproduct𝑖1subscript𝑚t𝑆superscriptsuperscript𝐰topsubscript𝐱𝑖subscript𝑦𝑖superscript1𝑆superscript𝐰topsubscript𝐱𝑖1subscript𝑦𝑖12𝜋𝜎𝑒superscript𝐰top𝐰2superscript𝜎2𝑆superscript𝐰topsuperscript𝐗𝐇𝐋𝐇𝐗top𝐰\displaystyle\begin{aligned} \mathbb{P}(\mathbf{y}_{\mathrm{t}}|\mathbf{X}_{% \mathrm{t}},\mathbf{w})\mathbb{P}(\mathbf{w})\mathbb{P}(\mathbf{G}|\mathbf{X},% \mathbf{w})=\big{(}\prod_{i=1}^{m_{\mathrm{t}}}S(\mathbf{w^{\top}x}_{i})^{y_{i% }}(1-S(\mathbf{w^{\top}x}_{i}))^{(1-y_{i})}\big{)}\frac{1}{\sqrt{2\pi\sigma}}e% \big{(}-\frac{\mathbf{w^{\top}w}}{2\sigma^{2}}\big{)}S(\mathbf{w^{\top}XHLHX^{% \top}w}).\end{aligned}start_ROW start_CELL blackboard_P ( bold_y start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT | bold_X start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT , bold_w ) blackboard_P ( bold_w ) blackboard_P ( bold_G | bold_X , bold_w ) = ( ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_S ( bold_w start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( 1 - italic_S ( bold_w start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT ( 1 - italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ) divide start_ARG 1 end_ARG start_ARG square-root start_ARG 2 italic_π italic_σ end_ARG end_ARG italic_e ( - divide start_ARG bold_w start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_w end_ARG start_ARG 2 italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) italic_S ( bold_w start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_XHLHX start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_w ) . end_CELL end_ROW (6)

The likelihood in Eq. (6) can be maximized using the same optimization steps for a standard logistic regression, i.e., computing the gradient of the negative log-likelihood. Let α=1σ2𝛼1superscript𝜎2\alpha=\frac{1}{\sigma^{2}}italic_α = divide start_ARG 1 end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG and λ𝜆\lambdaitalic_λ denote the two hyperparameters that control the importance of the 2subscript2\ell_{2}roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT regularization and grouping factor dependence regularization, respectively. Let J(𝐰)𝐽𝐰J(\mathbf{w})italic_J ( bold_w ) denote the negative logarithm of the likelihood. Taking the gradient of J(𝐰)𝐽𝐰J(\mathbf{w})italic_J ( bold_w ) with respect to 𝐰𝐰\mathbf{w}bold_w, we obtain

J(𝐰)=𝐗t(S(𝐗t𝐰)𝐲t)+α𝐰+λ(S(𝐰𝐗𝐇𝐋𝐇𝐗𝐰)1)𝐗𝐇𝐋𝐇𝐗𝐰.𝐽𝐰subscript𝐗t𝑆subscriptsuperscript𝐗topt𝐰subscript𝐲t𝛼𝐰𝜆𝑆superscript𝐰topsuperscript𝐗𝐇𝐋𝐇𝐗top𝐰1superscript𝐗𝐇𝐋𝐇𝐗top𝐰\displaystyle\begin{aligned} \nabla J(\mathbf{w})=\mathbf{X}_{\mathrm{t}}(S(% \mathbf{X}^{\top}_{\mathrm{t}}\mathbf{w})-\mathbf{y}_{\mathrm{t}})+\alpha% \mathbf{w}+\lambda(S(\mathbf{w^{\top}XHLHX^{\top}w})-1)\mathbf{XHLHX^{\top}w}.% \end{aligned}start_ROW start_CELL ∇ italic_J ( bold_w ) = bold_X start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT ( italic_S ( bold_X start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT bold_w ) - bold_y start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT ) + italic_α bold_w + italic_λ ( italic_S ( bold_w start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_XHLHX start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_w ) - 1 ) bold_XHLHX start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_w . end_CELL end_ROW (7)

Finally, 𝐰𝐰\mathbf{w}bold_w can be optimized iteratively via

𝐰k+1=𝐰kηJ(𝐰k),superscript𝐰𝑘1superscript𝐰𝑘𝜂𝐽superscript𝐰𝑘\mathbf{w}^{k+1}=\mathbf{w}^{k}-\eta\nabla J(\mathbf{w}^{k}),bold_w start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT = bold_w start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT - italic_η ∇ italic_J ( bold_w start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ) , (8)

where k𝑘kitalic_k denotes the k𝑘kitalic_kth iteration, η𝜂\etaitalic_η is the learning rate (step size). Algorithm 1 is the pseudocode for GSDA-Logit. In addition to standard gradient descent optimization, we have implemented the LBFGS optimization algorithm \citesuppliu1989limited for faster coefficient estimation.

Algorithm 1 Group-Specific Discriminant Analysis with logistic loss (GSDA-Logit)

Input: Input data matrix 𝐗p×m𝐗superscript𝑝𝑚\mathbf{X}\in\mathbb{R}^{p\times m}bold_X ∈ blackboard_R start_POSTSUPERSCRIPT italic_p × italic_m end_POSTSUPERSCRIPT, target-group label vector 𝐲tmtsubscript𝐲tsuperscriptsubscript𝑚t\mathbf{y}_{\mathrm{t}}\in\mathbb{R}^{m_{\mathrm{t}}}bold_y start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, grouping factor(s), and indices of samples from the target group (optional, if not given, first mt<msubscript𝑚t𝑚m_{\mathrm{t}}<mitalic_m start_POSTSUBSCRIPT roman_t end_POSTSUBSCRIPT < italic_m samples are assumed to be the labeled target samples).
hyperparameters: α𝛼\alphaitalic_α for 2subscript2\ell_{2}roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT regularization, λ𝜆\lambdaitalic_λ for group dependence (HSIC) regularization, and η𝜂\etaitalic_η for learning rate.
Output: Coefficient vector 𝐰p+1𝐰superscript𝑝1\mathbf{w}\in\mathbb{R}^{p+1}bold_w ∈ blackboard_R start_POSTSUPERSCRIPT italic_p + 1 end_POSTSUPERSCRIPT.

1:Encode the grouping factor(s) into a matrix 𝐆q×m𝐆superscript𝑞𝑚\mathbf{G}\in\mathbb{R}^{q\times m}bold_G ∈ blackboard_R start_POSTSUPERSCRIPT italic_q × italic_m end_POSTSUPERSCRIPT (q=1𝑞1q=1italic_q = 1 for a binary grouping factor) using one-hot encoding. Then construct the kernel matrix 𝐋m×m=𝐆𝐆𝐋superscript𝑚𝑚superscript𝐆top𝐆\mathbf{L}\in\mathbb{R}^{m\times m}=\mathbf{G}^{\top}\mathbf{G}bold_L ∈ blackboard_R start_POSTSUPERSCRIPT italic_m × italic_m end_POSTSUPERSCRIPT = bold_G start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_G and the centering matrix 𝐇m×m𝐇superscript𝑚𝑚\mathbf{H}\in\mathbb{R}^{m\times m}bold_H ∈ blackboard_R start_POSTSUPERSCRIPT italic_m × italic_m end_POSTSUPERSCRIPT;
2:Add a row of 1111s to 𝐗𝐗\mathbf{X}bold_X;
3:Randomly initialize 𝐰ksuperscript𝐰𝑘\mathbf{w}^{k}bold_w start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT (k=0𝑘0k=0italic_k = 0);
4:while Not converge do
5:     Compute gradient J(𝐰)𝐽𝐰\nabla J(\mathbf{w})∇ italic_J ( bold_w ) by Eq. (7);
6:     Update 𝐰k+1=𝐰kηJ(𝐰k)superscript𝐰𝑘1superscript𝐰𝑘𝜂𝐽superscript𝐰𝑘\mathbf{w}^{k+1}=\mathbf{w}^{k}-\eta\nabla J(\mathbf{w}^{k})bold_w start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT = bold_w start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT - italic_η ∇ italic_J ( bold_w start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT );
7:end while
8:return GSDA-Logit coefficient vector 𝐰𝐰\mathbf{w}bold_w.

Evaluation: group specificity index (GSI)

To measure the group specificity, we set the following criteria for the metric

  • The value of this metric lies within [0,1]01[0,1][ 0 , 1 ].

  • The value of this metric is 0 if the test accuracy for the target and non-target groups are identical.

  • When the test accuracy of the target and non-target groups differ, the value of this metric should be proportional to 1) the absolute accuracy for the target group, and 2) the closeness of accuracy for the non-target group to the random chance.

  • Greater relative accuracy divergence between target and non-target groups will result in a higher value of this metric.

To satisfy the above conditions, we propose a Group Specificity Index (GSI) for binary classification problems as follows:

GSI=2BAT(BAT0.5|BANT0.5|),GSI2BATBAT0.5BANT0.5\mathrm{GSI}=2\mathrm{BAT}(\mathrm{BAT}-0.5-|\mathrm{BANT}-0.5|),roman_GSI = 2 roman_B roman_A roman_T ( roman_BAT - 0.5 - | roman_BANT - 0.5 | ) , (9)

where BAT[0.5,1]BAT0.51\mathrm{BAT}\in[0.5,1]roman_BAT ∈ [ 0.5 , 1 ] represents the balanced accuracy of the target-group data, and BANT[0,1]BANT01\mathrm{BANT}\in[0,1]roman_BANT ∈ [ 0 , 1 ] denotes the balanced accuracy of the non-target-group data. Balanced accuracy is chosen to mitigate the impact of imbalanced samples. It is defined as BA=TPR+TNR2BATPRTNR2\mathrm{BA}=\frac{\mathrm{TPR}+\mathrm{TNR}}{2}roman_BA = divide start_ARG roman_TPR + roman_TNR end_ARG start_ARG 2 end_ARG, where the true positive rate TPR=number of true positivesnumber of total positivesTPRnumber of true positivesnumber of total positives\mathrm{TPR}=\frac{\text{number of true positives}}{\text{number of total % positives}}roman_TPR = divide start_ARG number of true positives end_ARG start_ARG number of total positives end_ARG and the true negative rate TNR=number of true negativesnumber of total negativesTNRnumber of true negativesnumber of total negatives\mathrm{TNR}=\frac{\text{number of true negatives}}{\text{number of total % negatives}}roman_TNR = divide start_ARG number of true negatives end_ARG start_ARG number of total negatives end_ARG. In the left vs. right brain hemisphere classification problem, the numbers of left and right training examples are equal, making balanced accuracy equivalent to accuracy. The expression |BANT0.5|BANT0.5|\mathrm{BANT}-0.5|| roman_BANT - 0.5 | measures how close the accuracy of the non-target group is to random chance (0.5), and BAT0.5|BANT0.5|BAT0.5BANT0.5\mathrm{BAT}-0.5-|\mathrm{BANT}-0.5|roman_BAT - 0.5 - | roman_BANT - 0.5 | quantifies the relative accuracy divergence between the target and non-target groups. Since we are interested in generalized lateralization patterns for the target group, models that perform worse than random chance on target test sets are not considered.

Resting-State fMRI data and processing

We use resting-state fMRI data from the Human Connectome Project (HCP) [35] and the Genomics Superstruct Project (GSP) [36] for brain hemisphere classification to study lateralization. Table 2 summarizes the demographic information of the subjects involved in our experiments across both datasets.

Human Connectome Project (HCP)

Acquisition

All MRI data were collected using the same 3T Siemens Skyra magnetic resonance machines at Washington University in St. Louis with a 32-channel head coil \citesuppvan2013wu. Specifically, rs-fMRI was acquired using a gradient-echo echo-planar imaging (GE-EPI) sequence with the following parameters: repetition time (TR) = 720 ms, echo time (TE) = 33.1 ms, flip angle (FA) = 52°, bandwidth = 2290 Hz/pixel, field of view (FOV) =208×180 mm2absent208180superscript mm2=208\times 180\text{ mm}^{2}= 208 × 180 mm start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, matrix =104×90absent10490=104\times 90= 104 × 90, voxel size =2×2×2 mm3absent222superscript mm3=2\times 2\times 2\text{ mm}^{3}= 2 × 2 × 2 mm start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT, multi-band acceleration factor =8absent8=8= 8, slices = 72, and total scan time of 1200 frames = 14 min and 24 s [35]. During the scan, participants were asked to open their eyes and stare at a white cross on a screen with a black background. There were two rs-fMRI sessions (REST1 and REST2) acquired on two consecutive days, each including two runs with a left-to-right (LR) and a right-to-left (RL) phase encoding direction. The T1-weighted images were acquired by using a magnetized rapid gradient-echo imaging (MPRAGE) sequence with the following parameters: TR = 2,400 ms, TE = 2.14 ms, reversal time (TI) = 1,000 ms, FA = 8°, FOV =224×224 mm2absent224224superscript mm2=224\times 224\text{ mm}^{2}= 224 × 224 mm start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, voxel size 0.7 mm isotropic, and total scan time = 7 min and 40 s.

Preprocessing

The HCP minimal preprocessing pipeline (version 2.0) was utilized, which included steps for magnetic gradient distortion correction, EPI distortion correction, non-brain tissue removal, Montreal Neurological Institute (MNI) standard space registration, and intensity normalization \citesuppglasser2013minimal. Resultant data were denoised using independent component analysis (ICA) facilitated by the FIX tool \citesuppsalimi2014automatic. This tool effectively identifies and eliminates spatiotemporal signal components that originate from non-neuronal or structural noise, with an emphasis on head movement [35]. Subsequently, five post-processing steps were utilized based on the minimal preprocessed data, including 1) spatial smoothing with 4 mm FWHM kernel, twice the voxel resolution of HCP fMRI data \citesuppfox2005human; 2) linear detrending to minimize the effects of low-frequency drift \citesupplowe1999treatment; 3) regression of a suite of nuisance variables unrelated to neural signals, such as average signals from white matter (WM) and cerebrospinal fluid (CSF), as well as the whole brain (global signal, GS)\citesupphallquist2013nuisance, fox2009global; 4) bandpass filtering (0.01Hz-0.1Hz) \citesuppthompson2015frequency,fox2005human; 5) scrubbing to control effects of transient movement across the time series frames \citesupppower2012spurious.

Genomics Superstruct Project (GSP)

Acquisition

All imaging data were collected on matched 3T Tim Trio scanners (Siemens Healthcare, Erlangen, Germany) at Harvard University and Massachusetts General Hospital using the vendor-supplied 12-channel phased-array head coil [36]. Structural data included a high-resolution (1.2 mm isotropic) multi-echo T1-weighted magnetization-prepared gradient-echo image. Functional imaging data were acquired using a GE-EPI sequence sensitive to blood oxygenation level-dependent (BOLD) contrast with the following parameters: TR = 3,000 ms, TE = 30 ms, FA = 85°, voxel size =3×3×3 mm3absent333superscript mm3=3\times 3\times 3\text{ mm}^{3}= 3 × 3 × 3 mm start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT, slices = 47, and total scan time of 124 frames = 6 min and 12 s.

Preprocessing

All fMRI data were preprocessed by SPM \citesuppfriston2003statistical and GRETNA \citesuppwang2015gretna toolkit, including the following steps: 1) removing the first four volumes to assure that the magnetization is at steady state \citesuppparkes2018evaluation; 2) slice-timing correction; 3) realignment of all volumes to the first volume to reduce the effects of head motion\citesuppparkes2018evaluation; 4) co-registration of GE-EPI data to the native, cropped, high-resolution structural image and then normalizing them to the MNI space through Diffeomorphic Anatomical Registration Through Exponentiated Lie Algebra (DARTEL) algorithm\citesuppashburner2007fast; 5) spatial smoothing with a 6 mm𝑚𝑚mmitalic_m italic_m FWHM kernel, twice the voxel resolution of GSP fMRI data \citesuppfox2005human; 6) linear detrending to minimize the effects of low-frequency drift \citesupplowe1999treatment; 7) six head motion parameters regression \citesuppyan2013addressing, as well as the WM, CSF and GS \citesupphallquist2013nuisance, fox2009global; 8) lowpass filtering (<0.08Hz) \citesuppyeo2011organization.

Extracting intrahemispheric brain network

We use intrahemispheric brain network connectivity as features to represent brain hemispheres. Figure 1a ①-③ illustrates the data processing workflow for obtaining intrahemispheric connections from resting-state time series. Time sequences were extracted using the Brainnetome atlas (BNA) [37], which divides the human brain into 246 regions (123 per hemisphere). Pearson correlation was computed to represent the connectivity between brain regions. Following Liang et al. \citesuppliang2014neuroplasticity, the correlation coefficients were transformed into z𝑧zitalic_z-scores using Fisher’s z𝑧zitalic_z transform. For HCP data, we averaged z𝑧zitalic_z-scores across the RL and LR runs for each session. To extract half-brain features, we reordered the columns and rows of the connectivity matrix to produce two 123 ×\times× 123 matrices, representing the intrahemispheric networks for the two brain hemispheres of each subject. We then extracted the upper (or lower) triangle of these matrices (illustrated as the red and blue areas in ③ of Fig. 1) to form two 7503-dimensional feature vectors by BNA for the two hemispheres for experiments.

Table 2: Information of HCP and GSP dataset used for the experiments, where “M” denotes male and “F” denotes female for sex, “L” denotes left-handedness, “R” denotes right-handedness, and “A” denotes ambidexterity for handedness, and “SD” denotes standard deviation.
Dataset # Total Subjects Sex (M/F) Handedness (L/R/A) Age (SD) # Sessions
HCP [35] 960 445/515 85/875/0 28.7 (3.71) 2
GSP [36] 1570 665/905 110/1449/11 21.5 (2.89) 1

Experimental setting

Multivariate classification algorithm setup

For all multivariate methods, the classification problem is binary: left brain hemispheres are labeled as 0, and right brain hemispheres are labeled as 1. For GSDA-Logit, sex is utilized as the grouping factor in the experiments, encoding males as 0 and females as 1. Given the binary nature of the grouping factor, the matrix 𝐆𝐆\mathbf{G}bold_G simplifies to a vector 𝐠𝐠\mathbf{g}bold_g in this experiment. There are two hyperparameters in GSDA-Logit: α𝛼\alphaitalic_α and λ𝜆\lambdaitalic_λ. In this experiment, the value of α𝛼\alphaitalic_α is set to 0.1, while λ[0,1.0,2.0,5.0,8.0,10.0]𝜆01.02.05.08.010.0\lambda\in[0,1.0,2.0,5.0,8.0,10.0]italic_λ ∈ [ 0 , 1.0 , 2.0 , 5.0 , 8.0 , 10.0 ]. When λ=0𝜆0\lambda=0italic_λ = 0, indicating an absence of the grouping factor dependence for optimizing model weights, GSDA-Logit degenerates to a standard logistic regression for target-group data. A logistic regression classifier implemented in scikit-learn \citesuppscikit-learn with default hyperparameters is used for learning both first-order multivariate control models and conducting second-order classification.

Cross-validation strategy

First-order classification setting

Given that each participant provides two brain hemispheres (left and right), the correlation between these hemispheres might impact the effectiveness of machine learning models. To address this, we divided our subjects into two groups: for 50% of the subjects, we used their left hemispheres for training, and for the remaining 50%, we used their right hemispheres. The hemispheres not selected for training were then used for testing, as illustrated in ④ of Fig. 1a. To further validate our findings, we employed an alternative cross-validation method. This method involves holding 20% of the subjects as unseen to the models, serving as an additional test set. Training samples are drawn from the remaining 80% using the same selection process mentioned above. Since HCP data includes two scanning sessions per subject on two different days, the session not involved in training serves as an additional test set. Each cross-validation strategy was iterated 1,000 times, resulting in 1,000 models for each learning task.

Second-order classification setting

With the 1,000 first-order models for each task learned, we perform a second-order classification through the following steps:

  1. 1.

    Select classification problem of interest, for example, male-specific GSDA models trained on HCP with α=0.1𝛼0.1\alpha=0.1italic_α = 0.1, λ=5𝜆5\lambda=5italic_λ = 5 vs . female-specific GSDA models trained on HCP with α=0.1𝛼0.1\alpha=0.1italic_α = 0.1, λ=5𝜆5\lambda=5italic_λ = 5, with 1,000 models for each group.

  2. 2.

    Perform a random stratified split of the 2,000 models into training (80%) and test (20%) sets.

  3. 3.

    Train a standard logistic regression classifier using the scikit-learn \citesuppscikit-learn implementation with default hyperparameters on the training set and then evaluate the performance on the test set.

  4. 4.

    Repeat steps 2 and 3 using different random seeds to split the training and test sets 1,000 times.

Data availability

This study used publicly available data from HCP (https://www.humanconnectome.org/) and GSP (https://www.neuroinfo.org/gsp/). Original data can be accessed via data use agreements. Processed intrahemispheric network data for classification is available at https://doi.org/10.5281/zenodo.10050233 for HCP and https://doi.org/10.5281/zenodo.10050234 for GSP. The Brainnetome atlas (BNA) atlas is available at http://atlas.brainnetome.org/ and the information about the lobes and gyrus is available at https://pan.cstcloud.cn/web/share.html?hash=6eRCJ0zDTFk.

Code availability

All custom preprocessing (in Matlab) and analysis (in Python) code can be found in the following repository: https://github.com/shuo-zhou/GSDA-Lateralization. An online demonstration example is available at: https://colab.research.google.com/github/shuo-zhou/GSDA-Lateralization/blob/main/gsda_demo.ipynb

\bibliographystylesupp

naturemag-doi \bibliographysuppmethod_ref

Acknowledgements

This work was supported by the STI 2030-Major Projects (2021ZD0200500, 2021ZD0201701), the National Natural Science Foundation of China (T2325006, 82021004), and the Fundamental Research Funds for the Central Universities (2233200020).

Author contributions statement

S.Z., J.L., H.L., and G.G. conceptualized the study. S.Z. and J.L. designed the research experiments. S.Z. developed the Group-Specific Discriminant Analysis algorithm and performed the classification experiments. J.L. analyzed model weights. Y.J. implemented the preprocessing pipeline and processed the data for experiments. S.Z. and H.W. developed the metric Group-Specificity Index for evaluation. S.Z., J.L., and H.W. wrote the manuscript. H.L. and G.G. performed critical revisions of the article. All authors reviewed and edited the manuscript.

Extended data figure

Refer to caption
Figure Extended Data Fig. 1: Additional results of left vs. right brain classification using GSDA-Logit on HCP data [35] with different cross-validation strategies from Fig. 2. a, Average test accuracy on the held-out session; for example, training was conducted on the data from the REST1 session, and test was performed on the data from the REST2 session. b, GSI calculated from the test results shown in Extended Data Fig. 1a. c, Average test accuracy on the held-out subjects’ data; for example, training was conducted on the 80% subjects’ data sampled from the REST1 session, and test was performed on the remaining 20% subjects’ data from REST1 and REST2. d, GSI calculated from the test results shown in Extended Data Fig. 1c. The remaining detailed descriptions of the figures, along with the main observations, are the same as those in the caption of Fig. 2.
Refer to caption
Figure Extended Data Fig. 2: Performance of left vs. right brain classification using GSDA-Logit on Brain Genomics Superstruct Project (GSP) data [36] with respect to the hyperparameter λ𝜆\lambdaitalic_λ. a, Average test accuracy on the held-out hemispheres, with a cross-validation strategy consistent with that shown in Fig. 2a. b, GSI calculated from the test results shown in Extended Data Fig. 2a. c, Average test accuracy on the held-out subjects’ data, with a cross-validation strategy consistent with that shown in Extended Data Fig. 1c. d, GSI calculated from the test results shown in Extended Data Fig. 2c. The remaining detailed descriptions of the figures, along with the main observations, are the same as those in the caption of Fig. 2.
Refer to caption
Figure Extended Data Fig. 3: Additional Pearson correlation coefficients between model weights learned from GSP data [36]. a, Correlation between male-specific and multivariate control models. b, Correlation between female-specific and multivariate control models. c, d, Average pairwise correlation for c, male-specific and d, female-specific GSDA models. The main observations are consistent with those in Fig. 3.
Refer to caption
Figure Extended Data Fig. 4: Sex-specific lateralized connections identified by a, male-specific models for HCP (Fig. 5a + Fig. 5e), b, female-specific models for HCP (Fig. 5b + Fig. 5f), c, male-specific models for GSP (Fig. 5c + Fig. 5g), and d, female-specific models for GSP (Fig. 5d + Fig. 5g).
Refer to caption
Figure Extended Data Fig. 5: Count of the group “exclusive” lateralized connections for HCP and GSP (Fig. 5e,f,g,h) categorized by associated lobes, and inter- or intra-lobe. The connections are identified by a, male-specific models for HCP, b, female-specific models for HCP, c, male-specific models for GSP, d, female-specific models for GSP. e, Sum of Extended Data Fig. 5a and c. and f, Sum of Extended Data Fig. 5b and d.