Group-specific discriminant analysis reveals statistically validated sex differences in lateralization of brain functional network

Shuo Zhou. Department of Computer Science, University of Sheffield, Sheffield, UK; Centre for Machine Intelligence, University of Sheffield, Sheffield, UK. These authors contributed equally to this work.
Junhao Luo. State Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, China. These authors contributed equally to this work.
Yaya Jiang. State Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, China.
Haolin Wang. Department of Computer Science, University of Sheffield, Sheffield, UK; Centre for Machine Intelligence, University of Sheffield, Sheffield, UK.
Haiping Lu. Department of Computer Science, University of Sheffield, Sheffield, UK; Centre for Machine Intelligence, University of Sheffield, Sheffield, UK.
Gaolang Gong. State Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, China; Beijing Key Laboratory of Brain Imaging and Connectomics, Beijing Normal University, Beijing, China; Chinese Institute for Brain Research, Beijing, China.
Correspondence: h.lu@sheffield.ac.uk; gaolang.gong@bnu.edu.cn

Abstract

Lateralization is a fundamental feature of the human brain, where sex differences have been observed. Conventional studies in neuroscience on sex-specific lateralization are typically conducted on univariate statistical comparisons between male and female groups. However, these analyses often lack effective validation of group specificity. Here, we formulate modeling sex differences in lateralization of functional networks as a dual-classification problem, consisting of first-order classification for left vs. right functional networks and second-order classification for male vs. female models. To capture sex-specific patterns, we develop the Group-Specific Discriminant Analysis (GSDA) for first-order classification. The evaluation on two public neuroimaging datasets demonstrates the efficacy of GSDA in learning sex-specific models from functional networks, achieving a significant improvement in group specificity over baseline methods. The major sex differences are in the strength of lateralization and the interactions within and between lobes. The GSDA-based method is generic in nature and can be adapted to other group-specific analyses such as handedness-specific or disease-specific analyses.

Introduction

Human brains are functionally lateralized [1, 2]. The asymmetries between left and right brain hemispheres are believed to reflect a complex interplay of evolutionary, hereditary, developmental, experiential, and pathological influences [3]. Researchers are developing insights on lateralization through psychological, pharmacological, and neuroscience investigations [4, 5]. One important understanding is that multiple factors influence human brain lateralization [5], with sex being one of the most representative [6, 7, 8, 9, 10, 11, 12, 13]. A popular viewpoint is that males have a more asymmetric brain organization while females have a more “bilateral” brain organization, which may result in the males’ superior spatial skills and the females’ superior verbal skills [14, 15].

Measurement of functional brain lateralization is valuable but challenging [16]. Direct approaches such as selectively modulating or suppressing cortical activities and circuits in a single hemisphere [17] often risk harming the human brain [18]. Over the last two decades, functional neuroimaging techniques have been widely used in neuroscience, offering a powerful and non-invasive approach to studying human brain lateralization [19, 20]. One popular technique is analyzing resting-state functional connectivity (FC), also known as the brain network or connectome [21]. This is usually derived from functional magnetic resonance imaging (fMRI) time series and considered an intrinsic “fingerprint” of the human brain [4, 22, 23]. A previous study [8] reported sex differences in lateralization of resting-state networks, with more right-lateralized visual and default-mode network components for males and females, respectively. Several other networks also showed differences between males and females [8]. Additionally, males and females have demonstrated significant differences in homotopic functional connectivity of numerous regions [24].

Studies on brain lateralization have largely focused on modeling asymmetry effects region-by-region [5]. These lateralized brain regions are usually measured by the laterality index (LI) [25, 7], or identified through statistical univariate analysis comparing homologous regions [26, 27]. However, using these conventional methods to search for sex-specific lateralization patterns is mostly limited to within-group analysis [28, 29, 30, 31, 32, 33]. For example, to understand male-specific lateralization, analyses are performed on male and female data separately to label features significantly differing from female data as “male-specific”. Moreover, these analyses often do not validate the models on unseen samples, and therefore the resulting patterns may not be truly group-specific. In addition, given the high similarity between male and female brains, the small size of statistical effects makes detecting sex differences in lateralization more difficult [34]. As a result, the true specificity of lateralization may be overwhelmed by the similarities. Hence, effectively modeling and validating sex-specific lateralization remains challenging.

Figure 1: a, The proposed classification workflow for identifying sex-specific brain lateralization. ①–③: Hemispheric features are extracted from the intrahemispheric brain functional network, which is constructed using resting-state functional MRI time series. ④⑤: First-order classification learns the differences between the two hemispheres, where a group-specific discriminant analysis (GSDA) classifier is trained to classify left vs. right hemispheres for a target group. ⑥: Second-order classification trains a standard logistic regression for classifying the male- vs. female-specific models obtained from the first-order classification, to identify the weights that significantly contribute to the sex-specific predictions. ⑦: Evaluate the predictions and interpret the model weights. b, Group-specific discriminant analysis with the logistic loss (GSDA-Logit) for the first-order classification of left vs. right brain hemispheres. This model jointly maximizes the likelihood of labels for the target group (with non-target-group labels masked out) and the grouping factor dependence for both the target and non-target groups, where ${}^{\top}$ denotes the transpose of vectors, $\mathbf{x}$ denotes the input training samples, $\mathbf{x}_{\mathrm{t}}$ denotes target group training samples, $\mathbf{x}_{\setminus\mathrm{t}}$ denotes non-target group samples, and a hyperparameter $\lambda\geq 0$ controls the grouping factor dependence.

Here, we formulate the identification of sex differences in brain lateralization as a machine learning classification problem to address the aforementioned challenges. First, to model sex differences in brain lateralization, we propose a dual classification workflow. This consists of a first-order classification of left vs. right brain hemispheres and a second-order classification of male- vs. female-specific models. The obtained first- and second-order model weights can be interpreted as lateralization strength and sex differences, respectively. The whole workflow is presented in Fig. 1a. Second, to learn group (sex)-specific models in the first-order classification, we propose a novel group-specific discriminant analysis (GSDA) algorithm (Fig. 1b). Third, to validate the learned models, we leverage the cross-validation method in machine learning for a statistical evaluation. The models’ classification performance will be assessed by their accuracy on male and female test samples via cross-validation. Fourth, we propose a new metric, the Group Specificity Index (GSI), for evaluating the group specificity of the learned models.

Our final contribution involves conducting classification experiments using intrahemispheric connections extracted from resting-state fMRI (rs-fMRI) data of two public neuroimaging repositories, the Human Connectome Project (HCP) [35] and the Brain Genomics Superstruct Project (GSP) [36]. The results demonstrate a significant improvement in GSI obtained by GSDA over the baselines. Further second-order classification reveals consistent sex differences in lateralization across datasets: 1) about half of the sex-specific lateralized connections are shared between male and female brain functional networks, with differences in the strength of lateralization; 2) stronger positive inter-lobe interactions are more left-lateralized in the male brain network, while stronger positive intra-lobe interactions are more right-lateralized in the female brain network.

Results

Diverged test accuracy on male and female sets

Figure 2 depicts the performance of group-specific discriminant analysis (GSDA) in classifying left vs. right brain hemispheres on the HCP data [35] across a range of values for the hyperparameter $\lambda$. A larger $\lambda$ indicates a higher grouping factor (sex) dependence. When the target group is male (left panel of Fig. 2a), the left and right hemisphere labels of the female training data were masked. Therefore, the female training samples were only involved in the grouping factor dependence regularization. In this scenario, the average accuracy obtained on the male test samples (the blue solid line) stays higher than that on the female test samples (the orange dashed line). Increasing $\lambda$ widens the gap between the test accuracy on the target and non-target test sets. In particular, this gap widens significantly within the range $0<\lambda\leq 5$ and stabilizes at about 20 percentage points for $\lambda>5$ (Fig. 2a). These observations remain consistent in results with different cross-validation strategies for the HCP data (Extended Data Fig. 1a,c) and the GSP data (Extended Data Fig. 2a,c).
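
The roles of label masking and of $\lambda$ described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the GSDA implementation: the text above does not spell out the dependence measure, so the sketch substitutes a squared centered covariance between the projection $\mathbf{w}^{\top}\mathbf{x}$ and the group label. The function name `gsda_logit_objective`, the toy data, and this dependence term are all hypothetical.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def gsda_logit_objective(w, X, y, g, target, lam, l2=0.1):
    """Illustrative objective (to be minimized) in the spirit of GSDA-Logit.

    X: feature vectors (one per hemisphere), y: 0 = left, 1 = right,
    g: group label per sample (0/1), target: group whose labels are used.
    ASSUMPTION: the dependence term is a stand-in chosen for this sketch.
    """
    n = len(X)
    scores = [sum(wj * xj for wj, xj in zip(w, x)) for x in X]

    # Logistic loss on target-group samples only: non-target labels are masked.
    target_idx = [i for i in range(n) if g[i] == target]
    nll = 0.0
    for i in target_idx:
        p = sigmoid(scores[i])
        nll -= y[i] * math.log(p) + (1 - y[i]) * math.log(1.0 - p)
    nll /= len(target_idx)

    # Stand-in dependence between projection and grouping factor, computed on
    # ALL samples (target and non-target), so masked samples still contribute.
    s_mean = sum(scores) / n
    g_mean = sum(g) / n
    cov = sum((s - s_mean) * (gi - g_mean) for s, gi in zip(scores, g)) / (n - 1)
    dependence = cov ** 2

    ridge = l2 * sum(wj * wj for wj in w)
    # Maximizing likelihood and dependence == minimizing nll - lam * dependence.
    return nll + ridge - lam * dependence

# Toy data: male left/right, female left/right (group 0 = male, 1 = female).
X = [[-1.0, 0.1], [1.0, 0.2], [-0.8, 0.9], [0.9, 1.0]]
y = [0, 1, 0, 1]
g = [0, 0, 1, 1]
w = [1.0, 0.5]

obj_baseline = gsda_logit_objective(w, X, y, g, target=0, lam=0.0)
obj_specific = gsda_logit_objective(w, X, y, g, target=0, lam=5.0)
# Flipping the (masked) female labels leaves the objective unchanged.
obj_flipped = gsda_logit_objective(w, X, [0, 1, 1, 0], g, target=0, lam=5.0)
```

With `lam=0.0` the dependence term vanishes and the sketch reduces to an $\ell_{2}$-regularized logistic loss on the target group alone, which mirrors the degenerate case discussed in this section.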

The group specificity of models obtained by GSDA increases with $\lambda$, as reflected by our proposed metric, the Group Specificity Index (GSI), which is presented as a box plot in Fig. 2b. When $0<\lambda\leq 5$, the GSI for both male- and female-specific GSDA models increases with $\lambda$. When $\lambda\geq 5$, the GSI remains at around 0.4. Based on both accuracy and GSI results, $\lambda=5$ is an “elbow” point across different datasets and cross-validation strategies, and can be considered an optimal value for the trade-off between classification accuracy, group specificity, and model complexity (the hyperparameter for $\ell_{2}$ regularization was fixed to 0.1, so the larger $\lambda$ is, the lower the relative importance of $\ell_{2}$ regularization). Hence, in the rest of this article, we use GSDA with $\lambda=5$ as the main sex-specific model to present the results and findings.

In contrast, the GSI approaches zero without the grouping factor dependence regularization. At $\lambda=0$, where GSDA degenerates to a standard logistic regression trained only on the target-group hemispheres, the accuracy is nearly 100% for both male and female test samples (Fig. 2a, Table 1, Extended Data Fig. 1a,c, and Extended Data Fig. 2a,c). This performance is similar to the multivariate control baseline, which uses standard logistic regression trained on mixed male and female hemispheres. From Table 1, the control models achieved an accuracy of 99.99 $\pm$ 0.04% for male and 99.92 $\pm$ 0.13% for female HCP test samples, and 99.94 $\pm$ 0.07% for male and 99.99 $\pm$ 0.01% for female GSP test samples. Because GSDA with $\lambda=0$ has the same form as standard logistic regression and performs similarly to the multivariate control baseline, we treat it as an additional multivariate baseline.

GSDA-based models learned distinct weights

Beyond classification performance similarity, the weights of multivariate baselines (control and GSDA with $\lambda=0$) are also highly correlated. As shown in Fig. 3a, the average Pearson correlation coefficients between multivariate baselines are 0.99 for analyses conducted within either the HCP or GSP datasets. Similarly, in univariate analyses based on the $t$-test of paired left and right connections, the $t$-values of within-group analysis showed a 0.99 correlation with the $t$-values derived from mixed male and female samples (univariate control). Among these multivariate and univariate baselines, the correlation for any arbitrary pair exceeds $0.91$ for within-dataset results and $0.7$ for cross-dataset results. This high correlation suggests that the lateralization modeled by multivariate or univariate baselines is common to both males and females, regardless of whether the analysis is conducted with exclusively male or female data, or with mixed data. This corresponds to the top red triangular cluster in Fig. 3a.
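
The two quantities compared above, paired $t$-values for left vs. right connections and Pearson correlations between weight or $t$-value vectors, can be computed as in the following minimal sketch. The helper names `paired_t` and `pearson` and the toy FC values are illustrative and not taken from the study.

```python
import math

def paired_t(left, right):
    """Paired t statistic for one connection across subjects (left minus right)."""
    d = [l - r for l, r in zip(left, right)]
    n = len(d)
    mean_d = sum(d) / n
    var_d = sum((x - mean_d) ** 2 for x in d) / (n - 1)  # sample variance
    return mean_d / math.sqrt(var_d / n)

def pearson(a, b):
    """Pearson correlation between two equally long vectors."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    return cov / math.sqrt(va * vb)

# Toy FC values of one homologous connection in four subjects.
left_fc = [0.62, 0.55, 0.70, 0.58]
right_fc = [0.50, 0.49, 0.61, 0.52]
t_value = paired_t(left_fc, right_fc)

# Toy comparison of two weight (or t-value) vectors over the same connections.
similarity = pearson([0.8, -0.2, 0.1, 0.5], [0.7, -0.1, 0.2, 0.6])
```

In practice one $t$-value would be computed per connection, and the resulting vectors from within-group and mixed analyses would then be passed to `pearson`.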

In contrast, our sex-specific models (with a higher GSI) show lower correlations with the univariate and multivariate baseline models. This corresponds to the blue rectangular cluster at the bottom of Fig. 3a, where a majority of coefficients fall within the range of 0.35 to 0.5. Increasing the value of $\lambda$ leads to a decreasing correlation between the control and GSDA models ($\lambda>0$), for both results from HCP (Fig. 3b, and first columns of Fig. 3c,d) and GSP (Extended Data Fig. 3a,b, and first columns of Extended Data Fig. 3c,d). Moreover, the weights of sex-specific models are stable. As shown in Fig. 3c,d, the average correlation of any pair for GSDA with $\lambda\geq 2$ is 0.99 or above.

Table 1: First-order classification (left vs. right brain hemispheres) accuracy on male and female test sets from the HCP [35] and the Brain Genomics Superstruct Project (GSP) [36]. Group-specific models (GSDA with $\lambda=5$) are compared with three multivariate baselines: 1) standard logistic regression trained on a mixture of male and female training data, 2) GSDA with $\lambda=0$ (equivalent to standard logistic regression) trained on male data only, and 3) GSDA with $\lambda=0$ trained on female data only. $\lambda=5$ is an optimal value for GSDA on the data as determined by the accuracy and GSI in Fig. 2. The baselines achieved similar accuracy on both male and female test sets, indicating a lack of group specificity. Conversely, the group-specific models maintained accuracy on the target test set but showed a significant gap with the lower accuracy on the non-target test set.

	Average test accuracy (%) and gap (target $-$ non-target)
Classification method (target group)	HCP male	HCP female	HCP gap	GSP male	GSP female	GSP gap
Logistic regression (male + female)	99.99 $\pm$ 0.04	99.92 $\pm$ 0.13	0.07	99.94 $\pm$ 0.07	99.99 $\pm$ 0.01	0.05
GSDA ($\lambda=0$, male)	99.87 $\pm$ 0.16	99.85 $\pm$ 0.17	0.02	99.93 $\pm$ 0.08	99.99 $\pm$ 0.01	0.06
GSDA ($\lambda=0$, female)	99.93 $\pm$ 0.12	99.99 $\pm$ 0.04	0.06	99.97 $\pm$ 0.05	99.95 $\pm$ 0.07	0.02
GSDA ($\lambda=5$, male)	92.75 $\pm$ 1.83	68.52 $\pm$ 2.88	24.23	91.85 $\pm$ 1.77	71.28 $\pm$ 2.13	20.57
GSDA ($\lambda=5$, female)	70.76 $\pm$ 2.56	93.16 $\pm$ 1.89	22.40	74.70 $\pm$ 2.22	92.81 $\pm$ 1.35	18.11
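
The gap column of Table 1 is the target-group accuracy minus the non-target-group accuracy. As a small worked check, the sketch below recomputes it for the GSDA ($\lambda=5$) rows from the mean accuracies in the table; the dictionary layout and the function name `accuracy_gap` are illustrative only.

```python
# Mean test accuracies (%) of GSDA with lambda = 5, copied from Table 1.
# Key: (dataset, target group); value: accuracy on male / female test sets.
table1_gsda = {
    ("HCP", "male"): {"male": 92.75, "female": 68.52},
    ("HCP", "female"): {"male": 70.76, "female": 93.16},
    ("GSP", "male"): {"male": 91.85, "female": 71.28},
    ("GSP", "female"): {"male": 74.70, "female": 92.81},
}

def accuracy_gap(acc, target):
    """Gap = accuracy on the target test set minus accuracy on the non-target set."""
    non_target = "female" if target == "male" else "male"
    return acc[target] - acc[non_target]

gaps = {
    key: round(accuracy_gap(acc, key[1]), 2)
    for key, acc in table1_gsda.items()
}
```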

Identifying sex-specific lateralized connections with dual-classification weights

To identify lateralized connections related to sex differences among the 7,503 intrahemispheric connections, we performed a second-order classification. This involved training standard logistic regression models to distinguish between male- and female-specific models learned from the first-order classification, using 80% of the first-order models for training and 20% for testing. The test accuracy for second-order classification consistently achieved nearly 100% over 1,000 random splits. This indicates that the sex differences in the first-order model weights are generalizable.
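
The second-order step treats each first-order weight vector as one sample and its target sex as the label. The following sketch illustrates this on synthetic data with a plain gradient-descent logistic regression. The five-dimensional "weight vectors", the single sex-dependent coordinate, and all hyperparameters are assumptions for illustration; the actual models have 7,503 weights and were trained with a standard solver.

```python
import math
import random

def sigmoid(z):
    # Numerically safe logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)

def fit_logreg(X, y, l2=0.01, lr=0.5, steps=500):
    """L2-regularized logistic regression fitted by batch gradient descent."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(steps):
        grad_w = [l2 * wj for wj in w]  # gradient of (l2 / 2) * ||w||^2
        grad_b = 0.0
        for x, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b) - yi
            for j in range(d):
                grad_w[j] += err * x[j] / n
            grad_b += err / n
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b

def accuracy(w, b, X, y):
    preds = [1 if sum(wj * xj for wj, xj in zip(w, x)) + b > 0 else 0 for x in X]
    return sum(int(p == yi) for p, yi in zip(preds, y)) / len(y)

# Synthetic stand-ins for first-order weight vectors (hypothetical values):
# coordinate 0 differs between male-specific (0) and female-specific (1) models.
rng = random.Random(0)
base = [0.0, -0.3, 0.2, 0.0, 0.1]

def synthetic_first_order_weights(sex):
    v = [bj + rng.gauss(0.0, 0.05) for bj in base]
    v[0] += 0.5 if sex == 1 else -0.5
    return v

labels = [0] * 20 + [1] * 20
models = [synthetic_first_order_weights(s) for s in labels]

# One random 80% / 20% split of the first-order models.
idx = list(range(len(models)))
rng.shuffle(idx)
train_idx, test_idx = idx[:32], idx[32:]
w2, b2 = fit_logreg([models[i] for i in train_idx], [labels[i] for i in train_idx])
test_acc = accuracy(w2, b2, [models[i] for i in test_idx], [labels[i] for i in test_idx])
```

The second-order weights `w2` play the role of the weights that are later averaged over random splits; here the largest magnitude falls on the coordinate that was made sex-dependent by construction.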

Based on the weights from these second-order classification models, we derived a mask that characterizes sex differences in the lateralized connections. We first averaged the weights across 1,000 second-order classification models from different random splits for the HCP and GSP datasets, respectively. Then, we identified the overlap between the top 5% of the largest average weights (by magnitude) from HCP and those from GSP. The resulting map is shown as a chord diagram in Fig. 4a. The threshold of 5% was chosen because the second-order logistic regression classifiers were trained with $\ell_{2}$ regularization, which can be interpreted as a Gaussian prior (normal distribution) on model weights, with 5% being a commonly used statistical significance level for a Gaussian distribution. By calculating the average degree [38] (Fig. 4b) of connections for each lobe within this mask, we find that sex differences in first-order weights are associated with the frontal, parietal, and occipital lobes, where the average degrees exceed 1. The frontal lobe shows the largest average degree, indicating the most pronounced sex differences.
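
The mask construction described above (top 5% by magnitude in each dataset, then the overlap) can be sketched as follows. The function names, the rounding rule for the number of selected weights, and the toy weight vectors are assumptions for illustration; a fraction of 0.25 is used in the toy call only because the vectors are short.

```python
def top_fraction(weights, fraction=0.05):
    """Indices of the largest weights by magnitude.

    ASSUMPTION: the count is floor(fraction * n), with at least one index kept.
    """
    k = max(1, int(len(weights) * fraction))
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]), reverse=True)
    return set(order[:k])

def consensus_mask(weights_a, weights_b, fraction=0.05):
    """Connections that are in the top fraction for both datasets."""
    return top_fraction(weights_a, fraction) & top_fraction(weights_b, fraction)

# Toy averaged second-order weights over eight connections (hypothetical values).
avg_w_hcp = [0.9, -0.1, 0.05, -0.8, 0.2, 0.0, 0.1, -0.3]
avg_w_gsp = [-0.7, 0.2, 0.6, 0.1, -0.05, 0.0, 0.3, 0.1]
mask = consensus_mask(avg_w_hcp, avg_w_gsp, fraction=0.25)
```

Because selection is by magnitude, a connection can enter the mask even when its averaged weights have opposite signs in the two datasets, as connection 0 does in this toy example.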

We then applied this mask to the top 5% weights of four first-order classification models: HCP male-specific, HCP female-specific, GSP male-specific, and GSP female-specific. The obtained lateralized connections with sex differences are shown in Fig. 4c, Fig. 5a-h, and Extended Data Fig. 4a-d. The weights of these four models were obtained by taking the average of the corresponding 1,000 models learned from different random splits. In total, 47 lateralized connections with repetition were identified, of which 30 connections are unique. Among these 47 connections, the middle frontal gyrus (MFG) was the most frequently involved region, suggesting it may serve as a hub. Specifically, 17 out of the 47 connections were associated with the MFG in both male and female samples across both datasets.

Sex-specific lateralization: shared and “exclusive” connections

For each of the four sex-specific models, about half of the identified lateralized connections are shared between male and female brain networks (Fig. 4c): on HCP, 6 out of 12 for the male-specific and 6 out of 13 for the female-specific model; on GSP, 6 out of 10 for the male-specific and 6 out of 12 for the female-specific model. Among the 30 unique sex-specific lateralized connections identified across datasets, 9 (nearly one-third) are shared between males and females. To illustrate these findings, we have separated the shared and “exclusive” connections, as depicted in Fig. 5a-d and Fig. 5e-h, respectively.

For the shared lateralized connections (Fig. 5a,b for HCP and Fig. 5c,d for GSP), we observed sex differences in the magnitude of first-order weights, i.e., strength of lateralization. Specifically, for the female-specific models, the magnitudes of first-order weights corresponding to the connections associated with the frontal lobe are generally larger compared to those for male-specific models, particularly those of the positive weights. In male-specific models, the magnitudes of first-order weights for connections related to other lobes are larger than those in female-specific models.

For the “exclusive” connections in the four models (Fig. 5e-h), male-specific models contain more inter-lobe lateralized connections (Fig. 6a), with more than 70% of corresponding weights being negative, as shown by the blue chords in Fig. 5e,g. Female-specific models, on the other hand, contain more intra-lobe lateralized connections (Fig. 6b), with more than 90% of weights being positive, indicated by the red chords in Fig. 5f,h. Notably, these patterns of inter- and intra-lobe lateralization for males and females are consistent across joint or separate analyses of both HCP and GSP data (Extended Data Fig. 5), demonstrating the stability and reliability of these findings.

Discussion

Cross-validation challenges conventional statistical approach for investigating lateralization

Traditional neuroscience studies commonly assume that results from within-group analyses are specific to the group being studied [29, 30, 33, 28, 32]. This methodology of exclusively using male or female data to explore sex-specific characteristics is intuitively logical. However, our experimental results obtained via cross-validation challenge this assumption. For example, as depicted in Fig. 2a, when standard logistic regression models are exclusively trained on data from one target group (male or female), the performance on test sets for both the target and non-target groups is nearly identical. Hence, the performance of these baseline models is not sensitive to sex-based sampling. Based on the definition of generalization in statistical learning theory [39], the similarity in generalization errors suggests that these models are general to both males and females, not sex-specific. This finding implies that statistical methods can learn common patterns even when trained on data from a specific group, contradicting the conventional assumption in group-specific analysis. This conclusion holds at least in our study on left vs. right brain classification using the HCP and GSP datasets.

The performance of our sex-specific models (GSDA with $\lambda>0$) is sensitive to sex-based sampling. The classification results (Fig. 2a, Extended Data Fig. 1a,c, and Extended Data Fig. 2a,c) reveal that the generalization error for the target-group test sets is significantly lower than that for the non-target-group test sets. This indicates a stronger specificity to sex compared to the multivariate baseline models, as reflected by our group-specificity index (GSI) results. The differences observed in test performance highlight the importance of cross-validation in validating the group specificity of statistical analysis results.

While univariate analysis results are not directly applicable to unseen samples for testing, the strong correlation between univariate and multivariate baselines offers valuable insights for pattern identification. For example, the correlation of the within-group univariate $t$-test results with univariate control models (mixed) and multivariate baselines exceeds 0.99 and 0.91, respectively. This suggests that the outcomes of our within-group univariate analyses are likely common to both males and females. Consequently, previous conclusions from such within-group analyses should be revisited and validated. Moreover, although multivariate methods are theoretically superior in capturing interactions between features compared to univariate methods, the observed similarity between multivariate and univariate results suggests that multivariate methods might not always identify patterns distinct from those found by univariate methods.

Regions and connections identified across datasets for sex-specific lateralization

The mask resulting from the second-order classification revealed sex differences in connections across lobes, including the frontal, temporal, parietal, insular, limbic, and occipital lobes, where functional differences between males and females were observed in previous studies [8, 40, 9, 41, 42, 43]. Among the sex-specific lateralized connections, three are shared by male- and female-specific models across the two datasets: MTG-IPL, MFG-IFG, and MFG-INS, where MTG denotes the middle temporal gyrus, IPL the inferior parietal lobule, IFG the inferior frontal gyrus, and INS the insula.

From the perspective of gyri, which engage in various cognitive functions, the lateralized regions include the MTG [44] (sound recognition and language processing), MFG [45] (literacy and numeracy), IPL [46, 47] (spatial attention, multimodal sensory integration, and oculomotor control), IFG [48] (speech and language processing), and INS [49] (various sensorimotor processing and risk-reward behavior). These regions show lateralization and sex differences in certain functions including speech processing, language, and spatial attention [50, 8, 40]. The MFG, a hub region in this study, is a core component of the multiple demand system [51] and shows hemispheric specialization, with the left MFG primarily supporting literacy development, while the right MFG is vital for numeracy [45]. The MTG showed lateralization in activated volumes for both males and females during language tasks, while the lateralization of IFG was only observed in males [52]. Our study reports different weights of connections related to these two regions. This suggests that the lateralization of a region’s external connections can reflect the lateralization of its functional activation. The “activity flow” theory in neuroscience has linked the connections and functional activation [53], proposing that the seed-based connection-weighted sums of the activation of other regions can predict functional activation of the seed region. Our results suggest a correlation between the lateralization of functional connectivity and activation, although further quantitative analysis is required to investigate the specifics of this relationship.

From the perspective of connections, the MTG-IPL connection is associated with picture naming and displays notable plasticity [54]. Laws (2004) [55] assessed sex differences in picture naming speed, and Ala-Salomäki et al. [56] found that picture naming shows reliable left-lateralized evoked activation. These studies point to potential sex and hemispheric differences in the MTG-IPL connection and picture-naming cognition. Our GSDA framework successfully captured these effects, suggesting that the underlying mechanism for picture naming lateralization could be the lateralization of related connections, such as the MTG-IPL connection. As for the MFG-IFG connection, proficiency in artificial grammar rules was found to be positively linked to the functional connectivity between the left IFG and left MFG [57, 52], and sex differences in this function have also been reported [58]. Consistent with these studies, our framework has captured both the lateralization and sex effects on the MFG-IFG connection. In the case of the MFG-INS connection, which is associated with mild traumatic brain injury (mTBI) [59], our study reveals sex-specific hemispheric differences, suggesting that personalized intervention and diagnostic approaches considering brain hemispheric and sex differences may be necessary for more effective mTBI treatment.

Sex differences in lateralization: strength and inter-/intra-lobe interaction patterns

Sex differences in strengths of shared lateralized connections: As reported in the results section, the first-order weights corresponding to these shared connections show consistent sex differences (Fig. 5a-d). In our labeling strategy, “left” was labeled as 0 and “right” as 1. Therefore, for a positive first-order weight, a stronger positive interaction (FC value approaching 1) between two ROIs suggests a higher probability of right lateralization, whereas a stronger negative interaction (FC value approaching -1) suggests a higher probability of left lateralization. The opposite interpretation applies to negative first-order weights. Therefore, we can interpret the sex differences in first-order weights for the shared connections as follows: positive interactions involving the frontal lobes are more right-lateralized in females than in males, whereas positive interactions involving the temporal, parietal, insular, and limbic lobes are more left-lateralized in males than in females. This observation of difference in strength of lateralization aligns with the findings on the lateralization patterns of right- and left-handed individuals [60], and supports neuroscience discoveries of shared functional network mechanisms across males and females [61, 32].
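
The sign convention above can be checked numerically with a one-connection logistic model. This is a toy sketch: the function name `p_right`, the zero bias, and the weight and FC values are illustrative assumptions, and a real first-order model sums over all connections.

```python
import math

def p_right(weight, fc, bias=0.0):
    """Probability of the label "right" (1) from a single connection.

    Labels follow the text: left = 0, right = 1.
    """
    return 1.0 / (1.0 + math.exp(-(weight * fc + bias)))

# Positive first-order weight: positive FC favors "right", negative FC favors "left".
pos_w_pos_fc = p_right(2.0, 0.8)
pos_w_neg_fc = p_right(2.0, -0.8)

# Negative first-order weight: the interpretation is reversed.
neg_w_pos_fc = p_right(-2.0, 0.8)
neg_w_neg_fc = p_right(-2.0, -0.8)
```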

Sex differences in inter-/intra-lobe interaction patterns are identified by the “exclusive” lateralized connections, particularly within the frontal lobe (Fig. 5e-h), a key region for language processing [62]. Using the same approach as above for interpreting first-order weights, we can summarize that males tend to have a stronger left lateralization in positive inter-lobe interactions, while females tend to have a stronger right lateralization in positive intra-lobe interactions. This divergence may result from the evolutionary pressure for lateralization, which optimizes functional organization and reduces redundancy among brain regions [63, 64]. Inter-lobe connections, characterized by long-range wiring, are metabolically costly [65, 66, 67, 68], while the shorter-range intra-lobe connections are more energy-efficient. In females, these intra-frontal connections may enhance language abilities. Conversely, male inter-lobe connections may be driven by the need to engage more extensive functional areas for complex visuospatial tasks. This divergence can be a factor in sex and lateralized differences in cognitive abilities, with males typically outperforming in rightward visuospatial tasks, and females in leftward verbal tasks [14, 15].

Potential of GSDA for more general applications

Our study focused on sex as a grouping factor and employed brain hemisphere labels to identify sex-specific lateralized patterns for human brain functional networks. The results demonstrate efficacy and stability in identifying and validating sex differences in lateralization. Importantly, the scope of this general predictive framework extends beyond its current application. For instance, employing different grouping factors, such as handedness or disease vs. control, can advance the analysis of distinct characteristics. Moreover, the GSDA algorithm can be extended to grouping factor combinations, such as sex and handedness. While this work focused on classification and a discrete group factor, our method can be adapted for regression tasks (e.g. predicting behavioral/cognitive scores) and continuous grouping factors (e.g. age or IQ score).

References

[1] Ojemann, G., Ojemann, J., Lettich, E. & Berger, M. Cortical language localization in left, dominant hemisphere: an electrical stimulation mapping investigation in 117 patients. Journal of Neurosurgery 71, 316–326 (1989).
[2] Gazzaniga, M. S. Cerebral specialization and interhemispheric communication: does the corpus callosum enable the human condition? Brain 123, 1293–1326 (2000).
[3] Corballis, M. C. The evolution of lateralized brain circuits. Frontiers in Psychology 8, 1021 (2017).
[4] Passingham, R. E., Stephan, K. E. & Kötter, R. The anatomical basis of functional localization in the cortex. Nature Reviews Neuroscience 3, 606–616 (2002).
[5] Toga, A. W. & Thompson, P. M. Mapping brain asymmetry. Nature Reviews Neuroscience 4, 37–48 (2003).
[6] Clements, A. et al. Sex differences in cerebral laterality of language and visuospatial processing. Brain and Language 98, 150–158 (2006).
[7] Tomasi, D. & Volkow, N. D. Laterality patterns of brain functional connectivity: gender effects. Cerebral Cortex 22, 1455–1462 (2012).
[8] Agcaoglu, O., Miller, R., Mayer, A. R., Hugdahl, K. & Calhoun, V. D. Lateralization of resting state networks and relationship to age and gender. NeuroImage 104, 310–325 (2015).
[9] Reber, J. & Tranel, D. Sex differences in the functional lateralization of emotion and decision making in the human brain. Journal of Neuroscience Research 95, 270–278 (2017).
[10] Guadalupe, T. et al. Asymmetry within and around the human planum temporale is sexually dimorphic and influenced by genes involved in steroid hormone receptor activity. Cortex 62, 41–55 (2015).
[11] Hirnstein, M., Westerhausen, R., Korsnes, M. S. & Hugdahl, K. Sex differences in language asymmetry are age-dependent and small: A large-scale, consonant–vowel dichotic listening study with behavioral and fMRI data. Cortex 49, 1910–1921 (2013).
[12] Plessen, K. J., Hugdahl, K., Bansal, R., Hao, X. & Peterson, B. S. Sex, age, and cognitive correlates of asymmetries in thickness of the cortical mantle across the life span. Journal of Neuroscience 34, 6294–6302 (2014).
[13] Sommer, I. E., Aleman, A., Somers, M., Boks, M. P. & Kahn, R. S. Sex differences in handedness, asymmetry of the planum temporale and functional language lateralization. Brain Research 1206, 76–88 (2008).
[14] Levy, J. Lateral specialization of the human brain, behavioral manifestations and possible evolutionary basis. The Biology of Behavior (1972).
[15] Levy, J. Lateral differences in the human brain in cognition and behavioral control. Cerebral Correlates of Conscious Experience (1978).
[16] Jansen, A. et al. The assessment of hemispheric lateralization in functional MRI—robustness and reproducibility. NeuroImage 33, 204–217 (2006).
[17] Kinsbourne, M. Hemineglect and hemisphere rivalry. Adv. Neurol. 18, 41–49 (1977).
[18] Kolb, B. & Whishaw, I. Q. Fundamentals of Human Neuropsychology (Macmillan, 2009).
[19] Fox, M. D. & Raichle, M. E. Spontaneous fluctuations in brain activity observed with functional magnetic resonance imaging. Nature Reviews Neuroscience 8, 700–711 (2007).
[20] Logothetis, N. K. What we can do and what we cannot do with fMRI. Nature 453, 869–878 (2008).
[21] Sporns, O., Tononi, G. & Kötter, R. The human connectome: a structural description of the human brain. PLoS Computational Biology 1, e42 (2005).
[22] Smith, S. M. et al. Temporally-independent functional modes of spontaneous brain activity. Proceedings of the National Academy of Sciences 109, 3131–3136 (2012).
[23] Finn, E. S. et al. Functional connectome fingerprinting: identifying individuals using patterns of brain connectivity. Nature Neuroscience 18, 1664–1671 (2015).
[24] Zuo, X.-N. et al. Growing together and growing apart: regional and sex differences in the lifespan developmental trajectories of functional homotopy. Journal of Neuroscience 30, 15034–15043 (2010).
[25] Seghier, M. L. Laterality index in functional MRI: methodological issues. Magnetic Resonance Imaging 26, 594–601 (2008).
[26] Liégeois, F. et al. A direct test for lateralization of language activation using fMRI: comparison with invasive assessments in children with epilepsy. NeuroImage 17, 1861–1867 (2002).
[27] Friston, K. J. Functional and effective connectivity: a review. Brain Connectivity 1, 13–36 (2011).
[28] Jacobsen, L. K., Slotkin, T. A., Mencl, W. E., Frost, S. J. & Pugh, K. R. Gender-specific effects of prenatal and adolescent exposure to tobacco smoke on auditory and visual attention. Neuropsychopharmacology 32, 2453–2464 (2007).
[29] Schwarz, E. et al. Sex-specific serum biomarker patterns in adults with Asperger’s syndrome. Molecular Psychiatry 16, 1213–1220 (2011).
[30] Kret, M. E., Pichon, S., Grèzes, J. & Gelder, B. d. Men fear other men most: gender specific brain activations in perceiving threat from dynamic faces and bodies–an fMRI study. Frontiers in Psychology 2, 3 (2011).
[31] Ingalhalikar, M. et al. Sex differences in the structural connectome of the human brain. Proceedings of the National Academy of Sciences 111, 823–828 (2014).
[32] Cui, Z., Su, M., Li, L., Shu, H. & Gong, G. Individualized prediction of reading comprehension ability using gray matter volume. Cerebral Cortex 28, 1656–1672 (2018).
[33] Zullo, S. & Caenazzo, L. Gene editing and gender-specific medicine: a challenge for dementia research. Palgrave Communications 6, 1–6 (2020).
[34] Good, C. D. et al. Cerebral asymmetry and the effects of sex and handedness on brain structure: a voxel-based morphometric analysis of 465 normal adult human brains. NeuroImage 14, 685–700 (2001).
[35] Smith, S. M. et al. Resting-state fMRI in the human connectome project. NeuroImage 80, 144–168 (2013).
[36] Holmes, A. J. et al. Brain genomics superstruct project initial data release with structural, functional, and behavioral measures. Scientific Data 2, 1–16 (2015).
[37] Fan, L. et al. The human brainnetome atlas: a new brain atlas based on connectional architecture. Cerebral Cortex 26, 3508–3526 (2016).
[38] Diestel, R. Graph Theory (New York: Springer, 2000).
[39] Vapnik, V. The Nature of Statistical Learning Theory (Springer Science & Business Media, 1999).
[40] Smith, D. V. et al. Characterizing individual differences in functional connectivity using dual-regression and seed-based approaches. NeuroImage 95, 1–12 (2014).
[41] Zhang, C., Dougherty, C. C., Baum, S. A., White, T. & Michael, A. M. Functional connectivity predicts gender: Evidence for gender differences in resting brain connectivity. Human Brain Mapping 39, 1765–1776 (2018).
[42] Sen, B. & Parhi, K. K. Predicting biological gender and intelligence from fMRI via dynamic functional connectivity. IEEE Trans. on Biomedical Engineering 68, 815–825 (2020).
[43] Kong, R. et al. Spatial topography of individual-specific cortical networks predicts human cognition, personality, and emotion. Cerebral Cortex 29, 2533–2551 (2019).
[44] Onitsuka, T. et al. Middle and inferior temporal gyrus gray matter volume abnormalities in chronic schizophrenia: an MRI study. American Journal of Psychiatry 161, 1603–1611 (2004).
[45] Koyama, M. S., O’Connor, D., Shehzad, Z. & Milham, M. P. Differential contributions of the middle frontal gyrus functional connectivity to literacy and numeracy. Scientific Reports 7, 17548 (2017).
[46] Lynch, J. C. The functional organization of posterior parietal association cortex. Behavioral and Brain Sciences 3, 485–499 (1980).
[47] Hyva, J. et al. Regional distribution of functions in parietal association area 7 of the monkey. Brain Research 206, 287–303 (1981).
[48] Greenlee, J. D. et al. Functional connections within the human inferior frontal gyrus. Journal of Comparative Neurology 503, 550–559 (2007).
[49] Kortz, M. W. & Lillehei, K. O. Insular cortex. StatPearls (2021).
[50] Hirnstein, M., Hugdahl, K. & Hausmann, M. Cognitive sex differences and hemispheric asymmetry: A critical review of 40 years of research. Laterality: Asymmetries of Body, Brain and Cognition 24, 204–252 (2019).
[51] Duncan, J. The multiple-demand (MD) system of the primate brain: mental programs for intelligent behaviour. Trends in Cognitive Sciences 14, 172–179 (2010).
[52] Kansaku, K., Yamaura, A. & Kitazawa, S. Sex differences in lateralization revealed in the posterior language areas. Cerebral Cortex 10, 866–872 (2000).
[53] Cole, M. W., Ito, T., Bassett, D. S. & Schultz, D. H. Activity flow over resting-state networks shapes cognitive task activations. Nature Neuroscience 19, 1718–1726 (2016).
[54] van Dokkum, L. E. et al. Resting state network plasticity related to picture naming in low-grade glioma patients before and after resection. NeuroImage: Clinical 24, 102010 (2019).
[55] Laws, K. R. Sex differences in lexical size across semantic categories. Personality and Individual Differences 36, 23–32 (2004).
[56] Ala-Salomäki, H., Kujala, J., Liljeström, M. & Salmelin, R. Picture naming yields highly consistent cortical activation patterns: Test–retest reliability of magnetoencephalography recordings. NeuroImage 227, 117651 (2021).
[57] Kepinska, O., de Rover, M., Caspers, J. & Schiller, N. O. Connectivity of the hippocampus and Broca’s area during acquisition of a novel grammar. NeuroImage 165, 1–10 (2018).
[58] Lotz, A. & Kinder, A. Rapid communication: Sex differences in artificial grammar learning: Evidence for different strategies in men and women. Quarterly Journal of Experimental Psychology 64, 417–424 (2011).
[59] Li, F. et al. Causal interactions with an insular-cortical network in mild traumatic brain injury. European Journal of Radiology 157, 110594 (2022).
[60] Vingerhoets, G. et al. Cerebral lateralization of praxis in right-and left-handedness: Same pattern, different strength. Human Brain Mapping 33, 763–777 (2012).
[61] Linnman, C., Beucke, J.-C., Jensen, K. B., Gollub, R. L. & Kong, J. Sex similarities and differences in pain-related periaqueductal gray connectivity. Pain 153, 444–454 (2012).
[62] Tuckute, G. et al. Frontal language areas do not emerge in the absence of temporal language areas: A case study of an individual born without a left temporal lobe. Neuropsychologia 169, 108184 (2022).
[63] Corballis, M. C. Evolution of cerebral asymmetry. Progress in Brain Research 250, 153–178 (2019).
[64] Vallortigara, G. & Rogers, L. Survival with an asymmetrical brain: advantages and disadvantages of cerebral lateralization. Behavioral and Brain Sciences (2005).
[65] Laughlin, S. B. & Sejnowski, T. J. Communication in neuronal networks. Science 301, 1870–1874 (2003).
[66] Lennie, P. The cost of cortical computation. Current Biology 13, 493–497 (2003).
[67] Levy, W. B. & Baxter, R. A. Energy efficient neural codes. Neural Computation 8, 531–543 (1996).
[68] Ringo, J. L., Doty, R. W., Demeter, S. & Simard, P. Y. Time is of the essence: a conjecture that hemispheric specialization arises from interhemispheric conduction delay. Cerebral Cortex 4, 331–343 (1994).


Methods

Dual classification with group-specific discriminant analysis (GSDA)

We propose a dual classification framework with two primary objectives: learning group-specific models and identifying group-specific discriminant weights. For the first objective, specifically the classification of left vs. right brain hemispheres, we train a linear classifier on the training data and then validate its performance on the test data. We refer to this process as the first-order classification. The weights derived from the model are called the first-order weights. Then we perform a second round of classification to identify the weights that show significant differences between group-specific models. Here, we train a linear classifier to differentiate between male- and female-specific models. This stage is called the second-order classification, and the associated weights are referred to as the second-order weights. This process is illustrated in stages ⑤ and ⑥ of Fig. 1a.

The first-order classification builds a (group-specific) prediction function. This function predicts whether an unseen brain hemisphere is left or right, based on a feature vector. These vectors represent the left or right human brain hemispheres and are extracted from the training neuroimaging data. The resulting prediction accuracy serves as a quantitative measure of the extent to which the learned lateralization patterns are generalized among the brain networks within the test set. The learned model weights can be interpreted as indicators of the significance or extent of differences between the corresponding connections of the left and right brain hemispheres.

The second-order classification is designed to identify weights that show significant differences between the male- and female-specific first-order models. In this stage, a linear classification model is trained on the first-order model weights to predict whether an unseen model is male- or female-specific. The features with larger weights in the second-order classification are considered to represent the stronger sex differences. We propose a group-specific discriminant analysis algorithm to learn group-specific models for the first-order classification.

Problem formulation of GSDA

Let $(\mathbf{x}_{i},y_{i},\mathbf{g}_{i})$ represent the $i$ th sample, where $\mathbf{x}_{i}\in\mathcal{X}\subseteq\mathbb{R}^{p}$ denotes an input data vector, $y_{i}\in\mathcal{Y}$ denotes an output variable (label), and $\mathbf{g}_{i}\in\mathcal{G}\subseteq\mathbb{R}^{q}$ represents a covariate vector for the grouping factor(s). Here, $i\in[1,m]$ , with $m$ being the total number of samples. $\mathcal{X}$ , $\mathcal{Y}$ and $\mathcal{G}$ are the feature spaces of the input data, output label, and grouping factor, respectively, with $p$ and $q$ as the corresponding feature dimensions for the input data $\mathbf{x}_{i}$ and grouping factor $\mathbf{g}_{i}$ . In the context of this article, $\mathbf{x}_{i}$ is a feature vector that represents a brain hemisphere, $y_{i}$ indicates whether $\mathbf{x}_{i}$ is the left or right hemisphere, and $\mathbf{g}_{i}$ is a binary (zero and one) indicator representing whether $\mathbf{x}_{i}$ is from a male or female subject (e.g. $g_{i}=0$ for male and $g_{i}=1$ for female). Assuming $\mathbf{x}_{0}=1$ , considering $\mathbf{w}_{0}$ as the bias term, and denoting $\mathbf{w}\in\mathbb{R}^{p+1}$ as the vector of weights (coefficients) to be learned, with the target group represented as subscript ${}_{\mathrm{t}}$ , we formulate the objective of learning group-specific models as follows:

\operatorname*{arg\,max}_{\mathbf{w}}\ \frac{1}{m_{\mathrm{t}}}\sum_{i=1}^{m_{\mathrm{t}}}\mathbb{P}(y_{i}|\mathbf{x}_{i},\mathbf{w})+\frac{\lambda}{m}\sum_{j=1}^{m}\mathbb{P}(\mathbf{g}_{j}|\mathbf{x}_{j},\mathbf{w}),

(1)

where $m_{\mathrm{t}}$ denotes the number of training samples from the target group, and $\lambda\geq 0$ is the hyperparameter that quantifies the importance of grouping factor(s) dependence. Based on Eq. (1), we formulate a general group-specific discriminant analysis (GSDA) framework as:

\operatorname*{arg\,min}_{\mathbf{w}}\ L(\mathbf{X}_{\mathrm{t}}^{\top}\mathbf{w},\mathbf{y}_{\mathrm{t}})+\alpha\|\mathbf{w}\|_{K}^{2}-\lambda\underbrace{\rho(\mathbf{X}^{\top}\mathbf{w},\mathbf{G})}_{\text{Group dependence}},

(2)

where $L(\cdot,\cdot)$ denotes a classification or regression loss function, such as least squares, logistic, or hinge loss, $\alpha\geq 0$ is the hyperparameter used for weight regularization, $\|\cdot\|_{K}^{2}$ denotes either an $\ell_{1}$ or $\ell_{2}$ regularization, with $K=1$ or $2$ respectively, $\mathbf{X}_{\mathrm{t}}$ denotes the target group’s training samples, $\mathbf{X}$ denotes all training samples, consisting of both target and non-target group samples, and $\rho(\cdot,\cdot)$ is a statistical dependence measure. In this work, we employed the Hilbert-Schmidt Independence Criterion (HSIC) \citesuppgretton2005measuring. Given two sets $\mathbf{X}=\{\mathbf{x}_{1},\mathbf{x}_{2},\dots,\mathbf{x}_{m}\}$ and $\mathbf{Y}=\{\mathbf{y}_{1},\mathbf{y}_{2},\dots,\mathbf{y}_{m}\}$, both of size $m$, HSIC computes the statistical dependence between $\mathbf{X}$ and $\mathbf{Y}$ via

\rho_{h}(\mathbf{X,Y})=\frac{1}{(m-1)^{2}}\text{tr}(\mathbf{KHLH}),

(3)

where $\mathbf{K,H,L}\in\mathbb{R}^{m\times m}$, $\mathbf{K}_{i,j}:=k_{x}(\mathbf{x}_{i},\mathbf{x}_{j})$, $\mathbf{L}_{i,j}:=k_{y}(\mathbf{y}_{i},\mathbf{y}_{j})$, $k_{x}(\cdot,\cdot)$ and $k_{y}(\cdot,\cdot)$ are two kernel functions, such as linear, polynomial, or radial basis function (RBF), $\mathbf{H}=\mathbf{I}-\frac{1}{m}\mathbf{11^{\top}}$ is the centering matrix, $\mathbf{I}$ is an identity matrix, and $\text{tr}\left(\cdot\right)$ is the trace function. HSIC is non-negative, $\rho_{h}(\mathbf{X,Y})\geq 0$, and, with characteristic kernels such as the RBF kernel, it is zero if and only if the two sets of variables $\mathbf{X}$ and $\mathbf{Y}$ are independent, i.e., $\mathbb{P}(\mathbf{x,y})=\mathbb{P}(\mathbf{x})\mathbb{P}(\mathbf{y})$. A higher HSIC value suggests stronger statistical dependence.
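As an illustration of Eq. (3), the following is a minimal NumPy sketch of the empirical HSIC, not the authors' implementation. The function names (`centering_matrix`, `rbf_kernel`, `hsic`), the RBF bandwidth `gamma`, and the synthetic data are assumptions made for this example; samples are stored as rows here for convenience.

```python
import numpy as np

def centering_matrix(m):
    """H = I - (1/m) 1 1^T, the m x m centering matrix."""
    return np.eye(m) - np.ones((m, m)) / m

def rbf_kernel(A, gamma=1.0):
    """RBF kernel matrix for samples stored as rows of A (gamma is an assumed bandwidth)."""
    sq = np.sum(A ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * A @ A.T
    return np.exp(-gamma * np.maximum(d2, 0.0))

def hsic(K, L):
    """Empirical HSIC of Eq. (3), computed from two m x m kernel matrices."""
    m = K.shape[0]
    H = centering_matrix(m)
    return np.trace(K @ H @ L @ H) / (m - 1) ** 2

# Synthetic example: one variable that depends on x, one drawn independently.
rng = np.random.default_rng(0)
x = rng.normal(size=(200, 1))
y_dep = x + 0.1 * rng.normal(size=(200, 1))
y_ind = rng.normal(size=(200, 1))

hsic_dep = hsic(rbf_kernel(x), rbf_kernel(y_dep))
hsic_ind = hsic(rbf_kernel(x), rbf_kernel(y_ind))
print(hsic_dep, hsic_ind)
```

On such data the dependent pair should yield a clearly larger value than the independent pair, in line with the interpretation that a higher HSIC indicates stronger dependence.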

GSDA with logistic loss and maximum likelihood estimation

To maximize the likelihood of the target-group labels and the grouping factor(s) dependence as specified in Eq. (1), we adopt maximum likelihood estimation for optimizing the model weights $\mathbf{w}$ . Here, we develop a novel algorithm, Group-Specific Discriminant Analysis with logistic loss (GSDA-Logit), as a variant of logistic regression for group-dependent learning. The overall likelihood to be maximized is as follows:

\mathbb{P}(\mathbf{y}_{\mathrm{t}}|\mathbf{X}_{\mathrm{t}},\mathbf{w})\,\mathbb{P}(\mathbf{w})\,\mathbb{P}(\mathbf{G}|\mathbf{X},\mathbf{w})=\Big(\prod_{i=1}^{m_{\mathrm{t}}}S(\mathbf{w}^{\top}\mathbf{x}_{i})^{y_{i}}(1-S(\mathbf{w}^{\top}\mathbf{x}_{i}))^{(1-y_{i})}\Big)\frac{1}{(2\pi\sigma^{2})^{(p+1)/2}}\exp\Big(-\frac{\mathbf{w}^{\top}\mathbf{w}}{2\sigma^{2}}\Big)S(\rho_{h}(\mathbf{w}^{\top}\mathbf{X},\mathbf{G})),

(4)

where $\mathbb{P}(\mathbf{y}_{\mathrm{t}}|\mathbf{X}_{\mathrm{t}},\mathbf{w})$ is the likelihood of target labels $\mathbf{y}_{\mathrm{t}}$ given the model and target-group data $\mathbf{X}_{\mathrm{t}}$ , $\mathbb{P}(\mathbf{w})$ is the prior probability of weights in $\mathbf{w}$ , $\mathbb{P}(\mathbf{G}|\mathbf{X},\mathbf{w})$ is the likelihood of grouping factor dependence, $S(\cdot)$ denotes the logistic (or sigmoid) function, and $\mathbb{P}(\mathbf{w})$ can be interpreted as the $\ell_{2}$ regularization for $\mathbf{w}$ , assuming that the weights in $\mathbf{w}$ follow a normal distribution with a mean of zero and a standard deviation of $\sigma$ . Given that $\mathbf{w^{\top}X}$ produces a row vector, Equation (3) can be reformulated in the form of simplified HSIC \citesuppzhou2020side:

\begin{aligned}\rho_{sh}(\mathbf{w}^{\top}\mathbf{X},\mathbf{G})&=\text{tr}((\mathbf{w}^{\top}\mathbf{X})^{\top}(\mathbf{w}^{\top}\mathbf{X})\mathbf{HLH})\\ &=\mathbf{w}^{\top}\mathbf{XHLHX}^{\top}\mathbf{w},\end{aligned}

(5)

where $\mathbf{L}=\mathbf{G^{\top}G}$ . By replacing $\rho_{h}(\mathbf{w^{\top}X},\mathbf{G})$ with the simplified HSIC $\rho_{sh}(\mathbf{w^{\top}X},\mathbf{G})$ , the likelihood can be rewritten as

\mathbb{P}(\mathbf{y}_{\mathrm{t}}|\mathbf{X}_{\mathrm{t}},\mathbf{w})\,\mathbb{P}(\mathbf{w})\,\mathbb{P}(\mathbf{G}|\mathbf{X},\mathbf{w})=\Big(\prod_{i=1}^{m_{\mathrm{t}}}S(\mathbf{w}^{\top}\mathbf{x}_{i})^{y_{i}}(1-S(\mathbf{w}^{\top}\mathbf{x}_{i}))^{(1-y_{i})}\Big)\frac{1}{(2\pi\sigma^{2})^{(p+1)/2}}\exp\Big(-\frac{\mathbf{w}^{\top}\mathbf{w}}{2\sigma^{2}}\Big)S(\mathbf{w}^{\top}\mathbf{XHLHX}^{\top}\mathbf{w}).

(6)
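The trace form and the quadratic form of the simplified HSIC in Eq. (5) can be checked numerically. The sketch below uses synthetic data and assumed variable names; it also shows that for a binary grouping factor ($q=1$) the quantity reduces to a squared centered inner product between the projections $\mathbf{w}^{\top}\mathbf{X}$ and the group indicator.

```python
import numpy as np

rng = np.random.default_rng(0)
p, m = 5, 40
X = rng.normal(size=(p, m))                 # columns are samples, as in the text
w = rng.normal(size=p)
g = rng.integers(0, 2, size=(1, m)).astype(float)   # binary grouping factor, q = 1

H = np.eye(m) - np.ones((m, m)) / m         # centering matrix
L = g.T @ g                                 # linear kernel on the grouping factor

z = (w @ X)[None, :]                        # w^T X as a 1 x m row vector
trace_form = np.trace(z.T @ z @ H @ L @ H)  # first line of Eq. (5)
quad_form = w @ X @ H @ L @ H @ X.T @ w     # second line of Eq. (5)

# With q = 1, L = g^T g, so the quadratic form is a squared scalar.
cov_form = (g @ H @ z.T).item() ** 2
print(trace_form, quad_form, cov_form)
```

The equality of the first two values follows from the cyclic property of the trace; the third makes explicit that maximizing this term rewards projections that co-vary with group membership.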

The likelihood in Eq. (6) can be maximized using the same optimization steps for a standard logistic regression, i.e., computing the gradient of the negative log-likelihood. Let $\alpha=\frac{1}{\sigma^{2}}$ and $\lambda$ denote the two hyperparameters that control the importance of the $\ell_{2}$ regularization and grouping factor dependence regularization, respectively. Let $J(\mathbf{w})$ denote the negative logarithm of the likelihood. Taking the gradient of $J(\mathbf{w})$ with respect to $\mathbf{w}$ , we obtain

\nabla J(\mathbf{w})=\mathbf{X}_{\mathrm{t}}(S(\mathbf{X}^{\top}_{\mathrm{t}}\mathbf{w})-\mathbf{y}_{\mathrm{t}})+\alpha\mathbf{w}+\lambda(S(\mathbf{w}^{\top}\mathbf{XHLHX}^{\top}\mathbf{w})-1)\mathbf{XHLHX}^{\top}\mathbf{w}.

(7)
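As a reading aid, the step from Eq. (6) to Eq. (7) can be sketched as follows. Two points are assumptions of this sketch rather than statements from the text: $\lambda^{\prime}$ is introduced as a weight (exponent) on the group-dependence factor, and the shorthand $\mathbf{M}=\mathbf{XHLHX}^{\top}$ (a symmetric matrix) is used. The derivation relies on $S^{\prime}(z)=S(z)(1-S(z))$.

```latex
\begin{aligned}
J(\mathbf{w}) &= -\sum_{i=1}^{m_{\mathrm{t}}}\big[y_{i}\log S(\mathbf{w}^{\top}\mathbf{x}_{i})+(1-y_{i})\log(1-S(\mathbf{w}^{\top}\mathbf{x}_{i}))\big]
  +\frac{\alpha}{2}\mathbf{w}^{\top}\mathbf{w}
  -\lambda^{\prime}\log S(\mathbf{w}^{\top}\mathbf{M}\mathbf{w})+\text{const},\\
\nabla_{\mathbf{w}}\Big(-\sum_{i}\cdots\Big) &= \sum_{i=1}^{m_{\mathrm{t}}}\big(S(\mathbf{w}^{\top}\mathbf{x}_{i})-y_{i}\big)\mathbf{x}_{i}
  =\mathbf{X}_{\mathrm{t}}\big(S(\mathbf{X}_{\mathrm{t}}^{\top}\mathbf{w})-\mathbf{y}_{\mathrm{t}}\big),\\
\nabla_{\mathbf{w}}\Big(\frac{\alpha}{2}\mathbf{w}^{\top}\mathbf{w}\Big) &= \alpha\mathbf{w},\\
\nabla_{\mathbf{w}}\big(-\lambda^{\prime}\log S(\mathbf{w}^{\top}\mathbf{M}\mathbf{w})\big) &= -\lambda^{\prime}\big(1-S(\mathbf{w}^{\top}\mathbf{M}\mathbf{w})\big)\,2\mathbf{M}\mathbf{w}
  =2\lambda^{\prime}\big(S(\mathbf{w}^{\top}\mathbf{M}\mathbf{w})-1\big)\mathbf{M}\mathbf{w}.
\end{aligned}
```

Setting $\lambda=2\lambda^{\prime}$ and summing the three gradients recovers Eq. (7); the constant collects the normalization of the prior, which does not depend on $\mathbf{w}$.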

Finally, $\mathbf{w}$ can be optimized iteratively via

\mathbf{w}^{k+1}=\mathbf{w}^{k}-\eta\nabla J(\mathbf{w}^{k}),

(8)

where $k$ denotes the $k$th iteration and $\eta$ is the learning rate (step size). Algorithm 1 gives the pseudocode for GSDA-Logit. In addition to standard gradient descent optimization, we have implemented the L-BFGS optimization algorithm \citesuppliu1989limited for faster coefficient estimation.

Algorithm 1 Group-Specific Discriminant Analysis with logistic loss (GSDA-Logit)

Input: Input data matrix $\mathbf{X}\in\mathbb{R}^{p\times m}$, target-group label vector $\mathbf{y}_{\mathrm{t}}\in\mathbb{R}^{m_{\mathrm{t}}}$, grouping factor(s), and indices of samples from the target group (optional; if not given, the first $m_{\mathrm{t}}<m$ samples are assumed to be the labeled target samples).
Hyperparameters: $\alpha$ for $\ell_{2}$ regularization, $\lambda$ for group dependence (HSIC) regularization, and $\eta$ for learning rate.
Output: Coefficient vector $\mathbf{w}\in\mathbb{R}^{p+1}$.

1: Encode the grouping factor(s) into a matrix $\mathbf{G}\in\mathbb{R}^{q\times m}$ ($q=1$ for a binary grouping factor) using one-hot encoding. Then construct the kernel matrix $\mathbf{L}=\mathbf{G}^{\top}\mathbf{G}\in\mathbb{R}^{m\times m}$ and the centering matrix $\mathbf{H}\in\mathbb{R}^{m\times m}$;
2: Add a row of $1$s to $\mathbf{X}$;
3: Randomly initialize $\mathbf{w}^{k}$ ($k=0$);
4: while not converged do
5:     Compute gradient $\nabla J(\mathbf{w}^{k})$ by Eq. (7);
6:     Update $\mathbf{w}^{k+1}=\mathbf{w}^{k}-\eta\nabla J(\mathbf{w}^{k})$;
7: end while
8: return GSDA-Logit coefficient vector $\mathbf{w}$
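The steps of Algorithm 1 can be sketched in NumPy as below. This is an illustrative sketch under stated assumptions, not the authors' released code: the function name `gsda_logit_fit`, the small random initialization, the fixed iteration budget used in place of a convergence test, the default step size, and the synthetic data are all choices made for this example.

```python
import numpy as np

def sigmoid(z):
    # Numerically stable logistic function S(z).
    return np.exp(-np.logaddexp(0.0, -z))

def gsda_logit_fit(X, y_t, g, target_idx, alpha=0.1, lam=1.0, eta=0.01,
                   n_iter=500, seed=0):
    """Gradient-descent sketch of Algorithm 1 (hypothetical helper).

    X: (p, m) data with samples as columns; y_t: (m_t,) labels in {0, 1} for
    the target-group samples; g: (m,) binary grouping factor; target_idx:
    column indices of the target-group samples.
    """
    p, m = X.shape
    G = np.asarray(g, dtype=float)[None, :]         # step 1: G in R^{1 x m}
    L = G.T @ G                                     # kernel matrix on groups
    H = np.eye(m) - np.ones((m, m)) / m             # centering matrix
    Xb = np.vstack([np.ones((1, m)), X])            # step 2: row of ones (bias)
    M = Xb @ H @ L @ H @ Xb.T                       # XHLHX^T, constant in w
    Xt = Xb[:, target_idx]
    w = 0.01 * np.random.default_rng(seed).normal(size=p + 1)   # step 3
    for _ in range(n_iter):                         # steps 4-7, fixed budget
        Mw = M @ w
        grad = (Xt @ (sigmoid(Xt.T @ w) - y_t)      # Eq. (7)
                + alpha * w
                + lam * (sigmoid(w @ Mw) - 1.0) * Mw)
        w = w - eta * grad                          # Eq. (8)
    return w                                        # step 8

# Synthetic usage example: the first 100 samples form the target group.
rng = np.random.default_rng(0)
p, m, m_t = 5, 200, 100
X = rng.normal(size=(p, m))
g = np.r_[np.zeros(m_t), np.ones(m - m_t)]
target_idx = np.arange(m_t)
y_t = (X[0, :m_t] > 0).astype(float)                # synthetic labels
w = gsda_logit_fit(X, y_t, g, target_idx)
pred = sigmoid(w[0] + w[1:] @ X[:, :m_t]) > 0.5
train_acc = np.mean(pred == (y_t == 1))
print(w.shape, train_acc)
```

Because $\mathbf{XHLHX}^{\top}$ does not depend on $\mathbf{w}$, it is computed once outside the loop. A production implementation would replace the fixed iteration count with a convergence criterion or a quasi-Newton solver.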

Evaluation: group specificity index (GSI)

To measure group specificity, we set the following criteria for the metric:

• The value of this metric lies within $[0,1]$.
• The value of this metric is 0 if the test accuracies for the target and non-target groups are identical.
• When the test accuracies of the target and non-target groups differ, the value of this metric should be proportional to 1) the absolute accuracy for the target group, and 2) the closeness of the accuracy for the non-target group to random chance.
• Greater relative accuracy divergence between target and non-target groups will result in a higher value of this metric.

To satisfy the above conditions, we propose a Group Specificity Index (GSI) for binary classification problems as follows:

\mathrm{GSI}=2\mathrm{BAT}(\mathrm{BAT}-0.5-|\mathrm{BANT}-0.5|),

(9)

where $\mathrm{BAT}\in[0.5,1]$ represents the balanced accuracy of the target-group data, and $\mathrm{BANT}\in[0,1]$ denotes the balanced accuracy of the non-target-group data. Balanced accuracy is chosen to mitigate the impact of imbalanced samples. It is defined as $\mathrm{BA}=\frac{\mathrm{TPR}+\mathrm{TNR}}{2}$, where the true positive rate $\mathrm{TPR}=\frac{\text{number of true positives}}{\text{number of total positives}}$ and the true negative rate $\mathrm{TNR}=\frac{\text{number of true negatives}}{\text{number of total negatives}}$. In the left vs. right brain hemisphere classification problem, the numbers of left and right training examples are equal, making balanced accuracy equivalent to accuracy. The expression $|\mathrm{BANT}-0.5|$ measures how close the accuracy of the non-target group is to random chance (0.5), and $\mathrm{BAT}-0.5-|\mathrm{BANT}-0.5|$ quantifies the relative accuracy divergence between the target and non-target groups. Since we are interested in generalized lateralization patterns for the target group, models that perform worse than random chance on target test sets are not considered.
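A minimal sketch of Eq. (9) and of balanced accuracy follows; the function names `balanced_accuracy` and `gsi` and the example values are assumptions of this illustration.

```python
def balanced_accuracy(y_true, y_pred):
    """BA = (TPR + TNR) / 2 for binary labels in {0, 1}."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    n_pos = sum(1 for t in y_true if t == 1)
    n_neg = sum(1 for t in y_true if t == 0)
    return 0.5 * (tp / n_pos + tn / n_neg)

def gsi(bat, bant):
    """Group Specificity Index of Eq. (9)."""
    return 2.0 * bat * (bat - 0.5 - abs(bant - 0.5))

best = gsi(1.0, 0.5)      # perfect on target, chance on non-target
same = gsi(0.8, 0.8)      # identical accuracies on both groups
partial = gsi(0.9, 0.6)   # target clearly better than non-target
ba = balanced_accuracy([1, 1, 0, 0], [1, 0, 0, 0])
print(best, same, partial, ba)
```

The first case reaches the maximum of the index and the second gives zero, matching the first two criteria. Evaluating Eq. (9) as written, the value becomes negative when the non-target accuracy lies further from chance than the target accuracy (for example $\mathrm{BAT}=0.6$, $\mathrm{BANT}=0.9$), so the $[0,1]$ range applies to models with $\mathrm{BAT}-0.5\geq|\mathrm{BANT}-0.5|$.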

Resting-State fMRI data and processing

We use resting-state fMRI data from the Human Connectome Project (HCP) [35] and the Genomics Superstruct Project (GSP) [36] for brain hemisphere classification to study lateralization. Table 2 summarizes the demographic information of the subjects involved in our experiments across both datasets.

Human Connectome Project (HCP)

Acquisition

All MRI data were collected using the same 3T Siemens Skyra scanner at Washington University in St. Louis with a 32-channel head coil \citesuppvan2013wu. Specifically, rs-fMRI was acquired using a gradient-echo echo-planar imaging (GE-EPI) sequence with the following parameters: repetition time (TR) = 720 ms, echo time (TE) = 33.1 ms, flip angle (FA) = 52°, bandwidth = 2290 Hz/pixel, field of view (FOV) $=208\times 180\text{ mm}^{2}$, matrix $=104\times 90$, voxel size $=2\times 2\times 2\text{ mm}^{3}$, multi-band acceleration factor $=8$, slices = 72, and total scan time of 1200 frames = 14 min and 24 s [35]. During the scan, participants were asked to keep their eyes open and fixate on a white cross on a screen with a black background. There were two rs-fMRI sessions (REST1 and REST2) acquired on two consecutive days, each including two runs with a left-to-right (LR) and a right-to-left (RL) phase encoding direction. The T1-weighted images were acquired using a magnetization-prepared rapid gradient-echo (MPRAGE) sequence with the following parameters: TR = 2,400 ms, TE = 2.14 ms, inversion time (TI) = 1,000 ms, FA = 8°, FOV $=224\times 224\text{ mm}^{2}$, voxel size 0.7 mm isotropic, and total scan time = 7 min and 40 s.

Preprocessing

The HCP minimal preprocessing pipeline (version 2.0) was utilized, which included steps for magnetic gradient distortion correction, EPI distortion correction, non-brain tissue removal, Montreal Neurological Institute (MNI) standard space registration, and intensity normalization \citesuppglasser2013minimal. The resulting data were denoised using independent component analysis (ICA) with the FIX tool \citesuppsalimi2014automatic. This tool identifies and removes spatiotemporal signal components that originate from non-neuronal or structural noise, with an emphasis on head movement [35]. Subsequently, five post-processing steps were applied to the minimally preprocessed data: 1) spatial smoothing with a 4 mm FWHM kernel, twice the voxel resolution of HCP fMRI data \citesuppfox2005human; 2) linear detrending to minimize the effects of low-frequency drift \citesupplowe1999treatment; 3) regression of a suite of nuisance variables unrelated to neural signals, such as average signals from white matter (WM) and cerebrospinal fluid (CSF), as well as the whole brain (global signal, GS) \citesupphallquist2013nuisance, fox2009global; 4) bandpass filtering (0.01–0.1 Hz) \citesuppthompson2015frequency,fox2005human; 5) scrubbing to control the effects of transient movement across the time series frames \citesupppower2012spurious.

Genomics Superstruct Project (GSP)

Acquisition

All imaging data were collected on matched 3T Tim Trio scanners (Siemens Healthcare, Erlangen, Germany) at Harvard University and Massachusetts General Hospital using the vendor-supplied 12-channel phased-array head coil [36]. Structural data included a high-resolution (1.2 mm isotropic) multi-echo T1-weighted magnetization-prepared gradient-echo image. Functional imaging data were acquired using a GE-EPI sequence sensitive to blood oxygenation level-dependent (BOLD) contrast with the following parameters: TR = 3,000 ms, TE = 30 ms, FA = 85°, voxel size $=3\times 3\times 3\text{ mm}^{3}$ , slices = 47, and total scan time of 124 frames = 6 min and 12 s.

Preprocessing

All fMRI data were preprocessed using the SPM \citesuppfriston2003statistical and GRETNA \citesuppwang2015gretna toolkits, including the following steps: 1) removing the first four volumes to ensure that the magnetization is at steady state \citesuppparkes2018evaluation; 2) slice-timing correction; 3) realignment of all volumes to the first volume to reduce the effects of head motion \citesuppparkes2018evaluation; 4) co-registration of GE-EPI data to the native, cropped, high-resolution structural image, followed by normalization to the MNI space through the Diffeomorphic Anatomical Registration Through Exponentiated Lie Algebra (DARTEL) algorithm \citesuppashburner2007fast; 5) spatial smoothing with a 6 mm FWHM kernel, twice the voxel resolution of GSP fMRI data \citesuppfox2005human; 6) linear detrending to minimize the effects of low-frequency drift \citesupplowe1999treatment; 7) regression of six head motion parameters \citesuppyan2013addressing, as well as the WM, CSF and GS signals \citesupphallquist2013nuisance, fox2009global; 8) lowpass filtering (< 0.08 Hz) \citesuppyeo2011organization.

Extracting intrahemispheric brain network

We use intrahemispheric brain network connectivity as features to represent brain hemispheres. Figure 1a ①-③ illustrates the data processing workflow for obtaining intrahemispheric connections from resting-state time series. Regional time series were extracted using the Brainnetome atlas (BNA) [37], which divides the human brain into 246 regions (123 per hemisphere). Pearson correlation was computed to represent the connectivity between brain regions. Following Liang et al. \citesuppliang2014neuroplasticity, the correlation coefficients were transformed into $z$-scores using Fisher’s $z$ transform. For HCP data, we averaged $z$-scores across the RL and LR runs for each session. To extract half-brain features, we reordered the columns and rows of the connectivity matrix to produce two 123 $\times$ 123 matrices, representing the intrahemispheric networks for the two brain hemispheres of each subject. We then extracted the upper (or lower) triangle of these matrices (illustrated as the red and blue areas in ③ of Fig. 1a) to form two 7503-dimensional feature vectors ($123\times 122/2=7503$), one per hemisphere, for the experiments.
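The feature extraction described above can be sketched as follows. This is an illustration on synthetic time series, not the authors' pipeline: the helper name `hemisphere_features` is hypothetical, and the assumed column ordering (alternating left and right regions) must be replaced by the actual region-to-hemisphere assignment of the atlas files in use.

```python
import numpy as np

def hemisphere_features(ts, left_idx, right_idx):
    """ts: (n_timepoints, n_regions) regional time series.

    Returns Fisher-z connectivity features for the left and right
    intrahemispheric networks.
    """
    r = np.corrcoef(ts, rowvar=False)           # Pearson correlation between regions
    feats = []
    for idx in (left_idx, right_idx):
        sub = r[np.ix_(idx, idx)]               # 123 x 123 intrahemispheric block
        iu = np.triu_indices(len(idx), k=1)     # upper triangle, diagonal excluded
        feats.append(np.arctanh(sub[iu]))       # Fisher z-transform
    return feats[0], feats[1]

n_regions = 246
# Assumed ordering for this sketch: even 0-based columns left, odd columns right.
left_idx = np.arange(0, n_regions, 2)
right_idx = np.arange(1, n_regions, 2)

rng = np.random.default_rng(0)
ts = rng.normal(size=(300, n_regions))          # synthetic stand-in for rs-fMRI data
x_left, x_right = hemisphere_features(ts, left_idx, right_idx)
print(x_left.shape, x_right.shape)
```

Taking the upper triangle before the Fisher transform avoids the infinite values that the unit diagonal would otherwise produce. For HCP, the $z$-scores from the RL and LR runs would be averaged per session before use.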

Table 2: Summary of the HCP and GSP datasets used for the experiments, where “M” denotes male and “F” denotes female for sex; “L” denotes left-handedness, “R” denotes right-handedness, and “A” denotes ambidexterity for handedness; and “SD” denotes standard deviation.

Dataset	# Total Subjects	Sex (M/F)	Handedness (L/R/A)	Age (SD)	# Sessions
HCP [35]	960	445/515	85/875/0	28.7 (3.71)	2
GSP [36]	1570	665/905	110/1449/11	21.5 (2.89)	1

Experimental setting

Multivariate classification algorithm setup

For all multivariate methods, the classification problem is binary: left brain hemispheres are labeled as 0, and right brain hemispheres are labeled as 1. For GSDA-Logit, sex is utilized as the grouping factor in the experiments, encoding males as 0 and females as 1. Given the binary nature of the grouping factor, the matrix $\mathbf{G}$ simplifies to a vector $\mathbf{g}$ in this experiment. There are two hyperparameters in GSDA-Logit: $\alpha$ and $\lambda$. In this experiment, the value of $\alpha$ is set to 0.1, while $\lambda\in\{0,1.0,2.0,5.0,8.0,10.0\}$. When $\lambda=0$, indicating an absence of the grouping factor dependence for optimizing model weights, GSDA-Logit degenerates to a standard logistic regression for target-group data. A logistic regression classifier implemented in scikit-learn \citesuppscikit-learn with default hyperparameters is used both for learning first-order multivariate control models and for conducting second-order classification.

Cross-validation strategy

First-order classification setting

Given that each participant provides two brain hemispheres (left and right), the correlation between these hemispheres might impact the effectiveness of machine learning models. To address this, we divided our subjects into two groups: for 50% of the subjects, we used their left hemispheres for training, and for the remaining 50%, we used their right hemispheres. The hemispheres not selected for training were then used for testing, as illustrated in ④ of Fig. 1a. To further validate our findings, we employed an alternative cross-validation method, in which 20% of the subjects are held out, unseen by the models, as an additional test set. Training samples are drawn from the remaining 80% using the same selection process described above. Since HCP data includes two scanning sessions per subject on two different days, the session not involved in training serves as an additional test set. Each cross-validation strategy was iterated 1,000 times, resulting in 1,000 models for each learning task.
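The hemisphere-level split described above can be sketched as below. The helper name `hemisphere_split` and the 0/1 coding of hemispheres are assumptions of this example; the hold-out of 20% of subjects and the second HCP session are not shown.

```python
import numpy as np

def hemisphere_split(n_subjects, rng):
    """Return (subject, hemisphere) index pairs for training and testing.

    Hemisphere coding (assumed here): 0 = left, 1 = right.
    """
    perm = rng.permutation(n_subjects)
    half = n_subjects // 2
    train_hemi = np.empty(n_subjects, dtype=int)
    train_hemi[perm[:half]] = 0      # these subjects contribute left hemispheres
    train_hemi[perm[half:]] = 1      # the remaining subjects contribute right ones
    subjects = np.arange(n_subjects)
    train = np.stack([subjects, train_hemi], axis=1)
    test = np.stack([subjects, 1 - train_hemi], axis=1)   # the other hemisphere
    return train, test

rng = np.random.default_rng(0)
train, test = hemisphere_split(10, rng)
print(train)
print(test)
```

Each subject therefore appears exactly once in the training set and once in the test set, with opposite hemispheres, so the two hemispheres of one brain never sit on the same side of the split. Repeating the call with different random states corresponds to the repeated iterations described in the text.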

Second-order classification setting

With the 1,000 first-order models for each task learned, we perform a second-order classification through the following steps:

1. Select the classification problem of interest, for example, male-specific GSDA models trained on HCP with $\alpha=0.1$, $\lambda=5$ vs. female-specific GSDA models trained on HCP with $\alpha=0.1$, $\lambda=5$, with 1,000 models for each group.
2. Perform a random stratified split of the 2,000 models into training (80%) and test (20%) sets.
3. Train a standard logistic regression classifier using the scikit-learn \citesupp{scikit-learn} implementation with default hyperparameters on the training set and then evaluate the performance on the test set.
4. Repeat steps 2 and 3 1,000 times, using a different random seed for each training/test split.
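The steps above can be sketched with scikit-learn as follows. This is a hedged illustration rather than the authors' code: the function name `second_order_accuracy` is hypothetical, each first-order model is assumed to be represented by its weight vector, accuracy is assumed as the performance measure, and the weights below are synthetic stand-ins generated only so that the example runs.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split


def second_order_accuracy(male_weights, female_weights, n_repeats=1000, test_size=0.2):
    """Classify first-order models as male- or female-specific.

    Each row of the input arrays is assumed to be the weight vector of one
    first-order model. Returns the test accuracy of every repeated split.
    """
    X = np.vstack([male_weights, female_weights])
    y = np.concatenate(
        [np.zeros(len(male_weights), dtype=int), np.ones(len(female_weights), dtype=int)]
    )
    scores = []
    for seed in range(n_repeats):
        # Step 2: random stratified 80/20 split of the models.
        X_tr, X_te, y_tr, y_te = train_test_split(
            X, y, test_size=test_size, stratify=y, random_state=seed
        )
        # Step 3: logistic regression with default hyperparameters.
        clf = LogisticRegression().fit(X_tr, y_tr)
        scores.append(clf.score(X_te, y_te))
    return np.asarray(scores)


# Synthetic stand-ins for two groups of 100 models with 20 weights each.
rng = np.random.default_rng(0)
male_w = rng.normal(loc=1.0, size=(100, 20))
female_w = rng.normal(loc=-1.0, size=(100, 20))
scores = second_order_accuracy(male_w, female_w, n_repeats=5)
```

In the experiments described here, each group would contain 1,000 models and `n_repeats` would be 1,000; the smaller numbers above only keep the example fast.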

Data availability

This study used publicly available data from HCP (https://www.humanconnectome.org/) and GSP (https://www.neuroinfo.org/gsp/). Original data can be accessed via data use agreements. Processed intrahemispheric network data for classification are available at https://doi.org/10.5281/zenodo.10050233 for HCP and https://doi.org/10.5281/zenodo.10050234 for GSP. The Brainnetome atlas (BNA) is available at http://atlas.brainnetome.org/ and the information about the lobes and gyri is available at https://pan.cstcloud.cn/web/share.html?hash=6eRCJ0zDTFk.

Code availability

All custom preprocessing (in MATLAB) and analysis (in Python) code can be found in the following repository: https://github.com/shuo-zhou/GSDA-Lateralization. An online demonstration example is available at: https://colab.research.google.com/github/shuo-zhou/GSDA-Lateralization/blob/main/gsda_demo.ipynb


Acknowledgements

This work was supported by the STI 2030-Major Projects (2021ZD0200500, 2021ZD0201701), the National Natural Science Foundation of China (T2325006, 82021004), and the Fundamental Research Funds for the Central Universities (2233200020).

Author contributions statement

S.Z., J.L., H.L., and G.G. conceptualized the study. S.Z. and J.L. designed the research experiments. S.Z. developed the Group-Specific Discriminant Analysis algorithm and performed the classification experiments. J.L. analyzed model weights. Y.J. implemented the preprocessing pipeline and processed the data for experiments. S.Z. and H.W. developed the metric Group-Specificity Index for evaluation. S.Z., J.L., and H.W. wrote the manuscript. H.L. and G.G. performed critical revisions of the article. All authors reviewed and edited the manuscript.
