Abstract
Hypertrophic cardiomyopathy (HCM) is an important cause of morbidity and mortality, with pathogenic variants found in about a third of cases. Large-scale genome-wide association studies (GWAS) demonstrate that common genetic variation contributes to HCM risk. Here we derive polygenic scores (PGS) from HCM GWAS and genetically correlated traits and test their performance in the UK Biobank, 100,000 Genomes Project, and clinical cohorts. We show that higher PGS significantly increases the risk of HCM in the general population, particularly among pathogenic variant carriers, where HCM penetrance differs 10-fold between those in the highest and lowest PGS quintiles. Among relatives of HCM probands, PGS stratifies risks of developing HCM and adverse outcomes. Finally, among HCM cases, PGS strongly predicts the risk of adverse outcomes and death. These findings support the broad utility of PGS across clinical settings, enabling tailored screening and surveillance and stratification of risk of adverse outcomes.
Similar content being viewed by others
Main
Hypertrophic cardiomyopathy (HCM) is a primary cardiac disease characterized by excessive hypertrophy of the left ventricle with a population prevalence of 0.2%1. While many cases follow a benign course, HCM is an important cause of sudden cardiac death in young adults, and progressive disease is complicated by arrhythmia, stroke and heart failure2,3. Although HCM has classically been considered a Mendelian disease, a causal rare variant is identified in only one-third of cases4,5, with population studies highlighting the incomplete penetrance and variable expressivity of such variants6,7. Recent genome-wide association studies (GWAS) demonstrated that common variants contribute substantially to HCM risk (SNP, h2â=â0.29), identified many contributory loci and highlighted the complex genetic architecture of HCM8,9,10. Polygenic scores (PGS) summarize the cumulative risk arising from common variants and may provide important utility for population risk prediction and prognostication9,11. Still, it remains unclear whether PGS can inform the risk of HCM and clinical outcomes across broad clinical and population settings. In this study, we develop and evaluate a PGS for HCM, assessing its utility for stratification of both disease risk and severity in (1) individuals diagnosed with HCM, (2) relatives of affected individuals who are currently recommended to undergo screening and long-term surveillance, and (3) the general population, including individuals carrying disease-associated rare variants such as those that might be identified as secondary findings.
Results
Generation and evaluation of an HCM PGS in the general population
PGS was generated using the largest published GWAS comprising a total of 5,900 unrelated HCM cases and 68,359 controls of European ancestry from seven cohorts (PGSGWAS), and multitrait analysis of GWAS (MTAG) incorporating the HCM GWAS with GWAS of three genetically correlated cardiac magnetic resonance (CMR) imaging traits (left ventricular concentricity, left ventricular end-systolic volume (LVESV) and left ventricular circumferential strain) in 36,083 European ancestry participants in the UK Biobank10 (UKB; PGSMTAG; Fig. 1). In an independent cohort of 343,182 unrelated white British ancestry participants in UKB, PGSMTAG was associated with the risk of HCM (defined using International Classification of Diseases 9 and 10 (ICD-9 and ICD-10) codes, self-reporting and/or CMR imaging; Supplementary Note; odds ratio (OR)â=â2.34 per PGSMTAG s.d., 95% confidence interval (CI)â=â2.12â2.59, Pâ<â2âÃâ10â16), provided better predictive performance than PGSGWAS (ORâ=â1.97 per PGSGWAS s.d., 95% CIâ=â1.81â2.15, Pâ<â2âÃâ10â16; Supplementary Table 1) and was therefore used for all subsequent analyses unless otherwise stated (Supplementary Fig. 1). The distribution of PGS in the UKB population-based cohort is shown in Fig. 2a. Among HCM cases, 75.1% (95% CIâ=â71.4â79.5) have a PGS above the population mean, while those with a PGS greater than one standard deviation above the mean accounted for 46.4% (95% CIâ=â41.2â51.7) of cases (Fig. 2b). Having demonstrated associations between PGS and HCM risk, we evaluated effect sizes in the general population. Individuals with PGS in the highest centile (prevalenceâ=â10.9 cases per 1,000 individuals) had significantly higher risks of HCM compared with those in the median (prevalenceâ=â0.8 cases per 1,000 individuals, ORâ=â14.5, 95% CIâ=â9.5â22.2, Pâ=â3.5âÃâ10â35; time to HCM diagnosisâhazards ratio (HR)â=â3.6, 95% CIâ=â2.6â4.8, Pâ<â2âÃâ10â16) and lowest centiles (prevalenceâ=â0.3 cases per 1,000 individuals, ORâ=â36.6, 95% CIâ=â18.6â72.2, Pâ=â2.9âÃâ10â25; Fig. 2c,d, Supplementary Fig. 2 and Supplementary Tables 2 and 3).
Bayesian genome-wide PGS were generated from a published European-ancestry HCM GWAS meta-analysis of seven caseâcontrol studies (comprising 5,900 cases and 68,359 controls; PGSGWAS), and MTAG (analyzing HCM with three genetically correlated quantitative traits measured using CMR imaging in 36,083 European ancestry UKB participantsâLV concentricity (LVconc), LEVSV and left ventricular circumferential strain (straincirc); PGSMTAG)10. The value of PGS to support clinical decision-making was evaluated across three key settings: in the general population (including among carriers of pathogenic rare variants in HCM-causing genes (sarcomere-positive) that might be returned as secondary findings), in relatives of HCM probands currently recommended to undergo cascade screening and surveillance, and in confirmed HCM cases under longitudinal follow-up. The figure is created with BioRender.com. SARC-PLP, pathogenic or likely pathogenic variant in sarcomeric HCM genes.
To validate the PGS, we analyzed associations with PGS in the UKB population. a, PGSMTAG distribution in 374,845 UKB participants with and without HCM, demonstrating higher PGS in those with HCM. b, Cumulative curve of HCM cases ranked across PGS centiles. For example, approximately 75% of HCM cases have a PGS above the population 50th centile. Dashed lines represent mean, ±1 PGS s.d. and ±2 PGS s.d. Shaded line indicates 95% CI surrounding the cumulative estimate. c, Manhattan plot of HCM PGS phenome-wide association study in UKB, showing associations with cardiovascular and metabolic phenotypes. ICD-9 and ICD-10 diagnostic codes are mapped to Phecode Map (v1.2). Mapped phenotypes exceeding phenome-wide significance threshold (Pâ=â2.7âÃâ10â5, red line) are labeled. Blue line indicates nominal significance (Pâ<â0.05). Direction of triangle indicates the direction of effect of the PGS association. d, HCM prevalence and risk in UKB across the spectrum of PGS, demonstrating significantly higher HCM prevalence in individuals with the highest PGS (top centile, nâ=â3,394), compared with the median (nâ=â68,587) and lowest groups (bottom centile, nâ=â3,431). Effect estimates generated using logistic regression adjusting for age, age2, sex and top ten genetic principal components (PCs), with unadjusted two-sided P value. Data are presented as effect estimates with 95% CI. e, Cumulative hazards for lifetime diagnosis of HCM stratified by high (highest centile, red) and median (middle quintile, orange) PGS risk in UKB. HR calculated using Cox proportional hazards model, adjusted for age, age2, sex and first ten genetic PCs, with two-sided P value.
Exploring the role of polygenic risk on the expressivity of an HCM phenotype in 30,663 white British ancestry UKB participants who underwent CMR, PGSGWAS was associated with traits that are classically seen in HCM6: increased cardiac hypertrophy (maximum left ventricular wall thickness (maxLVWT)â=â+0.13âmm per PGS s.d., Pâ=â1.1âÃâ10â80; highest versus lowest PGS centileâ=â9.8 versus 9.1âmm, Pâ=â9âÃâ10â9), increased cardiac contractility (left ventricular ejection fraction (LVEF)â=â+0.6%, Pâ=â2.7âÃâ10â64; 61.3% versus 57.7%, Pâ=â2.7âÃâ10â13), reduced chamber volumes (left ventricular end-diastolic volume (LVEDV)â=ââ2.0âml, Pâ=â1.2âÃâ10â46; 142.1 versus 154.9âml, Pâ=â8.3âÃâ10â8) and LVESV (â1.7âml, Pâ=â6.9âÃâ10â80; 55.9 versus 66.4âml, Pâ=â2.7âÃâ10â10), all of which were biventricular in nature (Supplementary Fig. 3 and Supplementary Tables 4 and 5), and persisted when excluding participants with HCM (Supplementary Table 6).
Phenome-wide association study (PheWAS) of 1,839 clinical diagnoses in the UKB identified PGS associations with hypertension and metabolic phenotypes (dyslipidemia and type 2 diabetes; Supplementary Table 7). Mendelian randomization (MR) highlighted causal influence of blood pressure and body mass index, albeit with evidence of significant pleiotropy8, and no significant associations with lipid and glycemic traits (Supplementary Table 8 and Supplementary Fig. 4). The inverse association with heart failure9 (Fig. 2e) and absence of expected associations (for example, atrial fibrillation or flutter4) may be explained by the reciprocal relationship of HCM common genetic risk with dilated cardiomyopathy (DCM)9 (UKB DCMâORâ=â0.88 per PGS decile, 95% CIâ=â0.85â0.90, Pâ<â2âÃâ10â16; ORâ=â0.69 per PGS s.d., 95% CIâ=â0.64â0.74; Supplementary Fig. 2). MR analysis identified causal protective associations with heart failure (MR inverse variance weighted (IVW) βâ=ââ0.09, Pâ=â1.3âÃâ10â3). For atrial fibrillation, MR analysis showed no overall causal relationship (MR IVW βâ=â0.02, Pâ=â0.39), with individual HCM-risk variants associated with both increased and decreased atrial fibrillation risk, highlighting the complex pleiotropic relationship (Supplementary Fig. 5).
PGS performance in non-European ancestry populations
PGS derived from one ancestry underperform when applied to different or more diverse ancestral populations12,13,14,15. We adapted the European-ancestry PGS (PGSGWAS) by applying ancestry-specific linkage disequilibrium references16 and evaluated its performance for HCM status in 16,349 UKB participants of non-European ancestry (7,542 South Asian, 7,348 African and 1,457 East Asian ancestry), and for CMR quantitative traits in a subset. PGS distributions differed between the different ancestry groups (ANOVA Pâ<â2âÃâ10â16, Tukey adjusted Pâ<â2.5âÃâ10â8 for between-group comparisons), with PGS highest in African ancestry (HCM prevalenceâ=â0.4%), and lowest in South Asian ancestry (HCM prevalenceâ=â0.1%; Supplementary Fig. 6). Although analysis within each ancestral group was limited by power, as expected, PGS performance appeared to be poorer (South Asian (9 HCM cases, ORâ=â1.82 per PGS s.d., Pâ=â0.068), African (27 cases, ORâ=â1.21 per PGS s.d., Pâ=â0.35) and insufficient East Asian ancestry cases to allow estimation (2 cases)) compared within white British (ORâ=â2.34 per PGS s.d., Pâ<â2âÃâ10â16; Supplementary Fig. 6 and Supplementary Table 9).
To improve cross-ancestral polygenic prediction, we performed GWAS in an unrelated East Asian ancestry cohort of 174 HCM cases and 776 controls recruited from Singapore (no individual SNPs reaching genome-wide significance, Supplementary Fig. 7), and combined them with PGSGWAS to generate a cross-population PGS (PGSEast Asian) using PRS-CSx16. In 111 East Asian ancestry individuals with CMR in the UKB, PGSEast Asian was nominally associated with left ventricular volumetric (LVESV, change per PGS s.d.â=ââ3.6âml, Pâ=â0.025, Padjâ=â0.11) and wall thickness (maxLVWTâ=â+0.28âmm, Pâ=â0.017, Padjâ=â0.09) traits, which were not present when using the European only PGSGWAS (Supplementary Fig. 6 and Supplementary Table 10). This suggests that even a modest ancestry-specific GWAS can improve performance when applying a PGS in a new population.
PGS modulates the penetrance of HCM-causing rare variants
Among 318,945 UKB participants with whole-exome sequencing (WES), 640 were unrelated carriers of pathogenic or likely pathogenic variants in eight genes encoding components of the cardiac sarcomere (MYBPC3, MYH7, TNNT2, TNNI3, TPM1, ACTC1, MYL3 and MYL2; âsarcomere-positiveâ)17. A total of 336 HCM cases were identified, of which 43 cases were sarcomere-positive (penetranceâ=â6.7%, 95% CIâ=â4.9â8.9) and 293 cases were sarcomere-negative (prevalenceâ=â0.09%, 95% CIâ=â0.08â0.1). Among the UKB population, there was no difference in mean PGS in sarcomere-positive and sarcomere-negative participants (Pâ=â0.60; Supplementary Fig. 8), arguing against any unmeasured relationship between rare sarcomeric variants and common variant polygenicity that might arise from selective ascertainment. PGS was associated with HCM in both sarcomere-positive (ORâ=â2.35 per PGS s.d., Pâ=â1.1âÃâ10â6) and sarcomere-negative participants (ORâ=â2.15 per PGS s.d., Pâ<â2âÃâ10â16; Fig. 3a). Among the 640 unrelated sarcomere-positive individuals in UKB, penetrance by middle-older age (median ageâ=â72 years, IQRâ±â13 years) was markedly greater in those in the highest PGS quintile (HCM penetranceâ=â17.2%, 95% CIâ=â10.8â25.3) when compared with the median (5.7%, 95% CIâ=â2.1â12.0; highest versus median quintileâORâ=â3.69, 95% CIâ=â1.46â10.67, Pâ=â0.009) and lowest quintiles (2.3%, 95% CIâ=â0.5â6.6; highest versus lowest quintileâORâ=â9.56, 95% CIâ=â2.95â43.89, Pâ=â7.3âÃâ10â4; Fig. 3b). In time-to-event analyses, the risk of HCM diagnosis (HRâ=â3.98, 95% CIâ=â1.66â9.52, Pâ=â0.001) and adverse HCM outcomes (HRâ=â1.56, 95% CIâ=â1.1â2.34, Pâ=â0.029) were similarly greater in the highest compared with median quintile (Fig. 3c and Supplementary Fig. 8).
aâc, UKB represents a broadly unselected population, as participants were not recruited based on genotype or phenotype. dâf, 100,000 Genomes Project (GeL) comprises a mix of participants recruited based on cardiomyopathy and participants recruited with other rare diseases, cancer or as relatives of patients with a rare disease. a,d, The PGS distribution is shown in 640 sarcomere-positive UKB participants (a) and 599 GeL participants (d) with and without HCM, validating that PGS is higher in cases than controls. b,e, Among sarcomere-positive individuals, the highest PGS quintile (UKB, nâ=â136; GeL, nâ=â116) was associated with increased HCM diagnosis compared with median (UKB, nâ=â111; GeL, nâ=â116) and lowest quintiles (UKB, nâ=â133; GeL, nâ=â118). Effect estimates generated using logistic regression adjusting for age, age2, sex and top ten genetic PCs, with unadjusted two-sided P value. Data are presented as effect estimates with 95% CI. c,f, The time to HCM diagnosis in highest, median and lowest quintiles, shows that those with higher PGS are at increased risk of HCM, and develop disease earlier, which is important for lifetime burden of disease morbidities. HR calculated using Cox proportional hazards model, adjusted for age, age2, sex and first ten genetic PCs, with two-sided P value.
We confirmed the modulatory role of PGS on rare variants in 100,000 Genomes Project (GeL)18 (Fig. 3d) a study that recruited individuals with rare diseases (including cardiomyopathies) and their relatives. Because a small proportion of the GeL cohort was ascertained based on cardiomyopathy, we cannot use these data to directly quantify penetrance, but can nonetheless assess the effect size for PGS in combination with a rare variant on disease risk in this cohort. There were 599 sarcomere-positive participants, of which 72 were HCM cases (proportion affectedâ=â12.0%, 95% CIâ=â9.6â15.0%). PGSGeL (generated from MTAG summary statistics leaving out the GeL cohort) was associated with prevalent HCM (ORâ=â3.53 per PGSGeL s.d., 95% CIâ=â2.59â4.80, Pâ=â9.8âÃâ10â16; ORâ=â1.60 per PGSGeL decile, 95% CIâ=â1.41â1.85). Sarcomere-positive individuals with high PGSGeL (highest quintile) were more than 9 times as likely to have been ascertained as cases compared with the median (ORâ=â9.50, 95% CIâ=â3.60â32.6, Pâ=â4.3âÃâ10â5), and 25 times as likely compared with the lowest quintiles (ORâ=â25.1, 95% CIâ=â7.3â160.3, Pâ=â1.8âÃâ10â5; Fig. 3e). The hazards of HCM diagnosis was higher in the highest quintile than at the median (HRâ=â21.8, 95% CIâ=â4.8â98.6, Pâ=â0.0002) and lowest quintiles (HRâ=â42.8, 95% CIâ=â5.0â364.0, Pâ=â0.002; Fig. 3f). Finally, of 527 sarcomere-positive individuals without a diagnosis of HCM on recruitment to the 100,000 Genomes Project, a total of 7 were diagnosed with HCM on follow-up, 5 of whom had PGSGeL in the highest quintile.
Pathogenic rare variant effects remain greater than PGS risk
It has been suggested for several diseases that extreme PGS risk confers a similar magnitude of increased risk as the presence of Mendelian pathogenic variants (for example, familial hypercholesterolemia for coronary artery disease, and BRCA1 or BRCA2 for breast cancer12,19,20). In the UKB, while sarcomere-negative individuals with PGS at the uppermost extreme (defined as the top 0.25%âa frequency comparable to population estimates of pathogenic HCM rare variants (1 in 400)6) had an 18-fold increased risk of having HCM compared with the median (ORâ=â18.1, 95% CIâ=â10.0â32.9, Pâ=â1.6âÃâ10â21), they were at significantly lower risk of HCM and severity of imaging traits compared with sarcomere-positive individuals (ORâ=â5.4, 95% CIâ=â2.8â11.6, Pâ=â2.2âÃâ10â8; Supplementary Table 11). These findings suggest that while PGS accounts for an important component of risk in sarcomere-negative individuals, genetic HCM risk is highest among carriers of rare pathogenic variants in sarcomeric genes.
PGS stratifies the risk of HCM in relatives of probands
Understanding the penetrance of HCM in relatives of probands will have important implications on clinical practice (for example, screening and longitudinal surveillance). We sought to assess whether PGS modulates penetrance in relatives of sarcomere-positive HCM cases, and stratifies risk in sarcomere-negative families, in two cohorts.
The 100,000 Genomes Project was initially designed to evaluate genetically unexplained rare disease through the recruitment of cases and their relatives, and therefore, the cohort has a higher proportion of genetically unexplained sarcomere-negative cases than in the clinical setting (pathogenic rare variants were identified in 94 out of 919 HCM cases). In all, 288 relatives of 193 HCM index cases (262 gene-negative, and 26 gene-positive relatives of 14 gene-positive HCM cases) were recruited, of whom 116 had prevalent HCM and 6 were diagnosed with HCM during follow-up. PGSGeL was higher in probands (Pâ<â2âÃâ10â16) and affected relatives (Pâ=â3.6âÃâ10â6) compared with unaffected relatives, with no difference between probands and affected relatives (Pâ=â0.99). PGS was associated with increased risk of HCM (HCMâORâ=â1.74 per PGSGeL s.d., 95% CIâ=â1.37â2.21, Pâ=â5.1âÃâ10â6; highest versus median quintileâORâ=â4.17, 95% CIâ=â1.78â10.5, Pâ=â0.0015; Fig. 4a). Of 178 relatives who did not have a diagnosis of HCM on recruitment, 6 were diagnosed on follow-up (meanâ=â5.1 years), all with PGS in the highest quintile.
To evaluate applications of PGS in families undergoing screening and surveillance for HCM, we studied the relatives of HCM cases in two cohorts, GeL and EMC cohort. a, OR for HCM among relatives of HCM probands in the two cohorts (GeL, nâ=â288; EMC, nâ=â214), stratified by PGS. b, Violin and box and whisker plot of maxLVWT in sarcomere-positive relatives stratified by highest (nâ=â40) and lowest (nâ=â38) PGSEMC quintiles. Box plot indicate median and interquartile range, whiskers denote 1.5Ã the interquartile range, outliers shown separately, and the edges of violin plots indicate minimum and maximum values. Dashed line indicates a 13-mm cutoff used for guideline diagnosis of HCM in relatives of individuals with HCM. c, Cumulative major adverse cardiovascular events (MACE) among 214 sarcomere-positive relatives of HCM index patients stratified by PGSEMC above or below the median. MACE was defined as a composite of septal reduction therapy, cardiac transplantation, aborted cardiac arrest, appropriate defibrillator shock or sudden cardiac death. To avoid inflation of PGS performance resulting from sample overlap, PGS were rederived from GWAS leaving out the cohort that the PGS was being evaluated in (GeLâPGSGeL, EMCâPGSEMC). HR calculated using Cox proportional hazards model, adjusted for sex, first four genetic PCs, and genetic relatedness matrix, with two-sided P value.
The Erasmus Medical Center (EMC) cohort comprises 214 relatives of 184 index HCM cases, all carriers of rare pathogenic variants in sarcomere-encoding genes. After clinical evaluation, 135 relatives were found to have HCM. Although the PGSEMC (derived using HCM MTAG omitting EMC cohort) was not significantly associated with HCM in relatives (ORâ=â1.4 per PGSEMC s.d., 95% CIâ=â0.97â1.98, Pâ=â0.069; Fig. 4a), it was associated with increased maxLVWT (+1.4âmm per PGSEMC s.d., 95% CIâ=â0.6â2.1, Pâ=â5.0âÃâ10â4; highest versus lowest quintileâ+3.5âmm, 95% CIâ=â1.26â6.41, Pâ=â0.0035) and, importantly, with increased risk of major adverse cardiovascular events after study enrollment (HRâ=â1.74 per PGSEMC s.d., 95% CIâ=â1.03â2.91, Pâ=â0.036; highest versus lowest quintileâHRâ=â17.7, 95% CIâ=â0.9â347, Pâ=â0.058), prevention of which is the primary motivation for cascade screening and early diagnosis (Fig. 4b,c and Supplementary Fig. 9).
PGS as a new prognostic marker in HCM
Although some individuals with HCM have a relatively benign disease course, an important proportion suffer adverse outcomes, including cardiovascular death, and risk stratification, especially for preventable sudden death, remains an urgent clinical need. We sought to investigate whether PGS was associated with adverse outcomes and clinical features of severity in individuals with HCM. In 382 HCM cases in the UKB, a PGS in the highest quintile was associated with an increased risk of death and adverse cardiovascular outcomes after HCM diagnosis (deathâhighest versus lowest quintile, HRâ=â3.88, 95% CIâ=â1.33â11.29, Pâ=â0.013; adverse outcomes (HCM composite)âHRâ=â3.50, 95% CIâ=â1.74â7.03, Pâ=â4.4âÃâ10â4; Fig. 5a,b, Supplementary Fig. 10 and Supplementary Table 12). In 683 HCM cases in GeL, cases in the highest quintile had a sixfold increased risk of death after HCM diagnosis (HRâ=â6.30, 95% CIâ=â2.68â14.78, Pâ=â1.4âÃâ10â6; Fig. 5c and Supplementary Fig. 10). In 101 sarcomere-positive HCM cases from a clinical cohort (Royal Brompton & Harefield Hospitals, RBH), higher PGS was associated with a more severe hypertrophic phenotype (maxLVWTâ=â+1.6âmm per PGS s.d., 95% CIâ=â0.61â2.63, Pâ=â0.002; left ventricular massâ=â+13.8âg per PGS s.d., 95% CIâ=â1.5â26.2, Pâ=â0.03) with no difference seen in the overall cohort of 440 cases (Supplementary Fig. 10 and Supplementary Table 13).
aâc, Cumulative all-cause mortality and adverse HCM outcomes after HCM diagnosis in 382 HCM cases from the UKB (a,b) and cumulative all-cause mortality in 683 HCM cases from GeL (c), stratified by PGS in the highest and lowest quintiles. Adverse HCM outcomes include death, stroke, cardiac arrest, implantable cardioverter-defibrillator implantation, septal reduction therapy (alcoholâseptal ablation or surgical myectomy), LVAD implantation or cardiac transplantation. HR calculated using Cox proportional hazards model, adjusted for age, age2, sex and first ten genetic PCs, with two-sided P value.
Discussion
In this study, we generated a PGS for HCM and validated it across several independent populations, showing associations with categorical disease status, quantitative traits that define HCM and describe disease severity, and, most importantly, adverse cardiovascular events. We demonstrate broad potential clinical utility for PGS across a range of settings. Notably, PGS robustly stratifies penetrance in carriers of rare pathogenic variants in sarcomere-encoding genes and identifies those in the general population at the highest risk of developing HCM, associates with HCM risk in relatives of HCM probands and acts as a new risk marker for survival and adverse events in individuals with HCM.
Findings from this study emphasize the importance of the polygenic contribution to HCM disease risk, classically considered a Mendelian disease caused by rare variants in sarcomere-encoding genes4,21. Among individuals with HCM, the recognition of polygenic, rather than sarcomeric HCM, will be of diagnostic importance, with potential implications for clinical management, reproductive counseling and family screening22.
One of the key challenges in clinical practice remains understanding the variable penetrance and expressivity that characterize rare variants in HCM-causing sarcomeric genes2,3. In relatives of patients with HCM who have inherited a pathogenic variant, clinical screening and life-long surveillance from childhood are recommended, although many will not manifest until later life, if at all, and many of those who do manifest will follow a benign course without major adverse events. Increasingly, pathogenic variants are being identified in individuals with no personal or family history of HCM, as secondary findings through opportunistic screening alongside genetic testing for other indications23. We show that PGS has large and clinically meaningful effects (estimated to be approximately tenfold when comparing quintiles but larger still at more extremes of distribution) in carriers of rare HCM-causing variants, which we expect would translate to effective risk-adjusted strategies for HCM screening and surveillance. Furthermore, while current medical treatments are only indicated in individuals with established HCM, any future development of therapies in the prevention of cardiac hypertrophy in at-risk or genetically susceptible individuals could be of particular significance in the groups with the highest risk of disease penetrance. This is especially true given that both rare variants and PGS are measurable before clinical phenotypes of HCM develop.
In relatives of individuals with HCM, PGS risk modulates HCM penetrance in carriers of rare variants and stratifies HCM risk in sarcomeric gene-negative cases. Relatives with higher PGS were more likely to have HCM, adverse outcomes and increased wall thickness, with similar magnitudes of risk in relatives of both sarcomeric gene-positive and negative HCM. Importantly, relatives in the bottom PGS quintiles had low rates of lifetime adverse events. Directly quantifying PGS with genotyping arrays may guide ongoing surveillance strategies in close relatives of affected individuals, although further prospective work is required.
Within the general population, although PGS confers an increased risk of HCM at the extremes of distribution (ORâ=â15 for the highest centile compared with median), this risk was considerably lower than the risk arising from pathogenic rare variants (ORâ=â79). Our population estimates of PGS performance in the UKB are limited by recruitment targeting participants of middle age, with survival bias and incomplete ascertainment of cases likely to result in underestimation of the true effect of PGS in the general population. The applicability of routine and widespread use of PGS for disease screening remains uncertain24. As with any population screening approach, targeting PGS screening to individuals already at higher risk based on nongenetic factors can have a large impact on the numbers needed to test.
Among individuals with HCM, disease expressivity and prognosis are highly variable4,21,25. We demonstrate that PGS can stratify the risk of serious adverse events in individuals with HCM, including a roughly fourfold to sixfold difference in risk of death when comparing those with PGS in the highest and lowest quintiles. Despite the use of current clinical risk predictors of adverse outcomes3, many individuals do not benefit from interventions aimed at reducing this (for example, an implantable cardioverter-defibrillator26). The addition of PGS to existing clinical risk factors will be an important area for future research.
One of the main limitations of this and other PGS is that it has been derived from and extensively tested in European ancestry populations only. Despite this, we show that PGS stratifies ancestral risk, with the highest PGS found in African ancestry groups where the prevalence of unexplained left ventricular hypertrophy is known to be highest27,28,29,30. Within each non-European ancestry group in the UKB, performance is reduced compared to European ancestry performance, although it still associates with HCM-related cardiac traits in South Asian and East Asian populations. Furthermore, the addition of a small and individually underpowered ancestry-specific GWAS (East Asian) can improve the predictive performance of PGS. This lends hope that performing modestly sized ancestry-specific GWAS could be sufficient to generate PGS with comparable performance to European ancestry populations16.
In conclusion, this study identifies multiple clinical applications for a PGS in HCM, including in general population screening, stratification of rare variant carriers into higher and lower risk of penetrant HCM, and as a new risk predictor of adverse outcomes in individuals with HCM.
Methods
Ethics declaration
All patients gave written informed consent, and all studies were approved by the relevant regional research ethics committees and adhered to the principles set out in the Declaration of Helsinki. The UKB study was reviewed by the National Research Ethics Service (11/NW/0382 and 21/NW/0157). The 100,000 Genomes Project was reviewed by the National Research Ethics Service (14/EE/1112 and 13/EE/032). The RBH Biobank was reviewed and approved by the South CentralâHampshire B Research Ethics Committee (09/H0504/104+5 and 19/SC/0257). The Erasmus Medical Center was reviewed and approved by the Erasmus MC Medical Ethical Review Committee. All Singaporean participants recruited from the National Heart Center Singapore gave written informed consent and the study was approved by the Singhealth Centralised Institutional Review Board (2020/2353) and the Singhealth Biobank Research Scientific Advisory Executive Committee (SBRSA 2019/001v1).
GWAS and multitrait analysis
The base data for the HCM PGS are from the largest HCM GWAS, consisting of 5,900 cases and 68,359 unrelated controls from seven cohorts (100,000 Genomes Project (471 cases, 2,355 controls), BioResource Rare Diseases (239 cases, 7,203 controls), HCM Registry (2,431 cases, 40,283 controls) and clinical cohorts from Canada (1,035 cases, 13,889 controls), Italy (277 cases, 1,293 controls), the Netherlands (999 cases, 2,117 controls) and the UK (448 cases, 1,219 controls)10. HCM was defined as primary left ventricular hypertrophy in the absence of secondary causes (uncontrolled hypertension, aortic valve disease, infiltrative cardiomyopathic processes and cases arising from complex syndromes), using a combination of clinical, imaging and ICD-9 and ICD-10 definitions. Detailed information on cohorts included in the GWAS is provided in the original publication10. Cases and controls included in the HCM GWAS were of European ancestry. The heritability of HCM attributable to common genetic variants was 0.25, assessed using genome-based restricted maximum likelihood31. Leveraging the increased power generated from jointly analyzing genetically correlated traits using the MTAG method32, MTAG of HCM was performed using mtag32 with three genetically correlated quantitative left ventricular traits derived from CMR imaging in 36,083 participants in the UKB (left ventricular concentricity, LVESV and left ventricular circumferential strain)10. Traits were selected based on the hierarchical clustering of ten CMR traits and genetic correlation with HCM. Additional details of left ventricular trait GWAS and trait selection for MTAG are reported in the Supplementary Note.
PGS derivation and evaluation
Individual SNP weighted scores were generated from the primary discovery GWAS and MTAG. The base GWAS and MTAG summary statistics were filtered to exclude rare and uncommon variants (minor allele frequency (MAF)â<â1%), and ambiguous SNPs that were not resolvable by strand flipping. A locus on chromosome 11 surrounding MYBPC3 was found to be associated with HCM in only sarcomere-positive HCM, specifically in one cohort (Netherlands), and was determined to represent a founder effect10. Variants with Pâ<â1âÃâ10â5 on chromosome 11 from 30,000,000 to 80,000,000 (GRCh37) were excluded from PGS calculation.
We calculated HCM PGS for unrelated (third degree or closer) white British participants in the UKB (application, 47602), using variants that passed genotyping QC (MAFâ>â1%, genotyping rateâ>â0.99, HWE Pâ>â1âÃâ10â6). Variants overlapping the base, target and linkage disequilibrium reference set (1000 Genomes Project Phase 3 European ancestry) were included. The individual SNP scores were generated using PRS-CS v1.0, a package that uses a Bayesian framework to model linkage disequilibrium using an external linkage disequilibrium reference set and a continuous shrinkage prior on SNP effect sizes33. The phi constant was automatically selected by an unsupervised approach (PRS-CS auto). Whole-genome PGS scores for all included UKB individuals and testing cohorts were calculated using the âscoreâ function in PLINK v1.9 (ref. 34).
PGS was applied and tested within a range of cohorts and clinical settings. Given that a key factor in the predictive power of PGS is the power of the base GWAS35, we first compared the performance of PGS generated using GWAS (PGSGWASâ=â376,730 SNP predictors) and MTAG10 (PGSMTAGâ=â374,113 SNP predictors) summary statistics in 343,182 unrelated white British ancestry participants in the UKB. Predictive performance of PGS was assessed by comparing Nagelkereâs R2, area under the receiver operating characteristic (AUROC) and association with HCM (OR per PGS s.d.).
Inclusion of participants in both the testing and GWAS datasets results in substantial inflation of PGS performance36. To prevent this, where caseâcontrol PGS testing was performed in a cohort that was included in the main GWAS (for example, GeL), PGS was generated using a leave-one-study-out GWAS and MTAG that did not include the cohort. All other methods for PGS generation remained the same. In situations where only cases are included in the assessment of PGS, the overall MTAG results were used.
Cohorts
UKB
UKB is a population-based cohort study from half a million UK participants, with detailed clinical, imaging and genetic data. Participants from UKB that were included in testing were unrelated (third degree or closer) and of white British ancestry. HCM cases were identified from self-report clinical data (hospital admissions and death registry), and CMR imaging (maxLVWTâ>â15âmm). Time to clinical event was identified from UKB first occurrences data, operation dates, and death dates. Participants in the imaging substudy were randomly invited from the overall cohort. Each underwent CMR at 1.5-T. Segmentation of the cine images was performed by using a deep learning neural network algorithm and has previously been reported6.
100,000 Genomes Project
The 100,000 Genomes Project is a national UK program that recruited probands with rare diseases and cancer from clinical centers, together with family members, and performed germline and somatic (for a subset of participants with cancer) WGS18,37. In total, 683 HCM cases were identified from Human Phenotype Ontology terms at the time of study recruitment, and ICD-9 and ICD-10 codes from preceding and subsequent clinical episodes.
EMC cohort
To evaluate the role of PGS in modulating penetrance of sarcomeric variants in relatives of HCM cases, we used a subset of 214 relatives of 184 HCM probands from an ongoing HCM registry at the EMC38,39. All individuals were carriers of pathogenic sarcomeric variants, with the exclusion of homozygous carriers or those carrying multiple pathogenic or likely pathogenic variants.
Royal Brompton and Harefield Hospitals cohort
A total of 440 unrelated white British HCM cases from RBH40 were used to assess the effect of PGS on CMR imaging traits. Data from the clinical CMR scan taken at or before study recruitment were used, and where sequential CMR scans were available, follow-up imaging data was recorded to identify changes in imaging traits.
Statistical analysis
In the UKB, PGS model performance was assessed using Nagelkerkeâs R2, adjusting the null model for age, age2, sex and first ten principal components. The predictive AUROC was determined using a randomly subsetted training (70%) and validation (30%) cohort using R-package pROC (v1.18.0)41. For association between PGS and HCM status in UKB and GeL, logistic regression was performed adjusting for age, age2, sex and first ten principal components. In EMC, this was assessed using Wald logistic mixed-effects model using GMMAT (v1.3.2) adjusting for fixed-effects of sex, age, age2 and first four principal components and incorporating a genetic relatedness matrix estimated using GCTA (v1.92.2beta)42 as a random effect. For quantitative imaging traits in UKB and RBH, PGS association was evaluated using linear regression adjusting for age, age2, sex, first ten principal components, systolic blood pressure and body surface area, and differences between means in stratified groups were performed with ANCOVA testing adjusted for age, age2, sex, body surface area, systolic blood pressure and first ten principal components. In EMC, the association of PGS with maxLVWT was assessed using a linear mixed-effects model using coxme (v2.2-17)43, adjusting for sex, age at imaging, age at imaging2, imaging modality (CMR versus transthoracic echocardiogram), first four principal components and the genetic relatedness matrix. Time-to-event data in UKB was evaluated using the Cox proportional hazards model, adjusting for age, age2, sex and first ten principal components using survival (v2.44-1.1). Hazards assumption for proportionality was assessed, and for outcomes that did not include death, a competing risk analysis was performed. In EMC, the association between PGS and clinical events was assessed using a Cox proportional hazards mixed-effects model using R-package coxme (v2.2-17)43 adjusted for sex, first four principal components, genetic relatedness matrix and presence of MYH7 rare variant genotype status. Time-to-event analysis was performed using survival (v3.5-7) and survival curves were created using survminer (v0.4.9). Although clinical data were complete for most individuals in all cohorts, where missing data was present, individuals were excluded from analysis. All statistical analysis was performed in R. For multi-ancestry analysis, ancestry as categorical variable was included in the regression model.
Rare variant status
The pathogenicity of rare variants in eight definitive HCM-causing genes17 (MYBPC3, MYH7, TNNT2, TNNI3, TPM1, ACTC1, MYL3 and MYL2) was determined using broadly similar approaches across cohorts in line with ACMG guidelines44 (Supplementary Note). Individuals without pathogenic or likely pathogenic variants were identified as gene-negative individuals.
Outcomes
For a diagnosis of HCM in the UKB, HCM cases were identified from self-reporting, ICD-9 and ICD-10 codes from hospital encounters and the national death register, and CMR imaging (maxLVWTâ>â15âmm), in the absence of aortic stenosis (Supplementary Note). For the analysis of imaging traits in HCM cases, we further refined the diagnosis by restricting only to individuals with a maxLVWT of at least 13âmm. PGS association with a range of HCM-relevant cardiac imaging traits associated with cardiac structure (maxLVWT, LVEDV, LVESV, left atrium volume and fractal dimensions) and function (LVEF, and strain measurements) was tested. Longitudinal risk of time to HCM diagnosis and for major adverse cardiovascular events was assessed. Clinical and operative outcomes were selected based on their relevance to HCM, incorporating self-reported diagnoses, hospital admission events, primary care records and death records (Supplementary Note). Diagnosis of HCM in additional cohorts (EMC, GeL and RBH) and clinical outcomes in EMC are reported in Supplementary Note.
PGS generation and testing in diverse ancestry groups
PGS generated using European ancestry GWAS have weaker performance when tested in more diverse ancestry populations12,13,14,15. We first aimed to evaluate PGS performance in participants of Afro-Caribbean (nâ=â661), East Asian (nâ=â504) and South Asian (nâ=â489) ancestry groups in UKB by applying ancestry-specific 1000 Genomes Project linkage disequilibrium reference sets to the European ancestry GWAS and MTAG when generating PGS. Ancestries of UKB participants were determined based on self-reported ancestry, followed by visualization of principal component plots and manual selection of principal component thresholds. Given that PGS are not comparable between differing ancestries due to underlying differing genetic architecture, analyses using PGS as a continuous variable were restricted within single ancestry groups. For analysis stratifying by quantiles, quantile stratification was first performed within each ancestry before being combined with other ancestries.
PRS-CSx v1.0 was used to extend the Bayesian polygenic modeling and prediction methods of PRS-CS by combining GWAS summary statistics from multiple ancestry groups and has been shown to improve cross-ancestry prediction16. We aimed to evaluate the performance of PGS generated using this approach for the prediction of HCM-associated CMR traits in East Asian ancestry participants in the UKB by combining the European ancestry GWAS with a small East Asian ancestry GWAS (Singapore cohort).
Singapore HCM GWAS
GWAS was performed on 184 cases and 776 controls of East Asian ancestry. Genotyping was performed using Infinium OmniExpress-24 kit (Illumina). Imputation was performed on the Michigan Imputation Server45 using Minimac4 (v1.5.7) and East Asian reference genomes (1000 Genomes Phase 3 (v5)46, 1000 Genomes Phase 1 (v3)46 and Genome Asia Pilot47). Postimputation QC was performed at variant (HWE Pâ>â1âÃâ10â7, genotypingâ>â0.95, information scoreâ>â0.5 and MAFâ>â1%) level. East Asian ancestry individuals were identified using principal component analysis and one of a pair of second degree or closer relatives was retained. GWAS was tested using SNPTEST (v2.5.6)48 adjusting for age, sex and first ten principal components.
PheWAS and MR
PheWAS was performed in the UKB to investigate the pleiotropic effects of the HCM PGS. ICD-9 and ICD-10 codes from death records and hospital admission episodes were translated to Phecodes (Phecode Map 1.2)49,50. For phenotypes with at least 20 cases, PGS-phenotype association was tested using logistic regression adjusted for age, age2, sex and first ten principal components as covariates. Significance threshold was adjusted for the total number of phenotypes tested (Pâ<â2.72âÃâ10â5), and data were presented with Manhattan plots grouping by body system. PheWAS was performed using the PheWAS (v0.99.5-5)51 in R (v4.0.3).
To further evaluate the directionality of effect for select significant PheWAS associations (hypertension, dyslipidemia, type 2 diabetes), two-sample bidirectional MR was performed for relevant quantitative traits (systolic and diastolic blood pressure, hypercholesterolemia, glycated hemoglobin and body mass index). To maximize MR power, the exposure trait GWAS with the largest number of significant SNPs after harmonization from the IEU GWAS database was used as the instrument52,53,54,55 (Supplementary Table 8). To further evaluate associations with atrial fibrillation and heart failure, post hoc MR was performed using summary statistics of the two largest available published GWAS56,57. For all MR analyses, instruments were harmonized with the HCM MTAG after linkage disequilibrium pruning. Two-sample MR using the IVW method was performed using the TwoSampleMR (v0.6.4)58,59 and MRInstruments (v0.3.2) in R (v4.0.3).
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
Data from UKB can be requested from the UKB Access Management System (https://www.ukbiobank.ac.uk/enable-your-research/apply-for-access). Data from the 100,000 Genomes Project can be accessed following the application to join the Genomics England Clinical Interpretation Partnership (https://www.genomicsengland.co.uk/research/academic/join-research-network). The PGS are available for download from the Polygenic Score Catalog (https://www.pgscatalog.org) under accessions PGS004910 and PGS004911. GWAS and MTAG results10 used to generate PGS are available for download from the GWAS Catalog (https://www.gwascatalog.org) under accessions GCST90432127 and GCST904321230.
Code availability
Custom analysis code is available from Zenodo (https://doi.org/10.5281/zenodo.11204463)60.
References
Semsarian, C., Ingles, J., Maron, M. S. & Maron, B. J. New perspectives on the prevalence of hypertrophic cardiomyopathy. J. Am. Coll. Cardiol. 65, 1249â1254 (2015).
Ommen, S. R. et al. 2020 AHA/ACC guideline for the diagnosis and treatment of patients with hypertrophic cardiomyopathy: a report of the American College of Cardiology/American Heart Association joint committee on clinical practice guidelines. Circulation 142, e558âe631 (2020).
Elliott, P. M. et al. 2014 ESC Guidelines on diagnosis and management of hypertrophic cardiomyopathy: the Task Force for the Diagnosis and Management of Hypertrophic Cardiomyopathy of the European Society of Cardiology (ESC). Eur. Heart J. 35, 2733â2779 (2014).
Marian, A. J. & Braunwald, E. Hypertrophic cardiomyopathy: genetics, pathogenesis, clinical manifestations, diagnosis, and therapy. Circ. Res. 121, 749â770 (2017).
McKenna, W. J. & Judge, D. P. Epidemiology of the inherited cardiomyopathies. Nat. Rev. Cardiol. 18, 22â36 (2021).
De Marvao, A. et al. Phenotypic expression and outcomes in individuals with rare genetic variants of hypertrophic cardiomyopathy. J. Am. Coll. Cardiol. 78, 1097â1110 (2021).
Jurgens, S. J. et al. Analysis of rare genetic variation underlying cardiometabolic diseases and traits among 200,000 individuals in the UK Biobank. Nat. Genet. 54, 240â250 (2022).
Harper, A. R. et al. Common genetic variants and modifiable risk factors underpin hypertrophic cardiomyopathy susceptibility and expressivity. Nat. Genet. 53, 135â142 (2021).
Tadros, R. et al. Shared genetic pathways contribute to risk of hypertrophic and dilated cardiomyopathies with opposite directions of effect. Nat. Genet. 53, 128â134 (2021).
Tadros, R. et al. Large-scale genome-wide association analyses identify novel genetic loci and mechanisms in hypertrophic cardiomyopathy. Nat. Genet. https://doi.org/10.1038/s41588-025-02087-4 (2025).
Biddinger, K. J. et al. Rare and common genetic variation underlying the risk of hypertrophic cardiomyopathy in a national biobank. JAMA Cardiol. 7, 715â722 (2022).
Thompson, D. J. et al. A systematic evaluation of the performance and properties of the UK Biobank Polygenic Risk Score (PRS) Release. PLoS One 19, e0307270 (2024).
Duncan, L. et al. Analysis of polygenic risk score usage and performance in diverse human populations. Nat. Commun. 10, 3328 (2019).
Martin, A. R. et al. Clinical use of current polygenic risk scores may exacerbate health disparities. Nat. Genet. 51, 584â591 (2019).
Kamiza, A. B. et al. Transferability of genetic risk scores in African populations. Nat. Med. 28, 1163â1166 (2022).
Ruan, Y. et al. Improving polygenic prediction in ancestrally diverse populations. Nat. Genet. 54, 573â580 (2022).
Ingles, J. et al. Evaluating the clinical validity of hypertrophic cardiomyopathy genes. Circ. Genom. Precis. Med. 12, e002460 (2019).
Caulfield, M. et al. The national genomics research and healthcare knowledgebase. Figshare https://doi.org/10.6084/m9.figshare.4530893.v5 (2019).
Khera, A. V. et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat. Genet. 50, 1219â1224 (2018).
Mars, N. et al. The role of polygenic risk and susceptibility genes in breast cancer over the course of life. Nat. Commun. 11, 6383 (2020).
Ingles, J. et al. Nonfamilial hypertrophic cardiomyopathy: prevalence, natural history, and clinical implications. Circ. Cardiovasc. Genet. 10, e001620 (2017).
Watkins, H. Time to think differently about sarcomere-negative hypertrophic cardiomyopathy. Circulation 143, 2415â2417 (2021).
Ormondroyd, E. et al. Secondary findings in inherited heart conditions: a genotype-first feasibility study to assess phenotype, behavioural and psychosocial outcomes. Eur. J. Hum. Genet. 28, 1486â1496 (2020).
Adeyemo, A. et al. Responsible use of polygenic risk scores in the clinic: potential benefits, risks and gaps. Nat. Med. 27, 1876â1884 (2021).
Spirito, P. et al. Clinical course and prognosis of hypertrophic cardiomyopathy in an outpatient population. N. Engl. J. Med. 320, 749â755 (1989).
Weissler-Snir, A., Dorian, P., Rakowski, H., Care, M. & Spears, D. Primary prevention implantable cardioverter-defibrillators in hypertrophic cardiomyopathyâare there predictors of appropriate therapy? Heart Rhythm 18, 63â70 (2021).
Maron, B. J. et al. Prevalence of hypertrophic cardiomyopathy in a general population of young adults. Echocardiographic analysis of 4111 subjects in the CARDIA Study. Coronary Artery Risk Development in (Young) Adults. Circulation 92, 785â789 (1995).
Maron, B. J. et al. Prevalence of hypertrophic cardiomyopathy in a population-based sample of American Indians aged 51 to 77 years (the Strong Heart Study). Am. J. Cardiol. 93, 1510â1514 (2004).
Eberly, L. A. et al. Association of race with disease expression and clinical outcomes among patients with hypertrophic cardiomyopathy. JAMA Cardiol. 5, 83â91 (2020).
Bai, Y. et al. Prevalence, incidence and mortality of hypertrophic cardiomyopathy based on a population cohort of 21.9 million in China. Sci. Rep. 12, 18799 (2022).
Yang, J. et al. Genome partitioning of genetic variation for complex traits using common SNPs. Nat. Genet. 43, 519â525 (2011).
Turley, P. et al. Multi-trait analysis of genome-wide association summary statistics using MTAG. Nat. Genet. 50, 229â237 (2018).
Ge, T., Chen, C. Y., Ni, Y., Feng, Y. A. & Smoller, J. W. Polygenic prediction via Bayesian regression and continuous shrinkage priors. Nat. Commun. 10, 1776 (2019).
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559â575 (2007).
Dudbridge, F. Polygenic epidemiology. Genet. Epidemiol. 40, 268â272 (2016).
Wray, N. R. et al. Pitfalls of predicting complex traits from SNPs. Nat. Rev. Genet. 14, 507â515 (2013).
Smedley, D. 100,000 Genomes Project Pilot Investigators et al. 100,000 Genomes pilot on rare-disease diagnosis in health careâpreliminary report. N. Engl. J. Med. 385, 1868â1880 (2021).
Vriesendorp, P. A. et al. Long-term outcomes after medical and invasive treatment in patients with hypertrophic cardiomyopathy. JACC Heart Failure 2, 630â636 (2014).
Van Velzen, H. G. et al. Effect of gender and genetic mutations on outcomes in patients with hypertrophic cardiomyopathy. Am. J. Cardiol. 122, 1947â1954 (2018).
Walsh, R. et al. Reassessment of Mendelian gene pathogenicity using 7,855 cardiomyopathy cases and 60,706 reference samples. Genet. Med. 19, 192â203 (2017).
Robin, X. et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics 12, 77 (2011).
Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76â82 (2011).
Ripatti, S. & Palmgren, J. Estimation of multivariate frailty models using penalized partial likelihood. Biometrics 56, 1016â1022 (2000).
Richards, S. et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet. Med. 17, 405â424 (2015).
Das, S. et al. Next-generation genotype imputation service and methods. Nat. Genet. 48, 1284â1287 (2016).
Auton, A. et al. A global reference for human genetic variation. Nature 526, 68â74 (2015).
Wall, J. D. et al. The GenomeAsia 100K Project enables genetic discoveries across Asia. Nature 576, 106â111 (2019).
Marchini, J., Howie, B., Myers, S., McVean, G. & Donnelly, P. A new multipoint method for genome-wide association studies by imputation of genotypes. Nat. Genet. 39, 906â913 (2007).
Wei, W.-Q. et al. Evaluating phecodes, clinical classification software, and ICD-9-CM codes for phenome-wide association studies in the electronic health record. in. PLoS ONE 12, e0175508 (2017).
Wu, P. et al. Mapping ICD-10 and ICD-10-CM Codes to Phecodes: workflow development and initial evaluation. JMIR Med. Inform. 7, e14325 (2019).
Carroll, R. J., Bastarache, L. & Denny, J. C. R PheWAS: data analysis and plotting tools for phenome-wide association studies in the R environment. Bioinformatics 30, 2375â2376 (2014).
Evangelou, E. et al. Genetic analysis of over 1 million people identifies 535 new loci associated with blood pressure traits. Nat. Genet. 50, 1412â1425 (2018).
Elsworth, B. et al. The MRC IEU OpenGWAS data infrastructure. Preprint at bioRxiv https://doi.org/10.1101/2020.08.10.244293 (2020).
Chen, J. et al. The trans-ancestral genomic architecture of glycemic traits. Nat. Genet. 53, 840â860 (2021).
Willer, C. J. et al. Discovery and refinement of loci associated with lipid levels. Nat. Genet. 45, 1274â1283 (2013).
Shah, S. et al. Genome-wide association and Mendelian randomisation analysis provide insights into the pathogenesis of heart failure. Nat. Commun. 11, 163 (2020).
Nielsen, J. B. et al. Biobank-driven genomic discovery yields new insight into atrial fibrillation biology. Nat. Genet. 50, 1234â1239 (2018).
Hemani, G., Tilling, K. & Davey Smith, G. Orienting the causal relationship between imprecisely measured traits using GWAS summary data. PLoS Genet. 13, e1007081 (2017).
Hemani, G. et al. The MR-Base platform supports systematic causal inference across the human phenome. eLife 7, e34408 (2018).
Zheng, S. L. hcm_pgs_code. Zenodo https://doi.org/10.5281/zenodo.11204463 (2024).
Acknowledgements
This work was supported by funding from the British Heart Foundation (RE/18/4/34215 (to S.L.Z.), FS/IPBSRF/22/27059 (to K.A.M.), FS/15/81/31817 (to C.F.), RG/19/6/34387 (to D.P.O.), RE/13/1/30181 (to H.W.)), the Medical Research Council (MC_UP_1605/13 to S.L.Z., X.X., P.I.T., A.d.M., L.C., S.A.C., D.P.O. and J.S.W.), Wellcome Trust (107469/Z/15/Z), the National Institute for Health Research (NIHR) Imperial College Biomedical Research Centre (to R.J.B., P.J.R.B. and J.S.W.), Sir Jules Thorn Charitable Trust (21JTA to J.S.W.), National Heart Lung Institute Foundation (to S.L.Z., R.J.B., A.d.M., L.C., B.P.H., S.A.C., P.J.R.B. and J.S.W.), the Dutch Heart Foundation (03-007-2022-0035 (to S.J.J.) and CVON PRIME (to C.R.B.)), Rosetrees Trust (to J.S.W.), Leducq Foundation (17CVD02 to C.R.B.), Amsterdam Cardiovascular Sciences (to C.R.B.), European Commission (LSHM-CT-2007-037273 and HEALTH-F2-2013-601456 to A.R.H., A.G. and H.W.), European Joint Programme Rare Diseases (LQTS-NEXT, ZonMW project 40-46300-98-19009 to C.R.B.), The Horstingstuit Foundation (Genetic factors underlying Sudden Cardiac Death to C.R.B.), the Fetal Medicine Foundation (495237 (to A.d.M.)). The Watkins (H.W.) and Ware (J.S.W.) laboratories are supported by the British Heart Foundation's Big Beat Challenge awarded to CureHeart (BBC/F/21/220106).
This research has been conducted in part using the UKB Resource (applications 18545, 40616 and 47602). This research was made possible through access to the data and findings generated by the 100,000 Genomes Project (project 667). The 100,000 Genomes Project is managed by Genomics England Limited (a wholly owned company of the Department of Health and Social Care) and funded by the National Institute for Health Research and NHS England. The Wellcome Trust, Cancer Research UK and the Medical Research Council have also funded research infrastructure. The 100,000 Genomes Project uses data provided by patients and collected by the National Health Service as part of their care and support. Genotyping was supported by the Institute of Psychiatry Psychology and Neuroscience (IoPPN) Genomics & Biomarker Core Facility within Kingâs College London, who gratefully acknowledge capital equipment funding from the Maudsley Charity (980) and Guyâs and St Thomasâs Charity (STR130505).
The views expressed in this work are those of the authors and not necessarily those of the funders. For the purpose of open access, the authors have applied a Creative Commons Attribution (CC BY) license to any Author Accepted Manuscript version arising from this submission.
Author information
Authors and Affiliations
Consortia
Contributions
S.L.Z. conceived, designed and performed the experiments, performed statistical analysis, analyzed the data and wrote the paper with input from all authors. J.S.W., C.R.B. and H.W. conceived and designed the experiments, contributed data, wrote the paper and jointly supervised this work. S.J.J. and A.G. performed statistical analysis and analyzed the data. K.A.M., X.X., C.G., P.I.T., R.J.B., C.F., A.d.M., L.C., W.B., C.J.P., T.H.C., P.J., M.A.v.S., J.M.A.V., A.R.H., E.O., C.W.L.C., HCM GWAS Collaborators, A.P., J.B., B.P.H., P.M., Y.M.P., R.W., A.S.A., A.A.M.W., S.A.C., S.K.P., P.J.R.B., D.P.O., R.T.L., R.T. and M.M. contributed in data collection.
Corresponding author
Ethics declarations
Competing interests
S.L.Z. has acted as a consultant for Health Lumen. R.T.L. has acted as a consultant for Health Lumen and Fitfile, and received funding from Pfizer. J.S.W. has acted as a consultant for MyoKardia, Pfizer, Foresite Labs and Health Lumen, and received institutional support from Bristol Myers Squibb and Pfizer. M.M. has received research support or consultancy fees from Bristol Myers Squibb, Cytokinetics, Pfizer, Sanofi Genzyme, Biomarin and Alnylam. The remaining authors declare no competing interests.
Peer review
Peer review information
Nature Genetics thanks Jack OâSullivan and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Additional information
Publisherâs note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Supplementary Information
Supplementary Figs. 1â10 and Supplementary Note.
Supplementary Tables
Supplementary Tables 1â13.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the articleâs Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the articleâs Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zheng, S.L., Jurgens, S.J., McGurk, K.A. et al. Evaluation of polygenic scores for hypertrophic cardiomyopathy in the general population and across clinical settings. Nat Genet (2025). https://doi.org/10.1038/s41588-025-02094-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41588-025-02094-5