Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

    Andres Metspalu

    The timing of puberty is a highly polygenic childhood trait that is epidemiologically associated with various adult diseases. Using 1000 Genomes Project-imputed genotype data in up to ∼370,000 women, we identify 389 independent signals (P... more
    The timing of puberty is a highly polygenic childhood trait that is epidemiologically associated with various adult diseases. Using 1000 Genomes Project-imputed genotype data in up to ∼370,000 women, we identify 389 independent signals (P < 5 × 10(-8)) for age at menarche, a milestone in female pubertal development. In Icelandic data, these signals explain ∼7.4% of the population variance in age at menarche, corresponding to ∼25% of the estimated heritability. We implicate ∼250 genes via coding variation or associated expression, demonstrating significant enrichment in neural tissues. Rare variants near the imprinted genes MKRN3 and DLK1 were identified, exhibiting large effects when paternally inherited. Mendelian randomization analyses suggest causal inverse associations, independent of body mass index (BMI), between puberty timing and risks for breast and endometrial cancers in women and prostate cancer in men. In aggregate, our findings highlight the complexity of the genetic...
    To identify novel coding association signals and facilitate characterization of mechanisms influencing glycemic traits and type 2 diabetes risk, we analyzed 109,215 variants derived from exome array genotyping together with an additional... more
    To identify novel coding association signals and facilitate characterization of mechanisms influencing glycemic traits and type 2 diabetes risk, we analyzed 109,215 variants derived from exome array genotyping together with an additional 390,225 variants from exome sequence in up to 39,339 normoglycemic individuals from five ancestry groups. We identified a novel association between the coding variant (p.Pro50Thr) in AKT2 and fasting insulin, a gene in which rare fully penetrant mutations are causal for monogenic glycemic disorders. The low-frequency allele is associated with a 12% increase in fasting plasma insulin (FI) levels. This variant is present at 1.1% frequency in Finns but virtually absent in individuals from other ancestries. Carriers of the FI-increasing allele had increased 2-hour insulin values, decreased insulin sensitivity, and increased risk of type 2 diabetes (odds ratio=1.05). In cellular studies, the AKT2-Thr50 protein exhibited a partial loss of function. We ext...
    Genome-wide association studies have so far identified 56 loci associated with risk of coronary artery disease (CAD). Many CAD loci show pleiotropy; that is, they are also associated with other diseases or traits. This study sought to... more
    Genome-wide association studies have so far identified 56 loci associated with risk of coronary artery disease (CAD). Many CAD loci show pleiotropy; that is, they are also associated with other diseases or traits. This study sought to systematically test if genetic variants identified for non-CAD diseases/traits also associate with CAD and to undertake a comprehensive analysis of the extent of pleiotropy of all CAD loci. In discovery analyses involving 42,335 CAD cases and 78,240 control subjects we tested the association of 29,383 common (minor allele frequency >5%) single nucleotide polymorphisms available on the exome array, which included a substantial proportion of known or suspected single nucleotide polymorphisms associated with common diseases or traits as of 2011. Suggestive association signals were replicated in an additional 30,533 cases and 42,530 control subjects. To evaluate pleiotropy, we tested CAD loci for association with cardiovascular risk factors (lipid trait...
    The continued identification of new low-penetrance genetic variants for colorectal cancer (CRC) raises the question of their potential cumulative effect among compound carriers. We focused on 6 SNPs (rs380284, rs4464148, rs4779584,... more
    The continued identification of new low-penetrance genetic variants for colorectal cancer (CRC) raises the question of their potential cumulative effect among compound carriers. We focused on 6 SNPs (rs380284, rs4464148, rs4779584, rs4939827, rs6983267, and rs10795668), already described as risk markers, and tested their possible independent and combined contribution to CRC predisposition.Material and Methods.DNA was collected and genotyped from 2330 unselected consecutive CRC cases and controls from Estonia (166 cases and controls), Latvia (81 cases and controls), Lithuania (123 cases and controls), and Poland (795 cases and controls).Results.Beyond individual effects, the analysis revealed statistically significant linear cumulative effects for these 6 markers for all samples except of the Latvian one (correctedPvalue = 0.018 for the Estonian, correctedPvalue = 0.0034 for the Lithuanian, and correctedPvalue = 0.0076 for the Polish sample).Conclusions.The significant linear cumulat...
    Previous results from our genetic analyses using pedigrees from a French Canadian population suggested that the interval delimited by markers on chromosome 12, D12S86 and D12S378, was the most probable genomic region to contain a... more
    Previous results from our genetic analyses using pedigrees from a French Canadian population suggested that the interval delimited by markers on chromosome 12, D12S86 and D12S378, was the most probable genomic region to contain a susceptibility gene for affective disorders. Association studies with microsatellite markers using a case/control sample from the same population (n = 427) revealed significant allelic associations between the bipolar phenotype and marker NBG6. Since this marker is located in intron 9 of the P2RX7 gene, we analyzed the surrounding genomic region for the presence of polymorphisms in regulatory, coding and intron/exon junction sequences. Twenty four (24) SNPs were genotyped in a case/control sample and 12 SNPs in all pedigrees used for linkage analysis. Allelic, genotypic or family-based association studies suggest the presence of two susceptibility loci, the P2RX7 and CaMKK2 genes. The strongest association was observed in bipolar families at the non-synonymous SNP P2RX7-E13A (rs2230912, P-value = 0.000708), which results from an over-transmission of the mutant G-allele to affected offspring. This Gln460Arg polymorphism occurs at an amino acid that is conserved between humans and rodents and is located in the C-terminal domain of the P2X7 receptor, known to be essential for normal P2RX7 function.
    Sex-specific genetic effects have been proposed to be an important source of variation for human complex traits. Here we use two distinct genome-wide methods to estimate the autosomal genetic correlation (rg) between men and women for... more
    Sex-specific genetic effects have been proposed to be an important source of variation for human complex traits. Here we use two distinct genome-wide methods to estimate the autosomal genetic correlation (rg) between men and women for human height and body mass index, using individual-level (n=∼44,000) and summary-level (n=∼133,000) data from genome-wide association studies. Results are consistent and show that the between-sex genetic correlation is not significantly different from unity for both traits. In contrast, we find evidence of genetic heterogeneity between sexes for waist-hip-ratio (rg=∼0.7) and between populations for BMI (rg=∼0.9 between Europe and the USA) but not for height. The lack of evidence for substantial genetic heterogeneity for body size is consistent with empirical findings across traits and species.
    Disease incidences increase with age, but the molecular characteristics of ageing that lead to increased disease susceptibility remain inadequately understood. Here we perform a whole-blood gene expression meta-analysis in 14,983... more
    Disease incidences increase with age, but the molecular characteristics of ageing that lead to increased disease susceptibility remain inadequately understood. Here we perform a whole-blood gene expression meta-analysis in 14,983 individuals of European ancestry (including replication) and identify 1,497 genes that are differentially expressed with chronological age. The age-associated genes do not harbor more age-associated CpG-methylation sites than other genes, but are instead enriched for the presence of potentially functional CpG-methylation sites in enhancer and insulator regions that associate with both chronological age and gene expression levels. We further used the gene expression profiles to calculate the…
    The current study examined the relationship between the Five-Factor Model personality traits and physician-confirmed peptic ulcer disease (PUD) diagnosis in a large population-based adult sample, controlling for the relevant behavioral... more
    The current study examined the relationship between the Five-Factor Model personality traits and physician-confirmed peptic ulcer disease (PUD) diagnosis in a large population-based adult sample, controlling for the relevant behavioral and sociodemographic factors. Personality traits were assessed by participants themselves and by knowledgeable informants using the NEO Personality Inventory-3 (NEO PI-3). When controlling for age, sex, education, and cigarette smoking, only one of the five NEO PI-3 domain scales - higher Neuroticism - and two facet scales - lower A1: Trust and higher C1: Competence - made a small, yet significant contribution (p < 0.01) to predicting PUD in logistic regression analyses. In the light of these relatively modest associations, our findings imply that it is certain behavior (such as smoking) and sociodemographic variables (such as age, gender, and education) rather than personality traits that are associated with the diagnosis of PUD at a particular point in time. Further prospective studies with a longitudinal design and multiple assessments would be needed to fully understand if the FFM personality traits serve as risk factors for the development of PUD.
    We carried out a trans-ancestry genome-wide association and replication study of blood pressure phenotypes among up to 320,251 individuals of East Asian, European and South Asian ancestry. We find genetic variants at 12 new loci to be... more
    We carried out a trans-ancestry genome-wide association and replication study of blood pressure phenotypes among up to 320,251 individuals of East Asian, European and South Asian ancestry. We find genetic variants at 12 new loci to be associated with blood pressure (P = 3.9 × 10(-11) to 5.0 × 10(-21)). The sentinel blood pressure SNPs are enriched for association with DNA methylation at multiple nearby CpG sites, suggesting that, at some of the loci identified, DNA methylation may lie on the regulatory pathway linking sequence variation to blood pressure. The sentinel SNPs at the 12 new loci point to genes involved in vascular smooth muscle (IGFBP3, KCNK3, PDE3A and PRDM6) and renal (ARHGAP24, OSR1, SLC22A7 and TBX2) function. The new and known genetic variants predict increased left ventricular mass, circulating levels of NT-proBNP, and cardiovascular and all-cause mortality (P = 0.04 to 8.6 × 10(-6)). Our results provide new evidence for the role of DNA methylation in blood pressu...
    We propose a method (GREML-LDMS) to estimate heritability for human complex traits in unrelated individuals using whole-genome sequencing data. We demonstrate using simulations based on whole-genome sequencing data that ∼97% and ∼68% of... more
    We propose a method (GREML-LDMS) to estimate heritability for human complex traits in unrelated individuals using whole-genome sequencing data. We demonstrate using simulations based on whole-genome sequencing data that ∼97% and ∼68% of variation at common and rare variants, respectively, can be captured by imputation. Using the GREML-LDMS method, we estimate from 44,126 unrelated individuals that all ∼17 million imputed variants explain 56% (standard error (s.e.) = 2.3%) of variance for height and 27% (s.e. = 2.5%) of variance for body mass index (BMI), and we find evidence that height- and BMI-associated variants have been under natural selection. Considering the imperfect tagging of imputation and potential overestimation of heritability from previous family-based studies, heritability is likely to be 60-70% for height and 30-40% for BMI. Therefore, the missing heritability is small for both traits. For further discovery of genes associated with complex traits, a study design wit...
    A wealth of biospecimen samples are stored in modern globally distributed biobanks. Biomedical researchers worldwide need to be able to combine the available resources to improve the power of large-scale studies. A prerequisite for this... more
    A wealth of biospecimen samples are stored in modern globally distributed biobanks. Biomedical researchers worldwide need to be able to combine the available resources to improve the power of large-scale studies. A prerequisite for this effort is to be able to search and access phenotypic, clinical and other information about samples that are currently stored at biobanks in an integrated manner. However, privacy issues together with heterogeneous information systems and the lack of agreed-upon vocabularies have made specimen searching across multiple biobanks extremely challenging. We describe three case studies where we have linked samples and sample descriptions in order to facilitate global searching of available samples for research. The use cases include the ENGAGE (European Network for Genetic and Genomic Epidemiology) consortium comprising at least 39 cohorts, the SUMMIT (surrogate markers for micro- and macro-vascular hard endpoints for innovative diabetes tools) consortium ...
    Metformin is used as a first-line oral treatment for type 2 diabetes (T2D). However, the underlying mechanism is not fully understood. Here, we aimed to comprehensively investigate the pleiotropic effects of metformin. We analyzed both... more
    Metformin is used as a first-line oral treatment for type 2 diabetes (T2D). However, the underlying mechanism is not fully understood. Here, we aimed to comprehensively investigate the pleiotropic effects of metformin. We analyzed both metabolomic and genomic data of the population-based KORA cohort. To evaluate the effect of metformin treatment on metabolite concentrations, we quantified 131 metabolites in fasting serum samples and used multivariable linear regression models in three independent cross-sectional studies (n = 151 patients with T2D treated with metformin [mt-T2D]). Additionally, we used linear mixed-effect models to study the longitudinal KORA samples (n = 912) and performed mediation analyses to investigate the effects of metformin intake on blood lipid profiles. We combined genotyping data with the identified metformin-associated metabolites in KORA individuals (n = 1,809) and explored the underlying pathways. We found significantly lower (P < 5.0E-06) concentrat...
    More than 100 loci have been identified for age at menarche by genome-wide association studies; however, collectively these explain only ∼3% of the trait variance. Here we test two overlooked sources of variation in 192,974 European... more
    More than 100 loci have been identified for age at menarche by genome-wide association studies; however, collectively these explain only ∼3% of the trait variance. Here we test two overlooked sources of variation in 192,974 European ancestry women: low-frequency protein-coding variants and X-chromosome variants. Five missense/nonsense variants (in ALMS1/LAMB2/TNRC6A/TACR3/PRKAG1) are associated with age at menarche (minor allele frequencies 0.08-4.6%; effect sizes 0.08-1.25 years per allele;…
    Hundreds of genes reside in structurally complex, poorly understood regions of the human genome. One such region contains the three amylase genes (AMY2B, AMY2A and AMY1) responsible for digesting starch into sugar. Copy number of AMY1 is... more
    Hundreds of genes reside in structurally complex, poorly understood regions of the human genome. One such region contains the three amylase genes (AMY2B, AMY2A and AMY1) responsible for digesting starch into sugar. Copy number of AMY1 is reported to be the largest genomic influence on obesity, although genome-wide association studies for obesity have found this locus unremarkable. Using whole-genome sequence analysis, droplet digital PCR and genome mapping, we identified eight common structural haplotypes of the amylase locus that suggest its mutational history. We found that the AMY1 copy number in an individual's genome is generally even (rather than odd) and partially correlates with nearby SNPs, which do not associate with body mass index (BMI). We measured amylase gene copy number in 1,000 obese or lean Estonians and in 2 other cohorts totaling ∼3,500 individuals. We had 99% power to detect the lower bound of the reported effects on BMI, yet found no association.
    The association of copy number variations (CNVs), differing numbers of copies of genetic sequence at locations in the genome, with phenotypes such as intellectual disability has been almost exclusively evaluated using clinically... more
    The association of copy number variations (CNVs), differing numbers of copies of genetic sequence at locations in the genome, with phenotypes such as intellectual disability has been almost exclusively evaluated using clinically ascertained cohorts. The contribution of these genetic variants to cognitive phenotypes in the general population remains unclear. To investigate the clinical features conferred by CNVs associated with known syndromes in adult carriers without clinical preselection and to assess the genome-wide consequences of rare CNVs (frequency ≤0.05%; size ≥250 kilobase pairs [kb]) on carriers' educational attainment and intellectual disability prevalence in the general population. The population biobank of Estonia contains 52 000 participants enrolled from 2002 through 2010. General practitioners examined participants and filled out a questionnaire of health- and lifestyle-related questions, as well as reported diagnoses. Copy number variant analysis was conducted o...
    The Estonian Biobank and several other biobanks established over a decade ago are now starting to yield valuable longitudinal follow-up data for large numbers of individuals. These samples have been used in hundreds of different... more
    The Estonian Biobank and several other biobanks established over a decade ago are now starting to yield valuable longitudinal follow-up data for large numbers of individuals. These samples have been used in hundreds of different genome-wide association studies, resulting in the identification of reliable disease-associated variants. The focus of genomic research has started to shift from identifying genetic and nongenetic risk factors associated with common complex diseases to understanding the underlying mechanisms of the diseases and suggesting novel targets for therapy. However, translation of findings from genomic research into medical practice is still lagging, mainly due to insufficient evidence of clinical validity and utility. In this review, we examine the different elements required for the implementation of personalized medicine based on genomic information. First, biobanks and genome centres are required and have been established for the high-throughput genomic screening...
    Genome-wide association studies (GWAS) have uncovered numerous genetic variants (SNPs) that are associated with blood pressure (BP). Genetic variants may lead to BP changes by acting on intermediate molecular phenotypes such as coded... more
    Genome-wide association studies (GWAS) have uncovered numerous genetic variants (SNPs) that are associated with blood pressure (BP). Genetic variants may lead to BP changes by acting on intermediate molecular phenotypes such as coded protein sequence or gene expression, which in turn affect BP variability. Therefore, characterizing genes whose expression is associated with BP may reveal cellular processes involved in BP regulation and uncover how transcripts mediate genetic and environmental effects on BP variability. A meta-analysis of results from six studies of global gene expression profiles of BP and hypertension in whole blood was performed in 7017 individuals who were not receiving antihypertensive drug treatment. We identified 34 genes that were differentially expressed in relation to BP (Bonferroni-corrected p<0.05). Among these genes, FOS and PTGS2 have been previously reported to be involved in BP-related processes; the others are novel. The top BP signature genes in a...
    For human complex traits, non-additive genetic variation has been invoked to explain... more
    For human complex traits, non-additive genetic variation has been invoked to explain "missing…
    Research for the use of biomarkers in osteoarthritis (OA) is promising, however, adequate discrimination between patients and controls may be hampered due to innate differences. We set out to identify loci influencing levels of serum... more
    Research for the use of biomarkers in osteoarthritis (OA) is promising, however, adequate discrimination between patients and controls may be hampered due to innate differences. We set out to identify loci influencing levels of serum cartilage oligomeric protein (sCOMP) and urinary C-telopeptide of type II collagen (uCTX-II). Meta-analysis of genome-wide association studies was applied to standardised residuals of sCOMP (N=3316) and uCTX-II (N=4654) levels available in 6 and 7 studies, respectively, from TreatOA. Effects were estimated using a fixed-effects model. Six promising signals were followed up by de novo genotyping in the Cohort Hip and Cohort Knee study (N = 964). Subsequently, their role in OA susceptibility was investigated in large-scale genome-wide association studies meta-analyses for OA. Differential expression of annotated genes was assessed in cartilage. Genome-wide significant association with sCOMP levels was found for a SNP within MRC1 (rs691461, p = 1.7 × 10(-1...
    ABSTRACT Replying to A. R. Wood et al. 514, http://dx.doi.org/10.1038/nature13691 (2014).We thank Wood et al. for their interesting observations and although their proposed mechanism does not explain all our reported results, we... more
    ABSTRACT Replying to A. R. Wood et al. 514, http://dx.doi.org/10.1038/nature13691 (2014).We thank Wood et al. for their interesting observations and although their proposed mechanism does not explain all our reported results, we acknowledge that alternative mechanisms could be behind the observation of epistatic signals. Although we replicate our results in large, independent samples, 19/30 of our reported interactions (Table 1 in ref. 2), Wood et al. do not replicate in the InCHIANTI data set (n = 450) at a type-I error rate of 0.05/30 = 0.002, including none of our reported cis-trans interactions. Having insufficient data to replicate the discovery interactions makes it problematic to draw firm conclusions on the reported cis-trans effects.
    Given the anthropometric differences between men and women and previous evidence of sex-difference in genetic effects, we conducted a genome-wide search for sexually dimorphic associations with height, weight, body mass index, waist... more
    Given the anthropometric differences between men and women and previous evidence of sex-difference in genetic effects, we conducted a genome-wide search for sexually dimorphic associations with height, weight, body mass index, waist circumference, hip circumference, and waist-to-hip-ratio (133,723 individuals) and took forward 348 SNPs into follow-up (additional 137,052 individuals) in a total of 94 studies. Seven loci displayed significant sex-difference…
    Triglycerides are transported in plasma by specific triglyceride-rich lipoproteins; in epidemiological studies, increased triglyceride levels correlate with higher risk for coronary artery disease (CAD). However, it is unclear whether... more
    Triglycerides are transported in plasma by specific triglyceride-rich lipoproteins; in epidemiological studies, increased triglyceride levels correlate with higher risk for coronary artery disease (CAD). However, it is unclear whether this association reflects causal processes. We used 185 common variants recently mapped for plasma lipids (P…
    To further understanding of the genetic basis of type 2 diabetes (T2D) susceptibility, we aggregated published meta-analyses of genome-wide association studies (GWAS), including 26,488 cases and 83,964 controls of European, east Asian,... more
    To further understanding of the genetic basis of type 2 diabetes (T2D) susceptibility, we aggregated published meta-analyses of genome-wide association studies (GWAS), including 26,488 cases and 83,964 controls of European, east Asian, south Asian and Mexican and Mexican American ancestry. We observed a significant excess in the directional consistency of T2D risk alleles across ancestry groups, even at SNPs demonstrating only weak evidence of association. By following up the strongest signals of association from the trans-ethnic meta-analysis in an additional 21,491 cases and 55,647 controls of European ancestry, we identified seven new T2D susceptibility loci. Furthermore, we observed considerable improvements in the fine-mapping resolution of common variant association signals at several T2D susceptibility loci. These observations highlight the benefits of trans-ethnic GWAS for the discovery and characterization of complex trait loci and emphasize an exciting opportunity to extend insight into the genetic architecture and pathogenesis of human diseases across populations of diverse ancestry.

    And 81 more