Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
© 2008 Nature Publishing Group http://www.nature.com/naturegenetics B R I E F C O M M U N I C AT I O N S Amyotrophic lateral sclerosis (ALS) is a severely disabling and lethal disorder caused by progressive degeneration of motor neurons in the brain, spinal cord and brainstem. ALS affects 1–3 per 100,000 people, and the average survival time is three years from disease onset. To date, no effective treatment is available1. Familial ALS accounts for up to 10% of ALS cases, with approximately 20% of familial ALS cases linked to mutations in SOD1. Mutations in ALS2, DCTN1, VAPB and ANG have been found in rare cases of familial ALS2. Sporadic ALS accounts for 490% of ALS cases and is considered to be a multifactorial disease with an estimated heritability ranging from 0.38 to 0.85 (ref. 3). Variants in several genes, including ANG4, VEGF5, HFE6 and PON1 (ref. 7), and copy number variations in SMN1 and SMN2 (ref. 8) have been reported to be associated with ALS susceptibility. However, attempts to replicate these findings in other populations have frequently failed. For instance, sequence variations in ANG are reported to be associated with ALS in Irish and Scottish populations but rarely in English, Swedish or Italian populations. Similarly, mutations in SOD1 are found in 12%– 23% of families with ALS in the United States, the UK, Germany, Sweden and Belgium, but they are rare in families with ALS in Portugal, Switzerland and The Netherlands (F.B. and P.M.A., unpublished data). Sporadic as well as familial ALS therefore seems to be a genetically heterogeneous disease, even across European populations. Genetic variation in DPP6 is associated with susceptibility to amyotrophic lateral sclerosis Michael A van Es1,15, Paul WJ van Vught1,15, Hylke M Blauw1,15, Lude Franke2,15, Christiaan GJ Saris1, Ludo Van Den Bosch3, Sonja W de Jong1, Vianney de Jong4, Frank Baas5, Ruben van’t Slot2, Robin Lemmens3, Helenius J Schelhaas6, Anna Birve7, Kristel Sleegers8,9, Christine Van Broeckhoven8,9, Jennifer C Schymick10, Bryan J Traynor11, John HJ Wokke1, Cisca Wijmenga2,12, Wim Robberecht3, Peter M Andersen7, Jan H Veldink1, Roel A Ophoff13,14 & Leonard H van den Berg1 We identified a SNP in the DPP6 gene that is consistently strongly associated with susceptibility to amyotrophic lateral sclerosis (ALS) in different populations of European ancestry, with an overall P value of 5.04  108 in 1,767 cases and 1,916 healthy controls and with an odds ratio of 1.30 (95% confidence interval (CI) of 1.18–1.43). Our finding is the first report of a genome-wide significant association with sporadic ALS and may be a target for future functional studies. Table 1 Descriptive statistics and results for SNP rs10260404 MAFa n HWEb Cases Controls Cases Controls Cases Controls P valuec OR (95% CI)d Stage I Netherlands USA Stage I combinede 461 276 737 450 271 721 0.44 0.42 0.43 0.37 0.34 0.36 0.48 0.28 0.39 0.78 0.06 0.19 0.006 0.003 4.30  105 1.30 (1.08–1.56) 1.45 (1.13–1.86) 1.34 (1.08–1.65) Stage II Netherlands Sweden Belgium Stage II combinede 272 467 291 1,030 336 439 420 1,195 0.42 0.4 0.4 0.41 0.37 0.34 0.35 0.35 0.18 0.26 0.43 0.11 0.51 0.23 0.39 0.06 0.04 0.006 0.11 0.0002 1.26 1.31 1.21 1.26 Stages I + II combinede 1,767 1,916 0.42 0.35 0.33 0.38 5.40  108 1.30 (1.18–1.43) (1.01–1.58) (1.08–1.58) (0.96–1.51) (1.11–1.42) aMinor allele frequencies for cases and controls for each population. bHardy-Weinberg equilibrium P values for cases and controls for each population. cP values were calculated for each individual population using w2 test on allele counts. dOdds ratios (OR) were calculated for the minor allele in each population; 95% confidence intervals are shown in parentheses. eP values and ORs using data from multiple populations were calculated using the Mantel-Haenszel method. 1Department of Neurology, Rudolf Magnus Institute of Neuroscience and 2Complex Genetics Section, Department of Biomedical Genetics, University Medical Center Utrecht, Utrecht 3584 CX, The Netherlands. 3Department of Neurology, University Hospital Gasthuisberg, Leuven B-3000, Belgium. 4Departments of Neurology and 5Neurogenetics, Academic Medical Center, Amsterdam 1105 AZ, The Netherlands. 6Department of Neurology, Radboud University Nijmegen Medical Centre, Nijmegen 6525 GA, The Netherlands. 7Institute of Clinical Neuroscience, Umeå University Hospital, Umeå SE-901 85, Sweden. 8Neurodegenerative Brain Diseases Group, Department of Molecular Genetics, VIB, Antwerpen B-2610, Belgium. 9University of Antwerp, Antwerpen B-2610, Belgium. 10Laboratory of Neurogenetics, National Institute of Aging, National Institutes of Health, Bethesda, Maryland 20892, USA. 11Section on Developmental Genetic Epidemiology, National Institute of Mental Health, Bethesda, Maryland 20892, USA. 12Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen 9700 RB, The Netherlands. 13Department of Medical Genetics and Rudolf Magnus Institute of Neuroscience, University Medical Center Utrecht, Utrecht 3584 CX, The Netherlands. 14Neuropsychiatric Institute, University of California, Los Angeles, California 90095, USA. 15These authors contributed equally to this work. Correspondence should be addressed to L.H.vdB. (L.H.vandenBerg@umcutrecht.nl) or R.A.O. (Ophoff@ucla.edu). Received 26 June; accepted 16 October; published online 16 December 2007; doi:10.1038/ng.2007.52 NATURE GENETICS VOLUME 40 [ NUMBER 1 [ JANUARY 2008 29 B R I E F C O M M U N I C AT I O N S 153.4 Mb 153.6 Mb 153.8 Mb 154.0 Mb Figure 1 Schematic overview of DPP6. P values from the combined analysis of the two genomewide studies are shown for all SNPs in a 900-kb region surrounding rs10260404. rs7803828, located distal to rs10260404, had a lower P value in the combined analysis of the GWAs but did not fulfill the initial criteria for SNP selection (P o 0.01 in both GWAs). Subsequent analysis of seven SNPs in the associated 50-kb locus (r2 4 0.8) showed the lowest allelic P value for rs10260404 at P ¼ 5.04  108. The Bonferroni-corrected genome-wide significance level was set at P ¼ 0.05/311,946 ¼ 1.6  107. 154.2 Mb DPP6 0.00001 Allele frequency P value rs10260404 0.0001 0.001 0.01 0.05 0.1 1 –400 –300 –200 –100 0 100 200 300 Distance to rs10260404 (kb) 400 500 Netherlands, 467 cases and 437 controls from Sweden and 291 cases and 420 controls from Belgium (Supplementary Methods 10–8 Joint analysis of USA, and Supplementary Table 1 online). Overall Dutch, Belgian, Swedish Bonferroni-corrected 10–7 power for the study is shown in Supplemenand second Dutch cohort genome-wide significance tary Table 2 online. threshold 10–6 Genotyping of these 15 SNPs was done 10–5 with Taqman technology (Supplementary 10–4 Methods). We included 100 randomly 10–3 selected individuals from our Dutch genome10–2 wide association (GWA) study sample for 10–1 TaqMan genotyping of the 15 SNPs and 1 observed a concordance rate of 499.6% between platforms. Before this analysis, we examined whether population stratification was present in the available GWA data Linkage disequilibrium (D′) from the Dutch and US sample series using 89 82 99 98 75 99 Eigenstrat and did not detect any (see 76 95 99 67 94 Supplementary Methods and Supplemen76 59 82 69 tary Figure 1a,b online). 84 49 94 Because the sample series in our study were 59 51 Coloring (r 2) derived from different populations, we calcu56 lated overall P values and odds ratios (OR) 0 0.5 1 using the Mantel-Haenszel method as well as the w2 test on allele counts for all 15 SNPs. To identify previously unknown ALS susceptibility genes, we carried Only one SNP, rs10260404, showed genome-wide significance after out a genome-wide association study using Illumina 300K Beadchips Bonferroni correction for the 311,946 SNPs tested in the first stage. (Supplementary Methods online). After stringent quality control, we The overall P value for rs10260404 was 5.04  108 (corrected carried out association analysis on 311,946 SNPs in 461 affected P ¼ 0.017) with an odds ratio of 1.30 (95% CI ¼ 1.18–1.43) using individuals (cases) and 450 healthy controls matched in age, gender the w2 test and 5.40  108 with an OR of 1.30 (95% CI ¼ 1.18–1.43) and ethnicity from The Netherlands. The overall call rate was 99.5%. using the Mantel-Haenszel method10. The association for rs10260404 Results for all 311,946 SNPs are available online (http://www. was slightly more significant under a genotypic model (Cochranalscentrum.nl/index.php?id¼GWA.) We did not observe any genome- Armitage trend test), with a P value of 3.30  108 and with an wide significant association with ALS after Bonferroni correction for increased disease susceptibility for homozygote carriers of the risk allele multiple testing. (OR ¼ 1.60 with 95% CI ¼ 1.32–1.92) compared to heterozygotes Recently, first-stage data from a genome-wide association study of (OR ¼ 1.20 with 95% CI ¼ 1.06–1.41) in a dose-dependent manner. 276 ALS cases and 271 controls from the United States was released. P values and odds ratios for each individual population are shown No genome-wide significant findings were observed in this study9. in Table 1. The minor allele frequency for rs10260404 was 42% for Considering the genetic heterogeneity of ALS and the fact that both cases compared to 35% for controls. Results for all 15 SNPs analyzed studies were conducted with relatively small sample sizes, we hypothe- in stage 2 are shown in Supplementary Table 3 online. Rs10260404 maps to a 50-kb linkage disequilibrium (LD) block on sized that signals from truly associated SNPs might be present, although weak. We therefore decided to combine both datasets and chromosome 7q36 (r2 4 0.8), within a gene encoding dipeptidyl to follow up on all SNPs that had P o 0.01 in each study peptidase 6 (DPP6; Fig. 1). Combining the two GWA sets, we found independently and unidirectional allelic association (that is, associa- several SNPs within this 50-kb block that showed association with tion in the same direction of the allele associated). Fifteen SNPs disease at P o 0.01. To rule out LD beyond this 50-kb block, we fulfilled these criteria and were analyzed in three additional indepen- re-examined 130 SNPs in a 900-kb region surrounding rs10260404 dent populations consisting of 272 cases and 336 controls from The and did not find any SNP to be associated at P o 0.01 (Fig. 1). 30 39 43 8 27 90 rs rs 90 27 82 4 rs 78 03 73 3 88 15 rs 13 52 54 04 04 rs 26 10 rs rs 10 23 97 94 Allele frequency P value © 2008 Nature Publishing Group http://www.nature.com/naturegenetics Initial GWA allele frequency P value in Dutch and USA samples VOLUME 40 [ NUMBER 1 [ JANUARY 2008 NATURE GENETICS © 2008 Nature Publishing Group http://www.nature.com/naturegenetics B R I E F C O M M U N I C AT I O N S Comparison of LD structure in this 900-kb region showed similar haplotype structure in the Dutch, US and HapMap CEPH sample datasets (Supplementary Fig. 2a online). Further examination of the associated 50-kb LD block also indicated that similar LD structure is present in both the Dutch and US population (Supplementary Fig. 2b,c). It is therefore unlikely that the initial finding of ALS association is due to genetic variation outside the 50-kb LD block containing rs10260404. To fine-map the associated 50-kb LD block and carry out haplotype analyses, we additionally genotyped all SNPs (n ¼ 6) within this block that showed an association with disease at P o 0.01 in the combined analysis of both genome-wide studies. Genotyping of these six SNPs was done with Taqman technology (Supplementary Methods). Single-SNP analysis of these six additionally genotyped SNPs did not show any SNP to be associated more significantly than rs10260404 (Supplementary Table 4 online). We then applied a recently developed multimarker indirect association method that takes advantage of the correlation structure between SNPs in the HapMap sample (weighted haplotype analysis (WHAP); http://whap.cs.ucla.edu/) using rs10260404 and the additional six flanking SNPs11. Using this imputation method, we again identified the strongest association signal for rs10260404, with P ¼ 6.69  108 (Supplementary Methods and Supplementary Table 5 online). Subsequent haplotype analysis with Haploview, using the ‘solid spine of LD’ method to define haplotypes, showed the strongest association signal for a haplotype containing the CC alleles of flanking SNPs rs10239794 and rs10260404, with a P value of 3.01  109 and an allelic OR of 1.34 (95% CI ¼ 1.17–1.54; Supplementary Fig. 3 online). Results from examining long-range LD, fine mapping (including imputation analysis) and haplotype analysis all indicated that the strongest signal for association hinges on the ‘C’ allele of rs10260404, suggesting that the underlying variation for disease susceptibility is at this site. Because the entire associated 50-kb LD block containing rs10260404 is located within intron 3 of DPP6, and there are no known genes or microRNAs nearby, we consider this to be the putative ALS-associated gene (Fig. 1). DPP6 is located on chromosome 7q36 at location 153,380,839– 154,315,627 (Build 35). It consists of 26 exons and is 954 kb in size (OMIM 126141). DPP6 (also known as DPPX) encodes a dipeptidylpeptidase-like protein expressed predominantly in the brain, with very high expression in the amygdala, cingulate cortex, cerebellum and parietal lobe (http://symatlas.gnf.org/SymAtlas). This peptidase regulates the biological activity of neuropeptides by converting precursors to active forms or vice versa12. DPP6 binds specific voltagegated potassium channels and alters their expression and biophysical properties. Notably, differential DPP6 gene expression has been linked to spinal cord injury in rats13, and DPP6 was also identified as a nervous system–specific gene with accelerated evolutionary rate in the primate lineage14. In conclusion, we identify genetic variation in the DPP6 gene that is highly associated with ALS susceptibility in a combined sample of 1,767 cases and 1,916 healthy control subjects from European descent. The identified SNP, rs10260404, is located within an intron of DPP6, and no known functional variants within the gene have been yet identified. Further study will provide insight into genetic variation at NATURE GENETICS VOLUME 40 View publication stats [ NUMBER 1 [ JANUARY 2008 this locus, its potential effect on gene function and, ultimately, its role in disease susceptibility. Identification of a common variant within DPP6 is an exciting first step in the genetic study of sporadic ALS, and it opens up new avenues for studying the molecular basis of this devastating disease. Note: Supplementary information is available on the Nature Genetics website. ACKNOWLEDGMENTS We are indebted to the individuals and their families who participated in this project. This project has been generously supported by The Netherlands Organisation for Scientific Research (NWO) and the ‘‘Prinses Beatrix Fonds’’ (L.H.vdB.). We would also like to thank H. Kersten and M. Kersten for their generous support (L.H.vdB.) as well as J.R. van Dijk and the Adessium foundation (L.H.vdB.), the US National Institutes of Health grants GM68875 and MH078075 (R.A.O.), the Kempe Foundation (P.M.A.), the Swedish Brain Research Foundation and Bertil Hållsten (P.M.A.), the Björklund Foundation for ALS Research (P.M.A.), the Interuniversity Attraction Pole Programme P6/43 (Belgian Science Policy Office) (W.R., L.V.D.B. and C.V.B.) and the E. von Behring Chair for Neuromuscular and Neurodegenerative Disorders (W.R.). C.V.B., W.R. and L.V.D.B. are supported by the Fund for Scientific Research Flanders (FWO-F), and K.S. holds a postdoctoral fellowship of the FWO-F. P.M.A. and A.B. are supported by the ‘Swedish Brain Power Foundation’. This study used data from the SNP Database at the US National Institute of Neurological Disorders and Stroke Human Genetics Resource Center DNA and Cell Line Repository (http://ccr.coriell.org/ninds/). The authors thank E. Strengman, P. Sodaar, H. Veldman, H. Yigittop, W. Scheveneels, A. D’hondt, P. Tilkin and A. Nilsson for assistance with genotyping and DNA preparation. We also thank F.G. Jennekens and G. Hille Ris Lambers for helping with the DNA sample collection. AUTHOR CONTRIBUTIONS M.A.vE., P.W.J.vV., H.M.B. and L.F. contributed equally to this study. M.A.vE., P.W.J.vV., H.M.B. and R.vS. participated in the Illumina and TaqMan SNP genotyping and data analysis. M.A.vE., J.H.V. and L.F. were involved in the design of the study, handled genotype data and performed statistical analyses. M.A.vE., H.M.B., C.G.J.S., P.M.A., L.V.D.B., S.W.dJ., A.B., R.L., V.dJ., F.B., H.J.S., K.S., C.V.B., J.H.J.W., C.W. and W.R. were responsible for DNA collection and clinical characterization of affected individuals in the study. J.C.S. and B.J.T. obtained all DNA samples from the United States and performed genotyping experiments and analysis on these samples. M.A.vE. drafted the manuscript. R.A.O. and L.H.vdB. are lead investigators and contributed equally to this work. They designed and supervised the study and contributed in the writing of the manuscript. All authors participated in the critical revisions of the manuscript. Published online at http://www.nature.com/naturegenetics Reprints and permissions information is available online at http://npg.nature.com/ reprintsandpermissions 1. Rowland, L.P. & Shneider, N.A. N. Engl. J. Med. 344, 1688–1700 (2001). 2. Pasinelli, P. & Brown, R.H. Nat. Rev. Neurosci. 7, 710–723 (2006). 3. Graham, A.J., Macdonald, A.M. & Hawkes, C.H. J. Neurol. Neurosurg. Psychiatry 62, 562–569 (1997). 4. Greenway, M.J. et al. Nat. Genet. 38, 411–413 (2006). 5. Lambrechts, D. et al. Nat. Genet. 34, 383–394 (2003). 6. Sutedja, N.A. et al. Arch. Neurol. 64, 63–67 (2007). 7. Saeed, M. et al. Neurology 67, 771–776 (2006). 8. Veldink, J.H. et al. Neurology 65, 820–825 (2005). 9. Schymick, J.C. et al. Lancet Neurol. 6, 322–328 (2007). 10. Skol, A.D., Scott, L.J., Abecasis, G.R. & Boehnke, M. Nat. Genet. 38, 209–213 (2006). 11. Zaitlen, N., Kang, H.M., Eskin, E. & Halperin, E. Am. J. Hum. Genet. 80, 683–691 (2007). 12. Wada, K. et al. Mamm. Genome 4, 234–237 (1993). 13. Tachibana, T., Noguchi, K. & Ruda, M.A. Neurosci. Lett. 327, 133–137 (2002). 14. Dorus, S. et al. Cell 119, 1027–1040 (2004). 31