The genome-wide association study (GWAS) is a powerful approach for studying the genetic complexi... more The genome-wide association study (GWAS) is a powerful approach for studying the genetic complexities of human disease. Unfortunately, GWASs often fail to identify clinically significant associations and describing function can be a challenge. GWAS is a phenotype-to-genotype approach. It is now possible to conduct a converse genotype-to-phenotype approach using extensive electronic medical records to define a phenome. This approach associates a single genetic variant with many phenotypes across the phenome and is called a phenome-wide association study (PheWAS). The majority of PheWASs conducted have focused on variants identified previously by GWASs. This approach has been efficient for rediscovering gene-disease associations while also identifying pleiotropic effects for some single-nucleotide polymorphisms (SNPs). However, the use of SNPs identified by GWAS in a PheWAS is limited by the inherent properties of the GWAS SNPs, including weak effect sizes and difficulty when translating discoveries to function. To address these challenges, we conducted a PheWAS on 105 presumed functional stop-gain and stop-loss variants genotyped on 4235 Marshfield Clinic patients. Associations were validated on an additional 10 640 Marshfield Clinic patients. PheWAS results indicate that a nonsense variant in ARMS2 (rs2736911) is associated with age-related macular degeneration (AMD). These results demonstrate that focusing on functional variants may be an effective approach when conducting a PheWAS.European Journal of Human Genetics advance online publication, 30 July 2014; doi:10.1038/ejhg.2014.123.
Translation of results from genetic findings to inform medical practice is a highly anticipated g... more Translation of results from genetic findings to inform medical practice is a highly anticipated goal of human genetics. The aim of this paper is to review and discuss the role of genetics in medically-relevant prediction. Germline genetics presages disease onset and therefore can contribute prognostic signals that augment laboratory tests and clinical features. As such, the impact of genetic-based predictive models on clinical decisions and therapy choice could be profound. However, given that (i) medical traits result from a complex interplay between genetic and environmental factors, (ii) the underlying genetic architectures for susceptibility to common diseases are not well-understood, and (iii) replicable susceptibility alleles, in combination, account for only a moderate amount of disease heritability, there are substantial challenges to constructing and implementing genetic risk prediction models with high utility. In spite of these challenges, concerted progress has continued...
Investigative ophthalmology & visual science, Jan 20, 2014
We examined the role of DNA copy number variants (CNVs) of known glaucoma genes in relation to pr... more We examined the role of DNA copy number variants (CNVs) of known glaucoma genes in relation to primary open angle glaucoma (POAG). Our study included DNA samples from two studies (NEIGHBOR and GLAUGEN). All the samples were genotyped with the Illumina Human660W_Quad_v1 BeadChip. After removing non-blood-derived and amplified DNA samples, we applied quality control steps based on the mean Log R Ratio and the mean B allele frequency. Subsequently, data from 3057 DNA samples (1599 cases and 1458 controls) were analyzed with PennCNV software. We defined CNVs as those ≥5 kilobases (kb) in size and interrogated by ≥5 consecutive probes. We further limited our investigation to CNVs in known POAG-related genes, including CDKN2B-AS1, TMCO1, SIX1/SIX6, CAV1/CAV2, the LRP12-ZFPM2 region, GAS7, ATOH7, FNDC3B, CYP1B1, MYOC, OPTN, WDR36, SRBD1, TBK1, and GALC. Genomic duplications of CDKN2B-AS1 and TMCO1 were each found in a single case. Two cases carried duplications in the GAS7 region. Genomic ...
Whole-genome sequencing (WGS) and whole-exome sequencing (WES) technologies are increasingly used... more Whole-genome sequencing (WGS) and whole-exome sequencing (WES) technologies are increasingly used to identify disease-contributing mutations in human genomic studies. It can be a significant challenge to process such data, especially when a large family or cohort is sequenced. Our objective was to develop a big data toolset to efficiently manipulate genome-wide variants, functional annotations and coverage, together with conducting family based sequencing data analysis. Hadoop is a framework for reliable, scalable, distributed processing of large data sets using MapReduce programming models. Based on Hadoop and HBase, we developed SeqHBase, a big data-based toolset for analysing family based sequencing data to detect de novo, inherited homozygous, or compound heterozygous mutations that may contribute to disease manifestations. SeqHBase takes as input BAM files (for coverage at every site), variant call format (VCF) files (for variant calls) and functional annotations (for variant p...
Phenome-wide association studies (PheWAS) have demonstrated utility in validating genetic associa... more Phenome-wide association studies (PheWAS) have demonstrated utility in validating genetic associations derived from traditional genetic studies as well as identifying novel genetic associations. Here we used an electronic health record (EHR)-based PheWAS to explore pleiotropy of genetic variants in the fat mass and obesity associated gene (FTO), some of which have been previously associated with obesity and type 2 diabetes (T2D). We used a population of 10,487 individuals of European ancestry with genome-wide genotyping from the Electronic Medical Records and Genomics (eMERGE) Network and another population of 13,711 individuals of European ancestry from the BioVU DNA biobank at Vanderbilt genotyped using Illumina HumanExome BeadChip. A meta-analysis of the two study populations replicated the well-described associations between FTO variants and obesity (odds ratio [OR] = 1.25, 95% Confidence Interval = 1.11-1.24, p = 2.10 × 10(-9)) and FTO variants and T2D (OR = 1.14, 95% CI = 1.08-1.21, p = 2.34 × 10(-6)). The meta-analysis also demonstrated that FTO variant rs8050136 was significantly associated with sleep apnea (OR = 1.14, 95% CI = 1.07-1.22, p = 3.33 × 10(-5)); however, the association was attenuated after adjustment for body mass index (BMI). Novel phenotype associations with obesity-associated FTO variants included fibrocystic breast disease (rs9941349, OR = 0.81, 95% CI = 0.74-0.91, p = 5.41 × 10(-5)) and trends toward associations with non-alcoholic liver disease and gram-positive bacterial infections. FTO variants not associated with obesity demonstrated other potential disease associations including non-inflammatory disorders of the cervix and chronic periodontitis. These results suggest that genetic variants in FTO may have pleiotropic associations, some of which are not mediated by obesity.
Hermansky-Pudlak Syndrome (HPS) is a rare, autosomal recessive disorder that is characterized by ... more Hermansky-Pudlak Syndrome (HPS) is a rare, autosomal recessive disorder that is characterized by oculocutaneous albinism, a predisposition to mild bleeding caused by storage-pool deficient platelets, and a ceroid storage disorder. A gene responsible for HPS in Puerto Rico maps to chromosome 10q2 and isolation of the gene has been reported. We have now identified a variant HPS cDNA that contains the same 5' sequence as the published HPS gene and a unique 3' sequence. Analysis of genomic DNA suggests that the two cDNA are derived from alternative transcripts of a single gene; two polyadenylated transcripts were found in normal human melanocytes, human bone marrow cells, human melanoma cells, lymphoblastoid cell lines, and megakaryocytic leukemia cells by reverse transcriptase polymerase chain reaction and northern analysis. The splicing exhibited by this gene is identical to the splicing found to produce two alternative transcripts of the Chediak-Higashi Syndrome gene, another pigment disorder exhibiting platelet storage pool deficiency. These studies show that the HPS gene on chromosome 10 is complex and may have more than one biologically active transcript.
In previous studies, we characterized a 2.7-kb interstitial deletion allele of the P gene associa... more In previous studies, we characterized a 2.7-kb interstitial deletion allele of the P gene associated with tyrosinase-positive oculocutaneous albinism (OCA2) in African Americans and Africans. In this study, we investigated the frequency of this allele among OCA2 subjects in two African countries, Zimbabwe and Cameroon. The deletion allele was most common in Zimbabwe, comprising nearly all (92%) mutant alleles, which is the highest incidence reported so far. In addition, the deletion allele was widespread but less common among OCA2 Cameroonians and accounted for 65% of the mutant alleles.
Proceedings of The National Academy of Sciences, 2000
The mouse p locus encodes a gene that functions in normal pigmentation. We have characterized a r... more The mouse p locus encodes a gene that functions in normal pigmentation. We have characterized a radiation-induced mutant allele of the mouse p locus that is associated with a failure-to-thrive syndrome, in addition to diminished pigmentation. Mice homozygous for this mutant allele, p100H, show delayed growth and die within 2 wk after birth. We have discovered that the mutant mice
Albinism is associated with disrupted foveal development, though intersubject variability is beco... more Albinism is associated with disrupted foveal development, though intersubject variability is becoming appreciated. We sought to quantify this variability, and examine the relationship between foveal cone specialization and pit morphology in patients with a clinical diagnosis of albinism. We recruited 32 subjects with a clinical diagnosis of albinism. DNA was obtained from 25 subjects, and known albinism genes were analyzed for mutations. Relative inner and outer segment (IS and OS) lengthening (fovea-to-perifovea ratio) was determined from manually segmented spectral domain-optical coherence tomography (SD-OCT) B-scans. Foveal pit morphology was quantified for eight subjects from macular SD-OCT volumes. Ten subjects underwent imaging with adaptive optics scanning light ophthalmoscopy (AOSLO), and cone density was measured. We found mutations in 22 of 25 subjects, including five novel mutations. All subjects lacked complete excavation of inner retinal layers at the fovea, though four subjects had foveal pits with normal diameter and/or volume. Peak cone density and OS lengthening were variable and overlapped with that observed in normal controls. A fifth hyper-reflective band was observed in the outer retina on SD-OCT in the majority of the subjects with albinism. Foveal cone specialization and pit morphology vary greatly in albinism. Normal cone packing was observed in the absence of a foveal pit, suggesting a pit is not required for packing to occur. The degree to which retinal anatomy correlates with genotype or visual function remains unclear, and future examination of larger patient groups will provide important insight on this issue.
Dopamine β-hydroxylase (DBH) catalyzes the conversion of dopamine to norepinephrine in the centra... more Dopamine β-hydroxylase (DBH) catalyzes the conversion of dopamine to norepinephrine in the central nervous system and peripherally. DBH variants are associated with large changes in circulating DBH and implicated in multiple disorders; yet causal relationships and tissue-specific effects remain unresolved. To characterize regulatory variants in DBH, effect on mRNA expression, and role in modulating sympathetic tone and disease risk. Analysis of DBH mRNA in human tissues confirmed high expression in the locus coeruleus and adrenal gland, but also in sympathetically innervated organs (liver>lung>heart). Allele-specific mRNA assays revealed pronounced allelic expression differences in the liver (2- to 11-fold) attributable to promoter rs1611115 and exon 2 rs1108580, but only small differences in locus coeruleus and adrenals. These alleles were also associated with significantly reduced mRNA expression in liver and lung. Although DBH protein is expressed in other sympathetically innervated organs, mRNA levels were too low for analysis. In mice, hepatic Dbh mRNA levels correlated with cardiovascular risk phenotypes. The minor alleles of rs1611115 and rs1108580 were associated with sympathetic phenotypes, including angina pectoris. Testing combined effects of these variants suggested protection against myocardial infarction in 3 separate clinical cohorts. We demonstrate profound effects of DBH variants on expression in 2 sympathetically innervated organs, liver and lung, but not in adrenals and brain. Preliminary results demonstrate an association of these variants with clinical phenotypes responsive to peripheral sympathetic tone. We hypothesize that in addition to endocrine effects via circulating DBH and norepinephrine, the variants act in sympathetically innervated target organs.
The genome-wide association study (GWAS) is a powerful approach for studying the genetic complexi... more The genome-wide association study (GWAS) is a powerful approach for studying the genetic complexities of human disease. Unfortunately, GWASs often fail to identify clinically significant associations and describing function can be a challenge. GWAS is a phenotype-to-genotype approach. It is now possible to conduct a converse genotype-to-phenotype approach using extensive electronic medical records to define a phenome. This approach associates a single genetic variant with many phenotypes across the phenome and is called a phenome-wide association study (PheWAS). The majority of PheWASs conducted have focused on variants identified previously by GWASs. This approach has been efficient for rediscovering gene-disease associations while also identifying pleiotropic effects for some single-nucleotide polymorphisms (SNPs). However, the use of SNPs identified by GWAS in a PheWAS is limited by the inherent properties of the GWAS SNPs, including weak effect sizes and difficulty when translating discoveries to function. To address these challenges, we conducted a PheWAS on 105 presumed functional stop-gain and stop-loss variants genotyped on 4235 Marshfield Clinic patients. Associations were validated on an additional 10 640 Marshfield Clinic patients. PheWAS results indicate that a nonsense variant in ARMS2 (rs2736911) is associated with age-related macular degeneration (AMD). These results demonstrate that focusing on functional variants may be an effective approach when conducting a PheWAS.European Journal of Human Genetics advance online publication, 30 July 2014; doi:10.1038/ejhg.2014.123.
Translation of results from genetic findings to inform medical practice is a highly anticipated g... more Translation of results from genetic findings to inform medical practice is a highly anticipated goal of human genetics. The aim of this paper is to review and discuss the role of genetics in medically-relevant prediction. Germline genetics presages disease onset and therefore can contribute prognostic signals that augment laboratory tests and clinical features. As such, the impact of genetic-based predictive models on clinical decisions and therapy choice could be profound. However, given that (i) medical traits result from a complex interplay between genetic and environmental factors, (ii) the underlying genetic architectures for susceptibility to common diseases are not well-understood, and (iii) replicable susceptibility alleles, in combination, account for only a moderate amount of disease heritability, there are substantial challenges to constructing and implementing genetic risk prediction models with high utility. In spite of these challenges, concerted progress has continued...
Investigative ophthalmology & visual science, Jan 20, 2014
We examined the role of DNA copy number variants (CNVs) of known glaucoma genes in relation to pr... more We examined the role of DNA copy number variants (CNVs) of known glaucoma genes in relation to primary open angle glaucoma (POAG). Our study included DNA samples from two studies (NEIGHBOR and GLAUGEN). All the samples were genotyped with the Illumina Human660W_Quad_v1 BeadChip. After removing non-blood-derived and amplified DNA samples, we applied quality control steps based on the mean Log R Ratio and the mean B allele frequency. Subsequently, data from 3057 DNA samples (1599 cases and 1458 controls) were analyzed with PennCNV software. We defined CNVs as those ≥5 kilobases (kb) in size and interrogated by ≥5 consecutive probes. We further limited our investigation to CNVs in known POAG-related genes, including CDKN2B-AS1, TMCO1, SIX1/SIX6, CAV1/CAV2, the LRP12-ZFPM2 region, GAS7, ATOH7, FNDC3B, CYP1B1, MYOC, OPTN, WDR36, SRBD1, TBK1, and GALC. Genomic duplications of CDKN2B-AS1 and TMCO1 were each found in a single case. Two cases carried duplications in the GAS7 region. Genomic ...
Whole-genome sequencing (WGS) and whole-exome sequencing (WES) technologies are increasingly used... more Whole-genome sequencing (WGS) and whole-exome sequencing (WES) technologies are increasingly used to identify disease-contributing mutations in human genomic studies. It can be a significant challenge to process such data, especially when a large family or cohort is sequenced. Our objective was to develop a big data toolset to efficiently manipulate genome-wide variants, functional annotations and coverage, together with conducting family based sequencing data analysis. Hadoop is a framework for reliable, scalable, distributed processing of large data sets using MapReduce programming models. Based on Hadoop and HBase, we developed SeqHBase, a big data-based toolset for analysing family based sequencing data to detect de novo, inherited homozygous, or compound heterozygous mutations that may contribute to disease manifestations. SeqHBase takes as input BAM files (for coverage at every site), variant call format (VCF) files (for variant calls) and functional annotations (for variant p...
Phenome-wide association studies (PheWAS) have demonstrated utility in validating genetic associa... more Phenome-wide association studies (PheWAS) have demonstrated utility in validating genetic associations derived from traditional genetic studies as well as identifying novel genetic associations. Here we used an electronic health record (EHR)-based PheWAS to explore pleiotropy of genetic variants in the fat mass and obesity associated gene (FTO), some of which have been previously associated with obesity and type 2 diabetes (T2D). We used a population of 10,487 individuals of European ancestry with genome-wide genotyping from the Electronic Medical Records and Genomics (eMERGE) Network and another population of 13,711 individuals of European ancestry from the BioVU DNA biobank at Vanderbilt genotyped using Illumina HumanExome BeadChip. A meta-analysis of the two study populations replicated the well-described associations between FTO variants and obesity (odds ratio [OR] = 1.25, 95% Confidence Interval = 1.11-1.24, p = 2.10 × 10(-9)) and FTO variants and T2D (OR = 1.14, 95% CI = 1.08-1.21, p = 2.34 × 10(-6)). The meta-analysis also demonstrated that FTO variant rs8050136 was significantly associated with sleep apnea (OR = 1.14, 95% CI = 1.07-1.22, p = 3.33 × 10(-5)); however, the association was attenuated after adjustment for body mass index (BMI). Novel phenotype associations with obesity-associated FTO variants included fibrocystic breast disease (rs9941349, OR = 0.81, 95% CI = 0.74-0.91, p = 5.41 × 10(-5)) and trends toward associations with non-alcoholic liver disease and gram-positive bacterial infections. FTO variants not associated with obesity demonstrated other potential disease associations including non-inflammatory disorders of the cervix and chronic periodontitis. These results suggest that genetic variants in FTO may have pleiotropic associations, some of which are not mediated by obesity.
Hermansky-Pudlak Syndrome (HPS) is a rare, autosomal recessive disorder that is characterized by ... more Hermansky-Pudlak Syndrome (HPS) is a rare, autosomal recessive disorder that is characterized by oculocutaneous albinism, a predisposition to mild bleeding caused by storage-pool deficient platelets, and a ceroid storage disorder. A gene responsible for HPS in Puerto Rico maps to chromosome 10q2 and isolation of the gene has been reported. We have now identified a variant HPS cDNA that contains the same 5' sequence as the published HPS gene and a unique 3' sequence. Analysis of genomic DNA suggests that the two cDNA are derived from alternative transcripts of a single gene; two polyadenylated transcripts were found in normal human melanocytes, human bone marrow cells, human melanoma cells, lymphoblastoid cell lines, and megakaryocytic leukemia cells by reverse transcriptase polymerase chain reaction and northern analysis. The splicing exhibited by this gene is identical to the splicing found to produce two alternative transcripts of the Chediak-Higashi Syndrome gene, another pigment disorder exhibiting platelet storage pool deficiency. These studies show that the HPS gene on chromosome 10 is complex and may have more than one biologically active transcript.
In previous studies, we characterized a 2.7-kb interstitial deletion allele of the P gene associa... more In previous studies, we characterized a 2.7-kb interstitial deletion allele of the P gene associated with tyrosinase-positive oculocutaneous albinism (OCA2) in African Americans and Africans. In this study, we investigated the frequency of this allele among OCA2 subjects in two African countries, Zimbabwe and Cameroon. The deletion allele was most common in Zimbabwe, comprising nearly all (92%) mutant alleles, which is the highest incidence reported so far. In addition, the deletion allele was widespread but less common among OCA2 Cameroonians and accounted for 65% of the mutant alleles.
Proceedings of The National Academy of Sciences, 2000
The mouse p locus encodes a gene that functions in normal pigmentation. We have characterized a r... more The mouse p locus encodes a gene that functions in normal pigmentation. We have characterized a radiation-induced mutant allele of the mouse p locus that is associated with a failure-to-thrive syndrome, in addition to diminished pigmentation. Mice homozygous for this mutant allele, p100H, show delayed growth and die within 2 wk after birth. We have discovered that the mutant mice
Albinism is associated with disrupted foveal development, though intersubject variability is beco... more Albinism is associated with disrupted foveal development, though intersubject variability is becoming appreciated. We sought to quantify this variability, and examine the relationship between foveal cone specialization and pit morphology in patients with a clinical diagnosis of albinism. We recruited 32 subjects with a clinical diagnosis of albinism. DNA was obtained from 25 subjects, and known albinism genes were analyzed for mutations. Relative inner and outer segment (IS and OS) lengthening (fovea-to-perifovea ratio) was determined from manually segmented spectral domain-optical coherence tomography (SD-OCT) B-scans. Foveal pit morphology was quantified for eight subjects from macular SD-OCT volumes. Ten subjects underwent imaging with adaptive optics scanning light ophthalmoscopy (AOSLO), and cone density was measured. We found mutations in 22 of 25 subjects, including five novel mutations. All subjects lacked complete excavation of inner retinal layers at the fovea, though four subjects had foveal pits with normal diameter and/or volume. Peak cone density and OS lengthening were variable and overlapped with that observed in normal controls. A fifth hyper-reflective band was observed in the outer retina on SD-OCT in the majority of the subjects with albinism. Foveal cone specialization and pit morphology vary greatly in albinism. Normal cone packing was observed in the absence of a foveal pit, suggesting a pit is not required for packing to occur. The degree to which retinal anatomy correlates with genotype or visual function remains unclear, and future examination of larger patient groups will provide important insight on this issue.
Dopamine β-hydroxylase (DBH) catalyzes the conversion of dopamine to norepinephrine in the centra... more Dopamine β-hydroxylase (DBH) catalyzes the conversion of dopamine to norepinephrine in the central nervous system and peripherally. DBH variants are associated with large changes in circulating DBH and implicated in multiple disorders; yet causal relationships and tissue-specific effects remain unresolved. To characterize regulatory variants in DBH, effect on mRNA expression, and role in modulating sympathetic tone and disease risk. Analysis of DBH mRNA in human tissues confirmed high expression in the locus coeruleus and adrenal gland, but also in sympathetically innervated organs (liver>lung>heart). Allele-specific mRNA assays revealed pronounced allelic expression differences in the liver (2- to 11-fold) attributable to promoter rs1611115 and exon 2 rs1108580, but only small differences in locus coeruleus and adrenals. These alleles were also associated with significantly reduced mRNA expression in liver and lung. Although DBH protein is expressed in other sympathetically innervated organs, mRNA levels were too low for analysis. In mice, hepatic Dbh mRNA levels correlated with cardiovascular risk phenotypes. The minor alleles of rs1611115 and rs1108580 were associated with sympathetic phenotypes, including angina pectoris. Testing combined effects of these variants suggested protection against myocardial infarction in 3 separate clinical cohorts. We demonstrate profound effects of DBH variants on expression in 2 sympathetically innervated organs, liver and lung, but not in adrenals and brain. Preliminary results demonstrate an association of these variants with clinical phenotypes responsive to peripheral sympathetic tone. We hypothesize that in addition to endocrine effects via circulating DBH and norepinephrine, the variants act in sympathetically innervated target organs.
Uploads