ABSTRACTThe deletion of the metabolizing Glutathione S-transferase Mu 1 (GSTM1) gene was previous... more ABSTRACTThe deletion of the metabolizing Glutathione S-transferase Mu 1 (GSTM1) gene was previously associated with multiple cancers, metabolic and autoimmune disorders, as well as drug response. It is unusually common, with allele frequency reaching up to 75% in some human populations. Such high allele frequency of a derived allele with apparent impact on an otherwise conserved gene is a rare phenomenon. To investigate the evolutionary history of this locus, we analyzed 310 genomes using population genetics tools. Our analysis revealed a surprising lack of linkage disequilibrium between the deletion and the flanking single nucleotide variants in this locus, indicating gene conversion events. Tests that measure extended homozygosity and rapid change in allele frequency identified signatures of an incomplete soft-sweep in the locus. Using empirical approaches, we identified the Tanuki haplogroup, which carries the GSTM1 deletion and is found in approximately 70% of East Asian chromos...
The study of ancient genomes has burgeoned at an incredible rate in the last decade. The result i... more The study of ancient genomes has burgeoned at an incredible rate in the last decade. The result is a shift in archaeological narratives, bringing with it a fierce debate on the place of genetics in anthropological research. Archaeogenomics has challenged and scrutinized fundamental themes of anthropological research, including human origins, movement of ancient and modern populations, the role of social organization in shaping material culture, and the relationship between culture, language, and ancestry. Moreover, the discussion has inevitably invoked new debates on indigenous rights, ownership of ancient materials, inclusion in the scientific process, and even the meaning of what it is to be a human. We argue that the broad and seemingly daunting ethical, methodological, and theoretical challenges posed by archaeogenomics, in fact, represent the very cutting edge of social science research. Here, we provide a general review of the field by introducing the contemporary discussion p...
The time, extent, and genomic impact of the introgressions from archaic humans into ancestors of ... more The time, extent, and genomic impact of the introgressions from archaic humans into ancestors of extant human populations remain one of the most exciting venues of population genetics research in the last decade. Several studies have shown population-specific signatures of introgression events from Neanderthals, Denisovans, and potentially other unknown hominin populations in different human groups. Moreover, it was shown that these introgression events may have contributed to phenotypic variation in extant humans, with biomedical and evolutionary consequences. In this study, we present a comprehensive analysis of the unusually divergent haplotypes in the Eurasian genomes and showed that they can be traced back to multiple introgression events. In parallel, we document hundreds of deletion polymorphisms shared with Neanderthals. A locus-specific analysis of one such shared deletion suggests the existence of a direct introgression event from the Altai Neanderthal lineage into the anc...
ABSTRACTSalivary proteins facilitate food perception and digestion, maintain the integrity of the... more ABSTRACTSalivary proteins facilitate food perception and digestion, maintain the integrity of the mineralized tooth and oral epithelial surfaces, and shield the oro-digestive tract from environmental hazards and invading pathogens. Saliva, as one of the easiest to collect body fluids, also serves in diagnostic applications, with its proteins providing a window to body health. However, despite the availability of the human saliva proteome, the origins of individual proteins remain unclear. To bridge this gap, we analyzed the transcriptomes of 27 tissue samples derived from the three major types of human adult and fetal salivary glands and integrated these data with the saliva proteome and the proteomes and transcriptomes of 28+ other human organs, with tissue expression confirmed by 3D microscopy. Using these tools, we have linked saliva proteins to their source for the first time, an outcome with significant implications for basic research and diagnostic applications. Furthermore, o...
CD36 was identified as a core replicative senescence gene and a potential mediator of this proces... more CD36 was identified as a core replicative senescence gene and a potential mediator of this process through membrane remodeling.
The amylase gene (AMY), which codes for a starch-digesting enzyme in animals, underwent several g... more The amylase gene (AMY), which codes for a starch-digesting enzyme in animals, underwent several gene copy number gains in humans1, dogs2, and mice3, presumably along with increased starch consumption during the evolution of these species. Here we present evidence for additional AMY copy number expansions in several mammalian species, most of which also consume starch-rich diets. We also show that these independent AMY copy number gains are often accompanied by a gain in enzymatic activity of amylase in saliva. We used multi-species coalescent modeling to provide further evidence that these recurrent AMY gene copy number expansions were adaptive. Our findings underscore the overall importance of gene copy number amplification as a flexible and fast adaptive mechanism in evolution that can independently occur in different branches of the phylogeny.
SummaryCellular senescence, the irreversible ceasing of cell division, has been associated with o... more SummaryCellular senescence, the irreversible ceasing of cell division, has been associated with organismal aging, prevention of cancerogenesis, and developmental processes. As such, the evolutionary basis and biological features of cellular senescence remain a fascinating area of research. In this study, we conducted comparative RNAseq experiments to detect genes associated with replicative senescence in two different human cell lines and at different time points. We identified 841 and 900 genes (core senescence-associated genes) that are significantly up- and downregulated in senescent cells, respectively, in both cell lines. Our functional enrichment analysis showed that downregulated core genes are primarily involved in cell cycle processes while upregulated core gene enrichment indicated various lipid-related processes. We further demonstrated that downregulated genes are significantly more conserved than upregulated genes. Using both transcriptomics and genetic variation data, ...
Genomic structural variants (SVs) are distributed nonrandomly across the human genome. These &quo... more Genomic structural variants (SVs) are distributed nonrandomly across the human genome. These "hotspots" have been implicated in critical evolutionary innovations, as well as serious medical conditions. However, the evolutionary and biomedical features of these hotspots remain incompletely understood. In this study, we analyzed data from 2,504 genomes from the 1000 Genomes Project Consortium and constructed a refined map of 1,148 SV hotspots in human genomes. By studying the genomic architecture of these hotspots, we found that both nonallelic homologous recombination and non-homologous mechanisms act as mechanistic drivers of SV formation. We found that the majority of SV hotspots are within gene-poor regions and evolve under relaxed negative selection or neutrality. However, we found that a small subset of SV hotspots harbor genes that are enriched for anthropologically crucial functions, including blood oxygen transport, olfaction, synapse assembly, and antigen binding. ...
The amylase gene (AMY), which codes for a starch-digesting enzyme in animals, underwent several g... more The amylase gene (AMY), which codes for a starch-digesting enzyme in animals, underwent several gene copy number gains in humans (Perry et al., 2007), dogs (Axelsson et al., 2013), and mice (Schibler et al., 1982), possibly along with increased starch consumption during the evolution of these species. Here, we present comprehensive evidence for AMY copy number expansions that independently occurred in several mammalian species which consume diets rich in starch. We also provide correlative evidence that AMY gene duplications may be an essential first step for amylase to be expressed in saliva. Our findings underscore the overall importance of gene copy number amplification as a flexible and fast evolutionary mechanism that can independently occur in different branches of the phylogeny.
Bipolar disorder is a highly heritable mental illness, but the relevant genetic variants and mole... more Bipolar disorder is a highly heritable mental illness, but the relevant genetic variants and molecular mechanisms are largely unknown. Recent GWASs have identified an intergenic region associated with both intelligence and bipolar disorder. This region contains dozens of putative fetal brain-specific enhancers and is located ~0.7 Mb upstream of the neuronal transcription factor POU3F2. We identified a candidate causal variant, rs77910749, that falls within a highly conserved putative enhancer, LC1. This human-specific variant is a single-base deletion in a PAX6 binding site and is predicted to be functional. We hypothesized that rs77910749 alters LC1 activity and hence POU3F2 expression during neurodevelopment. Indeed, transgenic reporter mice demonstrated LC1 activity in the developing cerebral cortex and amygdala. Furthermore, ex vivo reporter assays in embryonic mouse brain and human iPSC-derived cerebral organoids revealed increased enhancer activity conferred by the variant. To...
Polymorphic duplications in humans have been shown to contribute to phenotypic diversity. However... more Polymorphic duplications in humans have been shown to contribute to phenotypic diversity. However, the evolutionary forces that maintain variable duplications across the human genome are largely unexplored. To understand the haplotypic architecture of the derived duplications, we developed a linkage-disequilibrium based method to detect insertion sites of polymorphic duplications not represented in reference genomes. This method also allows resolution of haplotypes harboring the duplications. Using this approach, we conducted genome-wide analyses and identified the insertion sites of 22 common polymorphic duplications. We found that the majority of these duplications are intrachromosomal and only one of them is an interchromosomal insertion. Further characterization of these duplications revealed significant associations to blood and skin phenotypes. Based on population genetics analyses, we found that the partial duplication of a well-characterized pigmentation-related gene, HERC2,...
ABSTRACTThe deletion of the metabolizing Glutathione S-transferase Mu 1 (GSTM1) gene was previous... more ABSTRACTThe deletion of the metabolizing Glutathione S-transferase Mu 1 (GSTM1) gene was previously associated with multiple cancers, metabolic and autoimmune disorders, as well as drug response. It is unusually common, with allele frequency reaching up to 75% in some human populations. Such high allele frequency of a derived allele with apparent impact on an otherwise conserved gene is a rare phenomenon. To investigate the evolutionary history of this locus, we analyzed 310 genomes using population genetics tools. Our analysis revealed a surprising lack of linkage disequilibrium between the deletion and the flanking single nucleotide variants in this locus, indicating gene conversion events. Tests that measure extended homozygosity and rapid change in allele frequency identified signatures of an incomplete soft-sweep in the locus. Using empirical approaches, we identified the Tanuki haplogroup, which carries the GSTM1 deletion and is found in approximately 70% of East Asian chromos...
The study of ancient genomes has burgeoned at an incredible rate in the last decade. The result i... more The study of ancient genomes has burgeoned at an incredible rate in the last decade. The result is a shift in archaeological narratives, bringing with it a fierce debate on the place of genetics in anthropological research. Archaeogenomics has challenged and scrutinized fundamental themes of anthropological research, including human origins, movement of ancient and modern populations, the role of social organization in shaping material culture, and the relationship between culture, language, and ancestry. Moreover, the discussion has inevitably invoked new debates on indigenous rights, ownership of ancient materials, inclusion in the scientific process, and even the meaning of what it is to be a human. We argue that the broad and seemingly daunting ethical, methodological, and theoretical challenges posed by archaeogenomics, in fact, represent the very cutting edge of social science research. Here, we provide a general review of the field by introducing the contemporary discussion p...
The time, extent, and genomic impact of the introgressions from archaic humans into ancestors of ... more The time, extent, and genomic impact of the introgressions from archaic humans into ancestors of extant human populations remain one of the most exciting venues of population genetics research in the last decade. Several studies have shown population-specific signatures of introgression events from Neanderthals, Denisovans, and potentially other unknown hominin populations in different human groups. Moreover, it was shown that these introgression events may have contributed to phenotypic variation in extant humans, with biomedical and evolutionary consequences. In this study, we present a comprehensive analysis of the unusually divergent haplotypes in the Eurasian genomes and showed that they can be traced back to multiple introgression events. In parallel, we document hundreds of deletion polymorphisms shared with Neanderthals. A locus-specific analysis of one such shared deletion suggests the existence of a direct introgression event from the Altai Neanderthal lineage into the anc...
ABSTRACTSalivary proteins facilitate food perception and digestion, maintain the integrity of the... more ABSTRACTSalivary proteins facilitate food perception and digestion, maintain the integrity of the mineralized tooth and oral epithelial surfaces, and shield the oro-digestive tract from environmental hazards and invading pathogens. Saliva, as one of the easiest to collect body fluids, also serves in diagnostic applications, with its proteins providing a window to body health. However, despite the availability of the human saliva proteome, the origins of individual proteins remain unclear. To bridge this gap, we analyzed the transcriptomes of 27 tissue samples derived from the three major types of human adult and fetal salivary glands and integrated these data with the saliva proteome and the proteomes and transcriptomes of 28+ other human organs, with tissue expression confirmed by 3D microscopy. Using these tools, we have linked saliva proteins to their source for the first time, an outcome with significant implications for basic research and diagnostic applications. Furthermore, o...
CD36 was identified as a core replicative senescence gene and a potential mediator of this proces... more CD36 was identified as a core replicative senescence gene and a potential mediator of this process through membrane remodeling.
The amylase gene (AMY), which codes for a starch-digesting enzyme in animals, underwent several g... more The amylase gene (AMY), which codes for a starch-digesting enzyme in animals, underwent several gene copy number gains in humans1, dogs2, and mice3, presumably along with increased starch consumption during the evolution of these species. Here we present evidence for additional AMY copy number expansions in several mammalian species, most of which also consume starch-rich diets. We also show that these independent AMY copy number gains are often accompanied by a gain in enzymatic activity of amylase in saliva. We used multi-species coalescent modeling to provide further evidence that these recurrent AMY gene copy number expansions were adaptive. Our findings underscore the overall importance of gene copy number amplification as a flexible and fast adaptive mechanism in evolution that can independently occur in different branches of the phylogeny.
SummaryCellular senescence, the irreversible ceasing of cell division, has been associated with o... more SummaryCellular senescence, the irreversible ceasing of cell division, has been associated with organismal aging, prevention of cancerogenesis, and developmental processes. As such, the evolutionary basis and biological features of cellular senescence remain a fascinating area of research. In this study, we conducted comparative RNAseq experiments to detect genes associated with replicative senescence in two different human cell lines and at different time points. We identified 841 and 900 genes (core senescence-associated genes) that are significantly up- and downregulated in senescent cells, respectively, in both cell lines. Our functional enrichment analysis showed that downregulated core genes are primarily involved in cell cycle processes while upregulated core gene enrichment indicated various lipid-related processes. We further demonstrated that downregulated genes are significantly more conserved than upregulated genes. Using both transcriptomics and genetic variation data, ...
Genomic structural variants (SVs) are distributed nonrandomly across the human genome. These &quo... more Genomic structural variants (SVs) are distributed nonrandomly across the human genome. These "hotspots" have been implicated in critical evolutionary innovations, as well as serious medical conditions. However, the evolutionary and biomedical features of these hotspots remain incompletely understood. In this study, we analyzed data from 2,504 genomes from the 1000 Genomes Project Consortium and constructed a refined map of 1,148 SV hotspots in human genomes. By studying the genomic architecture of these hotspots, we found that both nonallelic homologous recombination and non-homologous mechanisms act as mechanistic drivers of SV formation. We found that the majority of SV hotspots are within gene-poor regions and evolve under relaxed negative selection or neutrality. However, we found that a small subset of SV hotspots harbor genes that are enriched for anthropologically crucial functions, including blood oxygen transport, olfaction, synapse assembly, and antigen binding. ...
The amylase gene (AMY), which codes for a starch-digesting enzyme in animals, underwent several g... more The amylase gene (AMY), which codes for a starch-digesting enzyme in animals, underwent several gene copy number gains in humans (Perry et al., 2007), dogs (Axelsson et al., 2013), and mice (Schibler et al., 1982), possibly along with increased starch consumption during the evolution of these species. Here, we present comprehensive evidence for AMY copy number expansions that independently occurred in several mammalian species which consume diets rich in starch. We also provide correlative evidence that AMY gene duplications may be an essential first step for amylase to be expressed in saliva. Our findings underscore the overall importance of gene copy number amplification as a flexible and fast evolutionary mechanism that can independently occur in different branches of the phylogeny.
Bipolar disorder is a highly heritable mental illness, but the relevant genetic variants and mole... more Bipolar disorder is a highly heritable mental illness, but the relevant genetic variants and molecular mechanisms are largely unknown. Recent GWASs have identified an intergenic region associated with both intelligence and bipolar disorder. This region contains dozens of putative fetal brain-specific enhancers and is located ~0.7 Mb upstream of the neuronal transcription factor POU3F2. We identified a candidate causal variant, rs77910749, that falls within a highly conserved putative enhancer, LC1. This human-specific variant is a single-base deletion in a PAX6 binding site and is predicted to be functional. We hypothesized that rs77910749 alters LC1 activity and hence POU3F2 expression during neurodevelopment. Indeed, transgenic reporter mice demonstrated LC1 activity in the developing cerebral cortex and amygdala. Furthermore, ex vivo reporter assays in embryonic mouse brain and human iPSC-derived cerebral organoids revealed increased enhancer activity conferred by the variant. To...
Polymorphic duplications in humans have been shown to contribute to phenotypic diversity. However... more Polymorphic duplications in humans have been shown to contribute to phenotypic diversity. However, the evolutionary forces that maintain variable duplications across the human genome are largely unexplored. To understand the haplotypic architecture of the derived duplications, we developed a linkage-disequilibrium based method to detect insertion sites of polymorphic duplications not represented in reference genomes. This method also allows resolution of haplotypes harboring the duplications. Using this approach, we conducted genome-wide analyses and identified the insertion sites of 22 common polymorphic duplications. We found that the majority of these duplications are intrachromosomal and only one of them is an interchromosomal insertion. Further characterization of these duplications revealed significant associations to blood and skin phenotypes. Based on population genetics analyses, we found that the partial duplication of a well-characterized pigmentation-related gene, HERC2,...
Uploads
Papers by Omer Gokcumen