GATA3 is a highly conserved, essential transcription factor expressed in a number of tissues, inc... more GATA3 is a highly conserved, essential transcription factor expressed in a number of tissues, including the mammary gland. GATA3 expression is required for normal development of the mammary gland where it is estimated to be the most abundant transcription factor in luminal epithelial cells. In breast cancer, GATA3 expression is highly correlated with the luminal transcriptional program. Recent genomic analysis of human breast cancers has revealed high-frequency mutation in GATA3 in luminal tumors, suggesting "driver" function(s). Here we discuss mutation of GATA3 in breast cancer and the potential mechanism(s) by which mutation may lead to a growth advantage in cancer.
LIN28 is an evolutionarily conserved RNA-binding protein with critical functions in developmental... more LIN28 is an evolutionarily conserved RNA-binding protein with critical functions in developmental timing and cancer. However molecular mechanisms underlying LIN28's oncogenic properties are yet to be described. RIP-Seq analysis revealed significant LIN28 binding within 843 mRNAs in breast cancer cells. Many of the LIN28 bound mRNAs are implicated in the regulation of RNA and cell metabolism. We identify heterogeneous nuclear ribonucleoprotein A1 (hnRNP A1), a protein with multiple roles in mRNA metabolism, as a LIN28 interacting partner. Subsequently, we use a custom computational method to identify differentially spliced gene isoforms in LIN28 and hnRNP A1 siRNA-treated cells. Results reveal these proteins regulate alternative splicing and steady state mRNA expression of genes implicated in aspects of breast cancer biology. Notably, cells lacking LIN28 undergo significant isoform switching of the ENAH gene, resulting in a decrease in the expression of ENAH exon 11a isoform. Exp...
A comprehensive update of the classification of all available kinases was carried out. This surve... more A comprehensive update of the classification of all available kinases was carried out. This survey presents a complete global picture of this large functional class of proteins and confirms the soundness of our initial kinase classification scheme. The new survey found the total number of kinase sequences in the protein database has increased more than three-fold (from 17,310 to 59,402), and the number of determined kinase structures increased two-fold (from 359 to 702) in the past three years. However, the framework of the original two-tier classification scheme (in families and fold groups) remains sufficient to describe all available kinases. Overall, the kinase sequences were classified into 25 families of homologous proteins, wherein 22 families (approximately 98.8% of all sequences) for which three-dimensional structures are known fall into 10 fold groups. These fold groups not only include some of the most widely spread proteins folds, such as the Rossmann-like fold, ferredox...
Nicotinamide/Nicotinate mononucleotide (NMN/NaMN) adenylyltransferase is an indispensable enzyme ... more Nicotinamide/Nicotinate mononucleotide (NMN/NaMN) adenylyltransferase is an indispensable enzyme in both de novo biosynthesis and salvage of NAD+ and NADP+. In prokaryotes, it is absolutely required for cell survival, thus representing an attractive target for the development of new broad-spectrum antibacteria inhibitors. The crystal structures of E. coli NaMN adenylyltransferase (NMNAT) and its complex with deamido-NAD (NaAD) revealed that ligand binding causes large conformational changes in several loop regions around the active site. The enzyme specifically recognizes the deamidated pyridine nucleotide through interactions between nicotinate carboxylate with several protein main chain amides and a positive helix dipole. Comparison of E. coli NMNAT with those from archaeal organisms revealed extensive differences in the active site architecture, enzyme-ligand interaction mode, and bound dinucleotide conformations. The bacterial NaMN adenylyltransferase structures described here provide a foundation for structure-based design of specific inhibitors that may have therapeutic potential.
The Mi-2/nucleosome remodeling and histone deacetylase (NuRD) complex is a multiprotein machine p... more The Mi-2/nucleosome remodeling and histone deacetylase (NuRD) complex is a multiprotein machine proposed to regulate chromatin structure by nucleosome remodeling and histone deacetylation activities. Recent reports describing localization of NuRD provide new insights that question previous models on NuRD action, but are not in complete agreement. Here, we provide location analysis of endogenous MBD3, a component of NuRD complex, in two human breast cancer cell lines (MCF-7 and MDA-MB-231) using two independent genomic techniques: DNA adenine methyltransferase identification (DamID) and ChIP-seq. We observed concordance of the resulting genomic localization, suggesting that these studies are converging on a robust map for NuRD in the cancer cell genome. MBD3 preferentially associated with CpG rich promoters marked by H3K4me3 and showed cell-type specific localization across gene bodies, peaking around the transcription start site. A subset of sites bound by MBD3 was enriched in H3K27ac and was in physical proximity to promoters in three-dimensional space, suggesting function as enhancers. MBD3 enrichment was also noted at promoters modified by H3K27me3. Functional analysis of chromatin indicated that MBD3 regulates nucleosome occupancy near promoters and in gene bodies. These data suggest that MBD3, and by extension the NuRD complex, may have multiple roles in fine tuning expression for both active and silent genes, representing an important step in defining regulatory mechanisms by which NuRD complex controls chromatin structure and modification status.
Recent studies indicate that a subclass of APOBEC cytidine deaminases, which convert cytosine to ... more Recent studies indicate that a subclass of APOBEC cytidine deaminases, which convert cytosine to uracil during RNA editing and retrovirus or retrotransposon restriction, may induce mutation clusters in human tumors. We show here that throughout cancer genomes APOBEC-mediated mutagenesis is pervasive and correlates with APOBEC mRNA levels. Mutation clusters in whole-genome and exome data sets conformed to the stringent criteria indicative of an APOBEC mutation pattern. Applying these criteria to 954,247 mutations in 2,680 exomes from 14 cancer types, mostly from The Cancer Genome Atlas (TCGA), showed a significant presence of the APOBEC mutation pattern in bladder, cervical, breast, head and neck, and lung cancers, reaching 68% of all mutations in some samples. Within breast cancer, the HER2-enriched subtype was clearly enriched for tumors with the APOBEC mutation pattern, suggesting that this type of mutagenesis is functionally linked with cancer development. The APOBEC mutation pattern also extended to cancer-associated genes, implying that ubiquitous APOBEC-mediated mutagenesis is carcinogenic.
Estrogen receptor α (ERα) interacts with DNA directly or indirectly via other transcription facto... more Estrogen receptor α (ERα) interacts with DNA directly or indirectly via other transcription factors, referred to as "tethering." Evidence for tethering is based on in vitro studies and a widely used "KIKO" mouse model containing mutations that prevent direct estrogen response element DNA- binding. KIKO mice are infertile, due in part to the inability of estradiol (E2) to induce uterine epithelial proliferation. To elucidate the molecular events that prevent KIKO uterine growth, regulation of the pro-proliferative E2 target gene Klf4 and of Klf15, a progesterone (P4) target gene that opposes the pro-proliferative activity of KLF4, was evaluated. Klf4 induction was impaired in KIKO uteri; however, Klf15 was induced by E2 rather than by P4. Whole uterine chromatin immunoprecipitation-sequencing revealed enrichment of KIKO ERα binding to hormone response elements (HREs) motifs. KIKO binding to HRE motifs was verified using reporter gene and DNA-binding assays. Because the KIKO ERα has HRE DNA-binding activity, we evaluated the "EAAE" ERα, which has more severe DNA-binding domain mutations, and demonstrated a lack of estrogen response element or HRE reporter gene induction or DNA-binding. The EAAE mouse has an ERα null-like phenotype, with impaired uterine growth and transcriptional activity. Our findings demonstrate that the KIKO mouse model, which has been used by numerous investigators, cannot be used to establish biological functions for ERα tethering, because KIKO ERα effectively stimulates transcription using HRE motifs. The EAAE-ERα DNA-binding domain mutant mouse demonstrates that ERα DNA-binding is crucial for biological and transcriptional processes in reproductive tissues and that ERα tethering may not contribute to estrogen responsiveness in vivo.
To advance understanding of mechanisms leading to biological and transcriptional endpoints relate... more To advance understanding of mechanisms leading to biological and transcriptional endpoints related to estrogen action in the mouse uterus, we have mapped ERα and RNA polymerase II (PolII) binding sites using chromatin immunoprecipitation followed by sequencing of enriched chromatin fragments. In the absence of hormone, 5184 ERα-binding sites were apparent in the vehicle-treated ovariectomized uterine chromatin, whereas 17,240 were seen 1 h after estradiol (E₂) treatment, indicating that some sites are occupied by unliganded ERα, and that ERα binding is increased by E₂. Approximately 15% of the uterine ERα-binding sites were adjacent to (<10 kb) annotated transcription start sites, and many sites are found within genes or are found more than 100 kb distal from mapped genes; however, the density (sites per base pair) of ERα-binding sites is significantly greater adjacent to promoters. An increase in quantity of sites but no significant positional differences were seen between vehicle and E₂-treated samples in the overall locations of ERα-binding sites either distal from, adjacent to, or within genes. Analysis of the PolII data revealed the presence of poised promoter-proximal PolII on some highly up-regulated genes. Additionally, corecruitment of PolII and ERα to some distal enhancer regions was observed. A de novo motif analysis of sequences in the ERα-bound chromatin confirmed that estrogen response elements were significantly enriched. Interestingly, in areas of ERα binding without predicted estrogen response element motifs, homeodomain transcription factor-binding motifs were significantly enriched. The integration of the ERα- and PolII-binding sites from our uterine sequencing of enriched chromatin fragments data with transcriptional responses revealed in our uterine microarrays has the potential to greatly enhance our understanding of mechanisms governing estrogen response in uterine and other estrogen target tissues.
Kinases are a ubiquitous group of enzymes that catalyze the phosphoryl transfer reaction from a p... more Kinases are a ubiquitous group of enzymes that catalyze the phosphoryl transfer reaction from a phosphate donor (usually ATP) to a receptor substrate. Although all kinases catalyze essentially the same phosphoryl transfer reaction, they display remarkable diversity in their substrate specificity, structure, and the pathways in which they participate. In order to learn the relationship between structural fold and functional specificities in kinases, we have done a comprehensive survey of all available kinase sequences (>17,000) and classified them into 30 distinct families based on sequence similarities. Of these families, 19, covering nearly 98% of all sequences, fall into seven general structural folds for which three-dimensional structures are known. These fold groups include some of the most widespread protein folds, such as Rossmann fold, ferredoxin fold, ribonuclease H fold, and TIM beta/alpha-barrel. On the basis of this classification system, we examined the shared substrate binding and catalytic mechanisms as well as variations of these mechanisms in the same fold groups. Cases of convergent evolution of identical kinase activities occurring in different folds are discussed.
Epigenetic regulation of gene expression is fundamental for cell type-specific gene expression. H... more Epigenetic regulation of gene expression is fundamental for cell type-specific gene expression. However, integrated comparative transcriptomic and epigenomic analyses in various adult primary differentiated cells remain underrepresented. We generated promoter landscapes of DNA methylation and three important histone methylation marks (H3K4me3, H3K9me2, and H3K27me3) in two primary cell types (B lymphocytes and liver) from adult mice. In line with previous studies, we also observed distinct H3K4me3 patterns at promoters dictated by CpG content in differentiated primary cells. We further explored the distribution of initiating RNA polymerase II and elongating RNA polymerase II across genes within different promoter classes, suggesting different rate-limiting steps at CpG-rich vs. CpG-poor genes. Examination of differentially expressed genes revealed that regulation of tissue-specific genes is closely related to gene function regardless of promoter type. Although repressive chromatin marks displayed differential preference to promoters based on CpG content, we observed fine-tuning of the pattern of association of these marks with specific promoter types in a cell type-specific manner. The distribution of H3K9me2 and H3K27me3, relative to CpG content, differed substantially between the two cell types. Cell-type specific accumulation of repressive chromatin marks was also observed at silent genes in both cell types, suggesting that differentiated primary cells may exhibit cell-type specificity in the distribution of repressive chromatin marks. Epigenetic regulation of gene expression and the association of specific histone marks with promoter sequence classes are fine-tuned in a cell type-specific manner. This unexpected finding underscores the value of extensive study of epigenetic marks across cell and tissue types.
GATA3 is a highly conserved, essential transcription factor expressed in a number of tissues, inc... more GATA3 is a highly conserved, essential transcription factor expressed in a number of tissues, including the mammary gland. GATA3 expression is required for normal development of the mammary gland where it is estimated to be the most abundant transcription factor in luminal epithelial cells. In breast cancer, GATA3 expression is highly correlated with the luminal transcriptional program. Recent genomic analysis of human breast cancers has revealed high-frequency mutation in GATA3 in luminal tumors, suggesting "driver" function(s). Here we discuss mutation of GATA3 in breast cancer and the potential mechanism(s) by which mutation may lead to a growth advantage in cancer.
LIN28 is an evolutionarily conserved RNA-binding protein with critical functions in developmental... more LIN28 is an evolutionarily conserved RNA-binding protein with critical functions in developmental timing and cancer. However molecular mechanisms underlying LIN28's oncogenic properties are yet to be described. RIP-Seq analysis revealed significant LIN28 binding within 843 mRNAs in breast cancer cells. Many of the LIN28 bound mRNAs are implicated in the regulation of RNA and cell metabolism. We identify heterogeneous nuclear ribonucleoprotein A1 (hnRNP A1), a protein with multiple roles in mRNA metabolism, as a LIN28 interacting partner. Subsequently, we use a custom computational method to identify differentially spliced gene isoforms in LIN28 and hnRNP A1 siRNA-treated cells. Results reveal these proteins regulate alternative splicing and steady state mRNA expression of genes implicated in aspects of breast cancer biology. Notably, cells lacking LIN28 undergo significant isoform switching of the ENAH gene, resulting in a decrease in the expression of ENAH exon 11a isoform. Exp...
A comprehensive update of the classification of all available kinases was carried out. This surve... more A comprehensive update of the classification of all available kinases was carried out. This survey presents a complete global picture of this large functional class of proteins and confirms the soundness of our initial kinase classification scheme. The new survey found the total number of kinase sequences in the protein database has increased more than three-fold (from 17,310 to 59,402), and the number of determined kinase structures increased two-fold (from 359 to 702) in the past three years. However, the framework of the original two-tier classification scheme (in families and fold groups) remains sufficient to describe all available kinases. Overall, the kinase sequences were classified into 25 families of homologous proteins, wherein 22 families (approximately 98.8% of all sequences) for which three-dimensional structures are known fall into 10 fold groups. These fold groups not only include some of the most widely spread proteins folds, such as the Rossmann-like fold, ferredox...
Nicotinamide/Nicotinate mononucleotide (NMN/NaMN) adenylyltransferase is an indispensable enzyme ... more Nicotinamide/Nicotinate mononucleotide (NMN/NaMN) adenylyltransferase is an indispensable enzyme in both de novo biosynthesis and salvage of NAD+ and NADP+. In prokaryotes, it is absolutely required for cell survival, thus representing an attractive target for the development of new broad-spectrum antibacteria inhibitors. The crystal structures of E. coli NaMN adenylyltransferase (NMNAT) and its complex with deamido-NAD (NaAD) revealed that ligand binding causes large conformational changes in several loop regions around the active site. The enzyme specifically recognizes the deamidated pyridine nucleotide through interactions between nicotinate carboxylate with several protein main chain amides and a positive helix dipole. Comparison of E. coli NMNAT with those from archaeal organisms revealed extensive differences in the active site architecture, enzyme-ligand interaction mode, and bound dinucleotide conformations. The bacterial NaMN adenylyltransferase structures described here provide a foundation for structure-based design of specific inhibitors that may have therapeutic potential.
The Mi-2/nucleosome remodeling and histone deacetylase (NuRD) complex is a multiprotein machine p... more The Mi-2/nucleosome remodeling and histone deacetylase (NuRD) complex is a multiprotein machine proposed to regulate chromatin structure by nucleosome remodeling and histone deacetylation activities. Recent reports describing localization of NuRD provide new insights that question previous models on NuRD action, but are not in complete agreement. Here, we provide location analysis of endogenous MBD3, a component of NuRD complex, in two human breast cancer cell lines (MCF-7 and MDA-MB-231) using two independent genomic techniques: DNA adenine methyltransferase identification (DamID) and ChIP-seq. We observed concordance of the resulting genomic localization, suggesting that these studies are converging on a robust map for NuRD in the cancer cell genome. MBD3 preferentially associated with CpG rich promoters marked by H3K4me3 and showed cell-type specific localization across gene bodies, peaking around the transcription start site. A subset of sites bound by MBD3 was enriched in H3K27ac and was in physical proximity to promoters in three-dimensional space, suggesting function as enhancers. MBD3 enrichment was also noted at promoters modified by H3K27me3. Functional analysis of chromatin indicated that MBD3 regulates nucleosome occupancy near promoters and in gene bodies. These data suggest that MBD3, and by extension the NuRD complex, may have multiple roles in fine tuning expression for both active and silent genes, representing an important step in defining regulatory mechanisms by which NuRD complex controls chromatin structure and modification status.
Recent studies indicate that a subclass of APOBEC cytidine deaminases, which convert cytosine to ... more Recent studies indicate that a subclass of APOBEC cytidine deaminases, which convert cytosine to uracil during RNA editing and retrovirus or retrotransposon restriction, may induce mutation clusters in human tumors. We show here that throughout cancer genomes APOBEC-mediated mutagenesis is pervasive and correlates with APOBEC mRNA levels. Mutation clusters in whole-genome and exome data sets conformed to the stringent criteria indicative of an APOBEC mutation pattern. Applying these criteria to 954,247 mutations in 2,680 exomes from 14 cancer types, mostly from The Cancer Genome Atlas (TCGA), showed a significant presence of the APOBEC mutation pattern in bladder, cervical, breast, head and neck, and lung cancers, reaching 68% of all mutations in some samples. Within breast cancer, the HER2-enriched subtype was clearly enriched for tumors with the APOBEC mutation pattern, suggesting that this type of mutagenesis is functionally linked with cancer development. The APOBEC mutation pattern also extended to cancer-associated genes, implying that ubiquitous APOBEC-mediated mutagenesis is carcinogenic.
Estrogen receptor α (ERα) interacts with DNA directly or indirectly via other transcription facto... more Estrogen receptor α (ERα) interacts with DNA directly or indirectly via other transcription factors, referred to as "tethering." Evidence for tethering is based on in vitro studies and a widely used "KIKO" mouse model containing mutations that prevent direct estrogen response element DNA- binding. KIKO mice are infertile, due in part to the inability of estradiol (E2) to induce uterine epithelial proliferation. To elucidate the molecular events that prevent KIKO uterine growth, regulation of the pro-proliferative E2 target gene Klf4 and of Klf15, a progesterone (P4) target gene that opposes the pro-proliferative activity of KLF4, was evaluated. Klf4 induction was impaired in KIKO uteri; however, Klf15 was induced by E2 rather than by P4. Whole uterine chromatin immunoprecipitation-sequencing revealed enrichment of KIKO ERα binding to hormone response elements (HREs) motifs. KIKO binding to HRE motifs was verified using reporter gene and DNA-binding assays. Because the KIKO ERα has HRE DNA-binding activity, we evaluated the "EAAE" ERα, which has more severe DNA-binding domain mutations, and demonstrated a lack of estrogen response element or HRE reporter gene induction or DNA-binding. The EAAE mouse has an ERα null-like phenotype, with impaired uterine growth and transcriptional activity. Our findings demonstrate that the KIKO mouse model, which has been used by numerous investigators, cannot be used to establish biological functions for ERα tethering, because KIKO ERα effectively stimulates transcription using HRE motifs. The EAAE-ERα DNA-binding domain mutant mouse demonstrates that ERα DNA-binding is crucial for biological and transcriptional processes in reproductive tissues and that ERα tethering may not contribute to estrogen responsiveness in vivo.
To advance understanding of mechanisms leading to biological and transcriptional endpoints relate... more To advance understanding of mechanisms leading to biological and transcriptional endpoints related to estrogen action in the mouse uterus, we have mapped ERα and RNA polymerase II (PolII) binding sites using chromatin immunoprecipitation followed by sequencing of enriched chromatin fragments. In the absence of hormone, 5184 ERα-binding sites were apparent in the vehicle-treated ovariectomized uterine chromatin, whereas 17,240 were seen 1 h after estradiol (E₂) treatment, indicating that some sites are occupied by unliganded ERα, and that ERα binding is increased by E₂. Approximately 15% of the uterine ERα-binding sites were adjacent to (<10 kb) annotated transcription start sites, and many sites are found within genes or are found more than 100 kb distal from mapped genes; however, the density (sites per base pair) of ERα-binding sites is significantly greater adjacent to promoters. An increase in quantity of sites but no significant positional differences were seen between vehicle and E₂-treated samples in the overall locations of ERα-binding sites either distal from, adjacent to, or within genes. Analysis of the PolII data revealed the presence of poised promoter-proximal PolII on some highly up-regulated genes. Additionally, corecruitment of PolII and ERα to some distal enhancer regions was observed. A de novo motif analysis of sequences in the ERα-bound chromatin confirmed that estrogen response elements were significantly enriched. Interestingly, in areas of ERα binding without predicted estrogen response element motifs, homeodomain transcription factor-binding motifs were significantly enriched. The integration of the ERα- and PolII-binding sites from our uterine sequencing of enriched chromatin fragments data with transcriptional responses revealed in our uterine microarrays has the potential to greatly enhance our understanding of mechanisms governing estrogen response in uterine and other estrogen target tissues.
Kinases are a ubiquitous group of enzymes that catalyze the phosphoryl transfer reaction from a p... more Kinases are a ubiquitous group of enzymes that catalyze the phosphoryl transfer reaction from a phosphate donor (usually ATP) to a receptor substrate. Although all kinases catalyze essentially the same phosphoryl transfer reaction, they display remarkable diversity in their substrate specificity, structure, and the pathways in which they participate. In order to learn the relationship between structural fold and functional specificities in kinases, we have done a comprehensive survey of all available kinase sequences (>17,000) and classified them into 30 distinct families based on sequence similarities. Of these families, 19, covering nearly 98% of all sequences, fall into seven general structural folds for which three-dimensional structures are known. These fold groups include some of the most widespread protein folds, such as Rossmann fold, ferredoxin fold, ribonuclease H fold, and TIM beta/alpha-barrel. On the basis of this classification system, we examined the shared substrate binding and catalytic mechanisms as well as variations of these mechanisms in the same fold groups. Cases of convergent evolution of identical kinase activities occurring in different folds are discussed.
Epigenetic regulation of gene expression is fundamental for cell type-specific gene expression. H... more Epigenetic regulation of gene expression is fundamental for cell type-specific gene expression. However, integrated comparative transcriptomic and epigenomic analyses in various adult primary differentiated cells remain underrepresented. We generated promoter landscapes of DNA methylation and three important histone methylation marks (H3K4me3, H3K9me2, and H3K27me3) in two primary cell types (B lymphocytes and liver) from adult mice. In line with previous studies, we also observed distinct H3K4me3 patterns at promoters dictated by CpG content in differentiated primary cells. We further explored the distribution of initiating RNA polymerase II and elongating RNA polymerase II across genes within different promoter classes, suggesting different rate-limiting steps at CpG-rich vs. CpG-poor genes. Examination of differentially expressed genes revealed that regulation of tissue-specific genes is closely related to gene function regardless of promoter type. Although repressive chromatin marks displayed differential preference to promoters based on CpG content, we observed fine-tuning of the pattern of association of these marks with specific promoter types in a cell type-specific manner. The distribution of H3K9me2 and H3K27me3, relative to CpG content, differed substantially between the two cell types. Cell-type specific accumulation of repressive chromatin marks was also observed at silent genes in both cell types, suggesting that differentiated primary cells may exhibit cell-type specificity in the distribution of repressive chromatin marks. Epigenetic regulation of gene expression and the association of specific histone marks with promoter sequence classes are fine-tuned in a cell type-specific manner. This unexpected finding underscores the value of extensive study of epigenetic marks across cell and tissue types.
Uploads