gene, brother of the regulator of imprinted sites (BORIS), is regulated by three alternative prom... more gene, brother of the regulator of imprinted sites (BORIS), is regulated by three alternative promoters
CTCF is a key organizer of the 3D genome. Its specialized paralog, BORIS, heterodimerizes with CT... more CTCF is a key organizer of the 3D genome. Its specialized paralog, BORIS, heterodimerizes with CTCF but is expressed only in male germ cells and in cancer states. Unexpectedly, BORIS-null mice have only minimal germ cell defects. To understand the CTCF-BORIS relationship, mouse models with varied CTCF and BORIS levels were generated. Whereas Ctcf+/+Boris+/+, Ctcf+/−Boris+/+, and Ctcf+/+Boris−/− males are fertile, Ctcf+/−Boris−/− (Compound Mutant; CM) males are sterile. Testes with combined depletion of both CTCF and BORIS show reduced size, defective meiotic recombination, increased apoptosis, and malformed spermatozoa. Although CM germ cells exhibit only 25% of CTCF WT expression, chromatin binding of CTCF is preferentially lost from CTCF-BORIS heterodimeric sites. Furthermore, CM testes lose the expression of a large number of spermatogenesis genes and gain the expression of developmentally inappropriate genes that are “toxic” to fertility. Thus, a combined action of CTCF and BORI...
Proceedings of the National Academy of Sciences, 2020
The DNA-binding protein CCCTC-binding factor (CTCF) and the cohesin complex function together to ... more The DNA-binding protein CCCTC-binding factor (CTCF) and the cohesin complex function together to shape chromatin architecture in mammalian cells, but the molecular details of this process remain unclear. Here, we demonstrate that a 79-aa region within the CTCF N terminus is essential for cohesin positioning at CTCF binding sites and chromatin loop formation. However, the N terminus of CTCF fused to artificial zinc fingers was not sufficient to redirect cohesin to non-CTCF binding sites, indicating a lack of an autonomously functioning domain in CTCF responsible for cohesin positioning. BORIS (CTCFL), a germline-specific paralog of CTCF, was unable to anchor cohesin to CTCF DNA binding sites. Furthermore, CTCF–BORIS chimeric constructs provided evidence that, besides the N terminus of CTCF, the first two CTCF zinc fingers, and likely the 3D geometry of CTCF–DNA complexes, are also involved in cohesin retention. Based on this knowledge, we were able to convert BORIS into CTCF with res...
An expansion of a CTG repeat at the DM1 locus causes myotonic dystrophy (DM) by altering the expr... more An expansion of a CTG repeat at the DM1 locus causes myotonic dystrophy (DM) by altering the expression of the two adjacent genes, DMPK and SIX5, and through a toxic effect of the repeat-containing RNA. Here we identify two CTCF-binding sites that flank the CTG repeat and form an insulator element between DMPK and SIX5. Methylation of these sites prevents binding of CTCF, indicating that the DM1 locus methylation in congenital DM would disrupt insulator function. Furthermore, CTCF-binding sites are associated with CTG/CAG repeats at several other loci. We suggest a general role for CTG/CAG repeats as components of insulator elements at multiple sites in the human genome.
CCCTC-binding factor (CTCF) is a conserved, essential regulator of chromatin architecture contain... more CCCTC-binding factor (CTCF) is a conserved, essential regulator of chromatin architecture containing a unique array of 11 zinc fingers (ZFs). Gene duplication and sequence divergence during early amniote evolution generated the CTCF paralog Brother Of the Regulator of Imprinted Sites (BORIS), which has a DNA binding specificity identical to that of CTCF but divergent N- and C-termini. While healthy somatic tissues express only CTCF, CTCF and BORIS are normally co-expressed in meiotic and post-meiotic germ cells, and aberrant activation of BORIS occurs in tumors and some cancer cell lines. This has led to a model in which CTCF and BORIS compete for binding to some but not all genomic target sites; however, regulation of CTCF and BORIS genomic co-occupancy is not well understood. We recently addressed this issue, finding evidence for two major classes of CTCF target sequences, some of which contain single CTCF target sites (1xCTSes) and others containing two adjacent CTCF motifs (2xCT...
Despite sharing the same sequence specificity in vitro and in vivo, CCCTC-binding factor (CTCF) a... more Despite sharing the same sequence specificity in vitro and in vivo, CCCTC-binding factor (CTCF) and its paralog brother of the regulator of imprinted sites (BORIS) are simultaneously expressed in germ cells. Recently, ChIP-seq analysis revealed two classes of CTCF/BORIS-bound regions: single CTCF target sites (1xCTSes) that are bound by CTCF alone (CTCF-only) or double CTCF target sites (2xCTSes) simultaneously bound by CTCF and BORIS (CTCF&BORIS) or BORIS alone (BORIS-only) in germ cells and in BORIS-positive somatic cancer cells. BORIS-bound regions (CTCF&BORIS and BORIS-only sites) are, on average, enriched for RNA polymerase II (RNAPII) binding and histone retention in mature spermatozoa relative to CTCF-only sites, but little else is known about them. We show that subsets of CTCF&BORIS and BORIS-only sites are occupied by several testis-specific transcriptional regulators (TSTRs) and associated with highly expressed germ cell-specific genes and histone retention in mature sperm...
The transcriptional repressor negative protein 1 (NeP1) binds specifically to the F1 element of t... more The transcriptional repressor negative protein 1 (NeP1) binds specifically to the F1 element of the chicken lysozyme gene silencer and mediates synergistic repression by v-ERBA, thyroid hormone receptor, or retinoic acid receptor. Another protein, CCCTC-binding factor (CTCF), specifically binds to 50-bp-long sequences that contain repetitive CCCTC elements in the vicinity of vertebrate c-myc genes. Previously cloned chicken, mouse, and human CTCF cDNAs encode a highly conserved 11-Zn-finger protein. Here, NeP1 was purified and DNA bases critical for NeP1-F1 interaction were determined. NeP1 is found to bind a 50-bp stretch of nucleotides without any obvious sequence similarity to known CTCF binding sequences. Despite this remarkable difference, these two proteins are identical. They have the same molecular weight, and NeP1 contains peptide sequences which are identical to sequences in CTCF. Moreover, NeP1 and CTCF specifically recognize each other's binding DNA sequence and indu...
We have isolated and analyzed human CTCF cDNA clones and show here that the ubiquitously expresse... more We have isolated and analyzed human CTCF cDNA clones and show here that the ubiquitously expressed 11-zinc-finger factor CTCF is an exceptionally highly conserved protein displaying 93% identity between avian and human amino acid sequences. It binds specifically to regulatory sequences in the promoter-proximal regions of chicken, mouse, and human c-myc oncogenes. CTCF contains two transcription repressor domains transferable to a heterologous DNA binding domain. One CTCF binding site, conserved in mouse and human c-myc genes, is found immediately downstream of the major P2 promoter at a sequence which maps precisely within the region of RNA polymerase II pausing and release. Gel shift assays of nuclear extracts from mouse and human cells show that CTCF is the predominant factor binding to this sequence. Mutational analysis of the P2-proximal CTCF binding site and transient-cotransfection experiments demonstrate that CTCF is a transcriptional repressor of the human c-myc gene. Althou...
CTCF is a ubiquitous 11 zinc finger (ZF) protein with highly versatile functions: in addition to ... more CTCF is a ubiquitous 11 zinc finger (ZF) protein with highly versatile functions: in addition to transcriptional silencing or activating in a context-dependent fashion, it organizes epigenetically controlled chromatin insulators that regulate imprinted genes in soma. Recently, we have identified a CTCF paralogue, termed BORIS for Brother of the Regulator of Imprinted Sites, that is expressed only in the testis.
CTCF and BORIS (CTCFL), two paralogous mammalian proteins sharing nearly identical DNA binding do... more CTCF and BORIS (CTCFL), two paralogous mammalian proteins sharing nearly identical DNA binding domains, are thought to function in mutually exclusive manners in DNA binding and transcriptional regulation. Here we show that these two proteins co-occupy a specific subset of regulatory elements consisting of clustered CTCF binding motifs (termed 2xCTSes). BORIS occupancy at 2xCTSes is largely invariant in BORIS-positive cancer cells, with the genomic pattern recapitulating the germline-specific BORIS binding to chromatin. In contrast to the single-motif CTCF target sites (1xCTSes), the 2xCTS elements are preferentially found at active promoters and enhancers, both in cancer and germ cells. 2xCTSes are also enriched in genomic regions that escape histone to protamine replacement in human and mouse sperm. Depletion of BORIS gene leads to altered transcription of a large number of genes and the differentiation of K562 cells, while the ectopic expression of this CTCF paralog leads to speci...
The interaction of nuclear sequence-specific DNA-binding proteins from definitive chicken erythro... more The interaction of nuclear sequence-specific DNA-binding proteins from definitive chicken erythrocytes, thymus and proliferating transformed erythroid precursor (HD3) cells with the 700-base-pair (700-bp) DNA 5'-flanking region of the chicken c-myc gene was investigated by in vitro footprint analysis. The major HD3 protein-binding activity binds to a site (site V) 200 bp upstream from the 'cap' site but, after further fractionation, a second distinct binding activity is detected to a site (site VIII) which contains both the 'CAAT' and 'SP1-binding' consensus sequences. Protein from thymus and erythrocyte cells which express c-myc at lower levels, bind to seven and eight sites respectively. In common with HD3 cell protein, they both bind to site VIII and, although binding to the sequence at site V is also detected, the footprint protection pattern is sufficiently different (site V') to suggest the involvement of different proteins in terminally differentiated and proliferating cells. The DNA-binding activities were partially fractionated by high-performance liquid chromatography gel filtration and include an erythrocyte-specific protein which binds to a c-myc gene poly(dG) homopolymer sequence similar to that found upstream of the chicken beta A-globin gene.
The chicken c-myc 5'-flanking sequence has previously been shown to bind multiple proteins pr... more The chicken c-myc 5'-flanking sequence has previously been shown to bind multiple proteins present in undifferentiated and differentiated red blood cells. In this report the protein binding to one specific region within a hypersensitive site approximately 200 base pairs upstream of the start of transcription has been analysed in detail. Using a combination of a modified agarose gel retardation assay with O-phenanthroline-copper footprinting in situ, missing contact point and methylation interference techniques, two proteins were found to bind to overlapping sequences within 180-230 bp upstream of the start of transcription. One protein resembles the transcription factor Sp1, the other is a protein which binds to three regularly spaced repeats of the core sequence CCCTC. This CCCTC-binding factor was termed CTCF. It requires additional sequences outside the three recognition motifs for tight binding. CTCF was purified to near homogeneity by sequence-specific DNA chromatography. T...
gene, brother of the regulator of imprinted sites (BORIS), is regulated by three alternative prom... more gene, brother of the regulator of imprinted sites (BORIS), is regulated by three alternative promoters
CTCF is a key organizer of the 3D genome. Its specialized paralog, BORIS, heterodimerizes with CT... more CTCF is a key organizer of the 3D genome. Its specialized paralog, BORIS, heterodimerizes with CTCF but is expressed only in male germ cells and in cancer states. Unexpectedly, BORIS-null mice have only minimal germ cell defects. To understand the CTCF-BORIS relationship, mouse models with varied CTCF and BORIS levels were generated. Whereas Ctcf+/+Boris+/+, Ctcf+/−Boris+/+, and Ctcf+/+Boris−/− males are fertile, Ctcf+/−Boris−/− (Compound Mutant; CM) males are sterile. Testes with combined depletion of both CTCF and BORIS show reduced size, defective meiotic recombination, increased apoptosis, and malformed spermatozoa. Although CM germ cells exhibit only 25% of CTCF WT expression, chromatin binding of CTCF is preferentially lost from CTCF-BORIS heterodimeric sites. Furthermore, CM testes lose the expression of a large number of spermatogenesis genes and gain the expression of developmentally inappropriate genes that are “toxic” to fertility. Thus, a combined action of CTCF and BORI...
Proceedings of the National Academy of Sciences, 2020
The DNA-binding protein CCCTC-binding factor (CTCF) and the cohesin complex function together to ... more The DNA-binding protein CCCTC-binding factor (CTCF) and the cohesin complex function together to shape chromatin architecture in mammalian cells, but the molecular details of this process remain unclear. Here, we demonstrate that a 79-aa region within the CTCF N terminus is essential for cohesin positioning at CTCF binding sites and chromatin loop formation. However, the N terminus of CTCF fused to artificial zinc fingers was not sufficient to redirect cohesin to non-CTCF binding sites, indicating a lack of an autonomously functioning domain in CTCF responsible for cohesin positioning. BORIS (CTCFL), a germline-specific paralog of CTCF, was unable to anchor cohesin to CTCF DNA binding sites. Furthermore, CTCF–BORIS chimeric constructs provided evidence that, besides the N terminus of CTCF, the first two CTCF zinc fingers, and likely the 3D geometry of CTCF–DNA complexes, are also involved in cohesin retention. Based on this knowledge, we were able to convert BORIS into CTCF with res...
An expansion of a CTG repeat at the DM1 locus causes myotonic dystrophy (DM) by altering the expr... more An expansion of a CTG repeat at the DM1 locus causes myotonic dystrophy (DM) by altering the expression of the two adjacent genes, DMPK and SIX5, and through a toxic effect of the repeat-containing RNA. Here we identify two CTCF-binding sites that flank the CTG repeat and form an insulator element between DMPK and SIX5. Methylation of these sites prevents binding of CTCF, indicating that the DM1 locus methylation in congenital DM would disrupt insulator function. Furthermore, CTCF-binding sites are associated with CTG/CAG repeats at several other loci. We suggest a general role for CTG/CAG repeats as components of insulator elements at multiple sites in the human genome.
CCCTC-binding factor (CTCF) is a conserved, essential regulator of chromatin architecture contain... more CCCTC-binding factor (CTCF) is a conserved, essential regulator of chromatin architecture containing a unique array of 11 zinc fingers (ZFs). Gene duplication and sequence divergence during early amniote evolution generated the CTCF paralog Brother Of the Regulator of Imprinted Sites (BORIS), which has a DNA binding specificity identical to that of CTCF but divergent N- and C-termini. While healthy somatic tissues express only CTCF, CTCF and BORIS are normally co-expressed in meiotic and post-meiotic germ cells, and aberrant activation of BORIS occurs in tumors and some cancer cell lines. This has led to a model in which CTCF and BORIS compete for binding to some but not all genomic target sites; however, regulation of CTCF and BORIS genomic co-occupancy is not well understood. We recently addressed this issue, finding evidence for two major classes of CTCF target sequences, some of which contain single CTCF target sites (1xCTSes) and others containing two adjacent CTCF motifs (2xCT...
Despite sharing the same sequence specificity in vitro and in vivo, CCCTC-binding factor (CTCF) a... more Despite sharing the same sequence specificity in vitro and in vivo, CCCTC-binding factor (CTCF) and its paralog brother of the regulator of imprinted sites (BORIS) are simultaneously expressed in germ cells. Recently, ChIP-seq analysis revealed two classes of CTCF/BORIS-bound regions: single CTCF target sites (1xCTSes) that are bound by CTCF alone (CTCF-only) or double CTCF target sites (2xCTSes) simultaneously bound by CTCF and BORIS (CTCF&BORIS) or BORIS alone (BORIS-only) in germ cells and in BORIS-positive somatic cancer cells. BORIS-bound regions (CTCF&BORIS and BORIS-only sites) are, on average, enriched for RNA polymerase II (RNAPII) binding and histone retention in mature spermatozoa relative to CTCF-only sites, but little else is known about them. We show that subsets of CTCF&BORIS and BORIS-only sites are occupied by several testis-specific transcriptional regulators (TSTRs) and associated with highly expressed germ cell-specific genes and histone retention in mature sperm...
The transcriptional repressor negative protein 1 (NeP1) binds specifically to the F1 element of t... more The transcriptional repressor negative protein 1 (NeP1) binds specifically to the F1 element of the chicken lysozyme gene silencer and mediates synergistic repression by v-ERBA, thyroid hormone receptor, or retinoic acid receptor. Another protein, CCCTC-binding factor (CTCF), specifically binds to 50-bp-long sequences that contain repetitive CCCTC elements in the vicinity of vertebrate c-myc genes. Previously cloned chicken, mouse, and human CTCF cDNAs encode a highly conserved 11-Zn-finger protein. Here, NeP1 was purified and DNA bases critical for NeP1-F1 interaction were determined. NeP1 is found to bind a 50-bp stretch of nucleotides without any obvious sequence similarity to known CTCF binding sequences. Despite this remarkable difference, these two proteins are identical. They have the same molecular weight, and NeP1 contains peptide sequences which are identical to sequences in CTCF. Moreover, NeP1 and CTCF specifically recognize each other's binding DNA sequence and indu...
We have isolated and analyzed human CTCF cDNA clones and show here that the ubiquitously expresse... more We have isolated and analyzed human CTCF cDNA clones and show here that the ubiquitously expressed 11-zinc-finger factor CTCF is an exceptionally highly conserved protein displaying 93% identity between avian and human amino acid sequences. It binds specifically to regulatory sequences in the promoter-proximal regions of chicken, mouse, and human c-myc oncogenes. CTCF contains two transcription repressor domains transferable to a heterologous DNA binding domain. One CTCF binding site, conserved in mouse and human c-myc genes, is found immediately downstream of the major P2 promoter at a sequence which maps precisely within the region of RNA polymerase II pausing and release. Gel shift assays of nuclear extracts from mouse and human cells show that CTCF is the predominant factor binding to this sequence. Mutational analysis of the P2-proximal CTCF binding site and transient-cotransfection experiments demonstrate that CTCF is a transcriptional repressor of the human c-myc gene. Althou...
CTCF is a ubiquitous 11 zinc finger (ZF) protein with highly versatile functions: in addition to ... more CTCF is a ubiquitous 11 zinc finger (ZF) protein with highly versatile functions: in addition to transcriptional silencing or activating in a context-dependent fashion, it organizes epigenetically controlled chromatin insulators that regulate imprinted genes in soma. Recently, we have identified a CTCF paralogue, termed BORIS for Brother of the Regulator of Imprinted Sites, that is expressed only in the testis.
CTCF and BORIS (CTCFL), two paralogous mammalian proteins sharing nearly identical DNA binding do... more CTCF and BORIS (CTCFL), two paralogous mammalian proteins sharing nearly identical DNA binding domains, are thought to function in mutually exclusive manners in DNA binding and transcriptional regulation. Here we show that these two proteins co-occupy a specific subset of regulatory elements consisting of clustered CTCF binding motifs (termed 2xCTSes). BORIS occupancy at 2xCTSes is largely invariant in BORIS-positive cancer cells, with the genomic pattern recapitulating the germline-specific BORIS binding to chromatin. In contrast to the single-motif CTCF target sites (1xCTSes), the 2xCTS elements are preferentially found at active promoters and enhancers, both in cancer and germ cells. 2xCTSes are also enriched in genomic regions that escape histone to protamine replacement in human and mouse sperm. Depletion of BORIS gene leads to altered transcription of a large number of genes and the differentiation of K562 cells, while the ectopic expression of this CTCF paralog leads to speci...
The interaction of nuclear sequence-specific DNA-binding proteins from definitive chicken erythro... more The interaction of nuclear sequence-specific DNA-binding proteins from definitive chicken erythrocytes, thymus and proliferating transformed erythroid precursor (HD3) cells with the 700-base-pair (700-bp) DNA 5'-flanking region of the chicken c-myc gene was investigated by in vitro footprint analysis. The major HD3 protein-binding activity binds to a site (site V) 200 bp upstream from the 'cap' site but, after further fractionation, a second distinct binding activity is detected to a site (site VIII) which contains both the 'CAAT' and 'SP1-binding' consensus sequences. Protein from thymus and erythrocyte cells which express c-myc at lower levels, bind to seven and eight sites respectively. In common with HD3 cell protein, they both bind to site VIII and, although binding to the sequence at site V is also detected, the footprint protection pattern is sufficiently different (site V') to suggest the involvement of different proteins in terminally differentiated and proliferating cells. The DNA-binding activities were partially fractionated by high-performance liquid chromatography gel filtration and include an erythrocyte-specific protein which binds to a c-myc gene poly(dG) homopolymer sequence similar to that found upstream of the chicken beta A-globin gene.
The chicken c-myc 5'-flanking sequence has previously been shown to bind multiple proteins pr... more The chicken c-myc 5'-flanking sequence has previously been shown to bind multiple proteins present in undifferentiated and differentiated red blood cells. In this report the protein binding to one specific region within a hypersensitive site approximately 200 base pairs upstream of the start of transcription has been analysed in detail. Using a combination of a modified agarose gel retardation assay with O-phenanthroline-copper footprinting in situ, missing contact point and methylation interference techniques, two proteins were found to bind to overlapping sequences within 180-230 bp upstream of the start of transcription. One protein resembles the transcription factor Sp1, the other is a protein which binds to three regularly spaced repeats of the core sequence CCCTC. This CCCTC-binding factor was termed CTCF. It requires additional sequences outside the three recognition motifs for tight binding. CTCF was purified to near homogeneity by sequence-specific DNA chromatography. T...
In situ DNase I footprinting confirms the in vivo presence of DUAL CTCF SITES unresolved by CTCF ... more In situ DNase I footprinting confirms the in vivo presence of DUAL CTCF SITES unresolved by CTCF ChIP_Seq alone -
CTCF & BORIS are co-bound within ChIP Seq Peaks with 2 adjacent in vivo DNase I footprints -
- F... more CTCF & BORIS are co-bound within ChIP Seq Peaks with 2 adjacent in vivo DNase I footprints -
An isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence enc... more An isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence encoding a human or a non-human BORIS, or a fragment of either of the foregoing; an isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence that is complementary to a nucleotide sequence encoding a human or a non-human BORIS, or a fragment of either of the foregoing; a vector comprising such an isolated or purified nucleic acid molecule; a cell comprising such a vector; an isolated or purified polypeptide molecule consisting essentially of an amino acid sequence encoding a human or a non-human BORIS, or a fragment of either of the foregoing; a cell line that produces a monoclonal antibody that is specific for an aforementioned isolated or purified polypeptide molecule; and the monoclonal antibody produced by the cell line; methods of diagnosing a cancer or a predisposition to a cancer in a male or female mammal; a method of prognosticating a cancer in a mammal; a method of assessing the effectiveness of treatment of a cancer in a mammal; a method of treating a mammal prophylactically or therapeutically for a cancer, and a composition comprising a carrier and an inhibitor of BORIS.
Centromeric and pericentromeric regions of mammalian chromosomes contain repetitive DNA sequences... more Centromeric and pericentromeric regions of mammalian chromosomes contain repetitive DNA sequences that exhibit a high rate of evolutionary change. The exact role of these sequences with respect to kinetochore/heterochromatin structure and function is not understood. This study investigates the effect of several repetitive elements on expression of a transgene cassette stably integrated into a mouse chromosome in mouse erythroleukemia cells. The results show that human gamma-satellite DNA derived from the pericentromeric region of human chromosome 8 prevents vector DNA-induced epigenetic silencing of an eGFP reporter gene. Electrophoretic mobility shift assay showed that gamma-satellite arrays from human chromosomes 8, X and Y contain clusters of CTCF binding sites, and that these sites are protein-bound independent of their methylation status at CpG sites. Chromatin immunoprecipitation experiments
Uploads
Papers by Victor V Lobanenkov
- Fig S6 from Pugacheva et al Genome Biol 2015
- Fig.2 from Pugacheva et. al. Gen Biol 2015