Abstract
Transcription factors (TFs) coordinate the on-and-off states of gene expression typically in a combinatorial fashion. Studies from embryonic stem cells and other cell types have revealed that a clique of self-regulated core TFs control cell identity and cell state. These core TFs form interconnected feed-forward transcriptional loops to establish and reinforce the cell-type-specific gene-expression program; the ensemble of core TFs and their regulatory loops constitutes core transcriptional regulatory circuitry (CRC). Here, we summarize recent progress in computational reconstitution and biologic exploration of CRCs across various human malignancies, and consolidate the strategy and methodology for CRC discovery. We also discuss the genetic basis and therapeutic vulnerability of CRC, and highlight new frontiers and future efforts for the study of CRC in cancer. Knowledge of CRC in cancer is fundamental to understanding cancer-specific transcriptional addiction, and should provide important insight to both pathobiology and therapeutics.
Similar content being viewed by others
Introduction
Transcriptional regulation is one of the fundamental molecular processes occurring in a cell. Sequence-specific DNA-binding proteins, also known as transcription factors (TFs), orchestrate gene-expression patterns in various cell types and growth conditions [1]. Although thousands of TFs have been identified, only a limited cohort of master TFs controls the core transcriptional programs governing cell identity [2,3,4]. Master TFs are highly expressed in a given cell type. Master TFs bind to the majority of cell-type-specific enhancers and dictate expression of cell-type-specific genes. Till now, one critical goal in biology remains to understand the composition and hierarchy of transcriptional regulatory network in each specified cell type/lineage. One of the best-studied models is embryonic stem cells (ESCs). In ESCs, three master TFs (pluripotency factors) NANOG, POU5F1/OCT4 and SOX2 form interconnected feed-forward transcriptional loops to maintain gene-expression program associated with ESC identity (Fig. 1a) [5, 6]. Such a pluripotent transcriptional regulatory network nurtured the model/concept of core transcriptional regulatory circuitry (CRC) in ESCs [7]. Apart from core TFs, additional layers of core regulators have been discovered and integrated into the pluripotent transcription circuitry including external signaling pathways [8], chromatin/histone modifiers (Polycomb group proteins [9], histone acetyltransferase MOF [10], and WDR5 [11]), basal transcription machinery TF IID complex [12], histone modifications (H3K56ac [13]), noncoding RNAs (microRNAs [14] and lncRNAs [15]), and transposable elements [16]. Insights from ESC-associated CRC have guided subsequent exploration of similar core feed-forward transcriptional networks during lineage specification, development, and tumorigenesis [17,18,19,20,21,22,23]. These studies suggest widespread existence and critical function of CRCs in both physiological and pathological conditions.
a The first model of core transcriptional regulatory circuitry in human embryonic stem cells. Gene loci and super enhancers are depicted as rectangles. b Convergence of genomic defect and oncogenic signaling dysregulation on feed-forward core transcriptional circuit in human cancer. c A model of phase-separated liquid condensates in regulating super enhancer (SE) driven gene expression. SE domains enrich asymmetric loading of core TFs, BET bromodomain proteins and additional chromatin regulatory factors/co-factors (e.g., p300 and HDACs). Regionally concentrated BET proteins, mediators, and RNA polymerase II are capable to form liquid condensates to favor productive transcription of SE targets within a topologically associating domain.
This review mainly focuses on the progress of CRC studies in human cancers, diseases of uncontrolled cell growth. We have learned from genomic sequencing projects that cancer is driven by genetic alterations [24]. Oncogenic genetic abnormalities invariably reprogram transcriptome in order to establish and maintain cancerous identity/state [25]. As featured by defective terminal differentiation, cancer cells are often locked into a growth state resembling either stem/progenitor cells or cells in a certain developmental stage. A growing body of evidence shows that dysregulated transcriptional programs in cancers are dominated by a clique of interconnected master TFs [21, 26,27,28,29,30]. These master regulators form feed-forward autoregulatory loops, and function in a combinatorial way to enhance expression of cancer-promoting genes. Aberrations in cancer genome, signaling transduction, and epigenetic regulation (e.g., super enhancer (SE)) activate cancer-specific expression of master TFs, which frequently hijack lineage-specific TFs, pioneer factors, epigenetic readers, mediators, and chromatin-regulating machineries to reshape epigenome [4, 21, 29, 31,32,33,34,35]. Cancer-specific activation of core TFs and subsequent rewiring of lineage-associated CRC components establish the genetic basis of oncogenic CRCs (Fig. 1b). Phase-separated condensation of core TFs, BRD4, mediators, and RNA polymerase II emerges as a biophysical mechanism to ensure high local availability of chromatin regulators, and transcription machineries for productive oncogenic transcription (Fig. 1c) [36,37,38].
Elucidating the core transcriptional regulatory programs will provide better understanding of molecular carcinogenesis. Lineage-specific components in CRC can inform on both cell-of-origin in cancer and selective oncogenic dependencies. Moreover, disruption of CRC in cancer cells by either genetic approach or pharmacological inhibition greatly impairs their malignant characteristics and tumorigenicity. Thus, CRC represents a mechanism of oncogenic addiction and a potential target for novel therapeutic interventions in cancer. Here, we summarize recent efforts to identify, characterize, and target CRCs across various human cancers, and highlight key insights that have emerged from these seminal studies.
Strategy and methodology for CRC identification
Inspired from ESC studies, self-regulation and interconnection are two important mechanisms to stabilize TF network. Key features of CRC include (1) self-regulated expression of each core TF, (2) direct regulation among core factors, and (3) feed-forward transcriptional control. Hence, identification of CRC is highly dependent on mapping of TF binding sites and biological verification of cross regulation. Genomic occupancy analysis of candidate core TFs enables systematic annotation of their direct targets. Indeed, initial modeling of CRCs in ESCs [5], hepatocytes [18], and T-cell acute lymphoblastic leukemia (T-ALL) [21] was based on genome-wide discovery of TF-DNA interactions via chromatin immunoprecipitation (ChIP) followed by either microarray hybridization or high-throughput sequencing (ChIP-seq). Typically, core TFs occupy their own promoters/enhancers. Meanwhile, core TFs often bind in close proximity to cis-regulatory elements of their target genes, producing a âco-occupancyâ pattern of genomic binding. Potential biochemical mechanism of this âco-occupancyâ pattern is associated with either direct proteinâprotein interactions (e.g., SOX2-OCT4 heterodimer) or co-existence of core TFs within multi-subunit protein complexes. The mutual co-occupancy of core TFs within their own cis-regulatory elements constitutes an interconnected autoregulatory loop, while co-loading of TFs across the vast majority of downstream target genes implicates their substantial co-operation in gene regulation.
To ensure feed-forward regulation within CRC, core TFs commonly bind to cis-regulatory elements and open chromatin regions, including promoters, enhancers, DNase I hypersensitive sites, and SEs [4]. Since TF motifs are overrepresented in genomic regions occupied by respective TFs, systematic identification of TF motifs across the cis-regulatory elements of a given sample will provide raw materials to reconstruct a regulatory network. SEs have been characterized as constituent enhancers, which are bound densely by multiple TFs [4, 31]. Since SEs are frequently engaged in expression of lineage-specific and/or disease-promoting genes including core TFs, SE-driven TFs serve as promising candidates for core TFs in CRC. Based on the principles of âself-regulationâ and âinterconnection,â several computational programs have been developed to wire potential regulatory nodes and to model CRC. Saint-André et al. [23] developed a âCRC Mapperâ program to reconstruct human CRC models of SE-associated TFs (Fig. 2a). This approach utilized FIMO software [39] package from the MEME suite [40] to scan TF motifs inside SE regions (with 500-bp extension of each SE boundary), and then retrieved both auto-regulated TFs and all possible fully interconnected regulatory circuitries. Top-scoring circuitry of which TFs exhibited the highest frequency of occurrence across all predicted circuitries was designated as CRC in a given sample. âCRC Mapperâ has demonstrated excellent performance to recapitulate experimentally verified CRCs in ESCs and T-ALL. So far, this program has been implemented to build CRC models in a large variety of samples [19, 41, 42]. Remarkably, dbCoRC (http://dbcorc.cam-su.org), a comprehensive online database of âCRC Mapperâ-inferred circuitries in over 230 human/murine samples has been developed and is freely accessible, representing a valuable resource to explore transcriptional regulatory networks [41]. Using similar principles of motif-based TF connectivity for network construction, Lin et al. [26] developed a âColtronâ python package (https://pypi.org/project/coltron), which quantified degree of inward (IN) and outward (OUT) regulation of SE-regulated TFs (NODEs) across putative nucleosome-free regions (NFRs) of their constituent enhancers. Based on motif analysis, IN-degree for a given SE-regulated TF (NODE) was defined as the number of SE-regulated TFs bound to its proximal/assigned SE; OUT-degree was calculated as the number of TF-associated SEs bound by a given SE-regulated TF. Total degree (INâ+âOUT) of each NODE can be applied to rank the connectivity of TFs. CRC was defined as the top-ranked autoregulatory TF network (CLIQUE), which showed highest average enrichment of core TFs among all CLIQUEs [27]. The capability of âColtronâ to predict motif-based TF binding is enhanced, as it scans TF motifs inside NFRs instead of whole SE domain (Fig. 2b) [43]. Further, either incorporation of experimentally derived TF footprints data (DNase I hypersensitive sites) or accessible regions from ATAC-seq (Assay for Transposase-Accessible Chromatin using sequencing) can improve the prediction of TF binding sites [2, 27, 44,45,46]. In addition to genomic occupancy and TF connectivity, regulatory readout of core TFs and their functional impact on cell behaviors are other fundamental aspects for CRC. Since core TFs with feed-forward transcriptional regulation often show positively correlated expression, co-expression analysis of TFs among cancer type/subtype of interest can be employed to shortlist candidate core TFs. Furthermore, genetic manipulation assays represent the âgold standardâ to validate both regulatory circuitry among core TFs and functional importance of TFs in maintaining cell identity. Depletion of individual core TFs will reduce expression of other members within CRC, and subsequently impair cellular phenotypes (e.g., proliferation and differentiation). Systematic analysis of putative downstream targets will also help to identify important mediators/executors in regulating cell identity.
a Schematic of enhancer-centric reconstruction of core transcriptional regulatory circuitry. b Flow chart showing the key steps of CRC reconstruction by CRC Mapper and Coltron. Both methods model CRC based on TF connectivity among SE regions, while they differ in several parameters during steps of motif scanning and reconstructing TF loops.
Experimentally verified CRCs and their implications in cancer pathobiology and therapeutics
To gain insight into the roles of CRCs in human cancers, we next summarize current knowledge of biologically verified CRCs in human cancers.
T-cell acute lymphoblastic leukemia (T-ALL)
T-ALL is a disease with cancerous proliferation of immature thymocytes. Genetic lesions affecting an oncogenic TF TAL1/SCL are frequently observed in human T-ALL cells [47]. Either chromosomal translocation involving T-cell receptor loci or intrachromosomal deletion of an 80-kb DNA fragment in 1p33 involving STIL (SIL-TAL1 deletion or TAL1d) has been shown to contribute to the overexpression of TAL1 [47]. TAL1 activation blocks T-cell differentiation, reactivates stem cell genes, and leads to malignant transformation [48]. In TAL1-positive T-ALL (e.g., Jurkat and CCRF-CEM cells), Sanda et al. identified a positive feed-forward regulatory circuitry among TAL1, RUNX1, and GATA3 based on ChIP-seq and expressional analyses [21]. TAL1 transcriptional complex included HEB, E2A, LMO1/2, and MYB; MYB was further established as an additional core TF to extend the aforementioned core regulatory loop in T-ALL [49] (Fig. 3a). Remarkably, Mansour et al. reported a mechanism of monoallelic TAL1 activation involving somatic mutation of an enhancer element at 7.5-kb upstream of the transcription start site of TAL1 [49]. These mutations introduced de novo MYB binding site to the TAL1 SE, facilitating the subsequent recruitment of MYB, histone acetyltransferase CBP, and additional components of TAL1 complex. CRISPR/Cas9-mediated deletion of TAL1 enhancer mutation profoundly impaired both MYB binding and H3K27ac signals in this SE. Of note, TAL1, RUNX1, GATA3, and MYB bound to the respective enhancers or SE regions of each other, as well as those of their own. They also co-occupied the regulatory elements of many downstream targets, such as the leukemogenic TRIB2, ARID5B, CDK6, and lncRNAs [50,51,52]. Among these targets, ARID5B was further characterized as a collaborating oncogenic factor to maintain expression of core TFs and participate in the transcriptional program of T-ALL [51]. Functional studies demonstrated that depletion of either TAL1 or other components of the circuitry not only disrupted the expression of all core TFs, but also attenuated survival of TAL1-positive T-ALL cells. In addition, RUNX1 has been reported to be required for the growth of both TAL1-induced murine T-ALL cells and human T-ALL cells [53]. Given the prevalent alterations of TAL1 expression in T-ALL, activation of TAL1 and subsequent establishment of T-ALL-associated CRC are likely the primary events during malignant transformation of TAL1-positive cases. Either selective degradation of bromodomain and extra terminal (BET) proteins or targeted inhibition of kinases of RNA polymerase II carboxyl-terminal domain (e.g., CDK7 and CDK9) effectively collapsed the core regulatory circuitries in T-ALL cells, hence exhibiting potent antineoplastic activities [54,55,56].
Diagrams showing the core regulatory circuitries and key downstream targets from extended network in a TAL1-positive T-cell acute lymphoblastic leukemia, b group 3 medulloblastoma, c group 4 medulloblastoma, d MYCN-driven neuroblastoma (sympathoadrenal subtype), e fusion-positive rhabdomyosarcoma, f dedifferentiated liposarcoma, g esophageal adenocarcinoma and h esophageal squamous cell carcinoma. Genomic alterations that contribute to the aberrant expression of core TFs in each disease are indicated in the red dashed boxes. Representative downstream targets of core regulatory circuitries are listed in blue dashed boxes. Gene loci and super enhancers are depicted as rectangles.
Medulloblastoma
Medulloblastoma is the most common type of pediatric brain tumor, which can be classified into four molecular subtypes: WNT, SHH, Group 3, and Group 4 [57]. Lin et al. charted the active enhancer landscape of 28 primary medulloblastoma specimens based on simultaneous profiling of DNA methylome, transcriptome, and chromatin loading of BRD4 and H3K27ac [26]. In this study, âColtronâ was developed for computational reconstruction of subtype-specific TF regulatory networks. This analysis of SE-regulated TFs identified cliques of TFs with a high likelihood of motif co-occurrence, interconnectivity, and proteinâprotein interaction. A CRC blueprint was generated for each subgroup, providing valuable insights to heterogeneous identities of medulloblastomas. WNT-subtype medulloblastomas may be driven by core TFs including LEF1, MAF, RUNX2, and EMX2. SHH subtype potentially enrolled OTX1, BARHL2, MAFF, and GLI2 as core TFs. Group 3 tumors were initially predicted to enrich CRX, HLX, LHX2, and OTX2, yet the top-ranked CRC model was revised to include OTX2, NRL, and CRX (Fig. 3b) after re-analysis of TF-enhancer interaction networks using an updated TF motif database (allocation of MAF motifs to NRL motifs) [58]. In support of this, OTX2 is frequently amplified and overexpressed in this group [59, 60]. OTX2 has been shown as a pathogenic TF that shapes the chromatin landscape of Group 3 medulloblastomas [61]. Further, OTX2 was recruited to SE regions of NRL and CRX. OTX2 expression also showed a strong positive correlation with both NRL and CRX expression in this subtype. Meanwhile, NRL and CRX were inferred to occupy ~2/3 of all SE-associated genes in Group 3 [58]. Functionally, these two core TFs cooperated to regulate a photoreceptor transcriptional program and expression of several oncogenes (e.g., CCND2 and BCL2L1), suggesting identity of photoreceptor lineage for Group 3 medulloblastomas. Group 4 medulloblastomas relied on the feed-forward co-expression of LMX1A, LHX2, and EOMES (Fig. 3c) [26]. Group 4-specific regulatory circuitry was partially verified by ChIP-seq analysis of LMX1A and LHX2. Notably, Group 4 CRC also resembled an expressional pattern of progenitors in cerebellar upper rhombic lip, suggesting a potential cell-of-origin for these tumors. Collectively, these medulloblastoma studies exemplify insightful applications of CRC reconstruction in understanding of tumor cell-of-origin and pathobiology.
Neuroblastoma
As a cancer derived from multipotent neural crest cells (NCCs) of the peripheral sympathetic nervous system, neuroblastoma represents one of the most common pediatric malignancies. Boeva and colleagues applied âColtronâ to infer CRC models from SE landscapes of human neuroblastoma cell lines, patient-derived xenografts (PDXs), and primary NCC lines [28]. They reported three types of identity among these samples. Group 1 samples comprised the majority of neuroblastoma cell lines and PDXs, and engaged SE-associated master TFs (e.g., PHOX2B, HAND2 and GATA3) to control sympathetic adrenergic cell identity. This module of core TFs showed strong co-expression and constituted a conserved CRC in both group 1 neuroblastoma cell lines and PDXs. In addition to the physical interaction between PHOX2B and GATA3, all three TFs co-localized in the regulatory sequences of their own loci, as well as many other prominent driver genes including MYCN and ALK. Group 2 samples resembled the SE pattern of NCCs, which were driven by a CRC module containing AP-1 TFs, e.g., FOSL1 and FOSL2. The third group of neuroblastomas was characterized by a mixture of both Groups 1 and 2 modules; indeed, most primary neuroblastoma specimens exhibited mixed expression of TFs of both modules. In line with the observation that most MYCN-amplified neuroblastoma cells enriched the adrenergic TF module, functional genomics studies with CRISPR-Cas9 screen identified MYCN, PHOX2B, HAND2, GATA3, and additional SE-regulated TFs including ISL1 and TBX2 as key dependency genes showing selective requirement for proliferation and survival of MYCN-amplified neuroblastoma cells [32]. Compound depletion of MYCN and TBX2 exerted a greater deleterious impact on both cell growth and downstream FOXM1/E2F network [62]. Notably, MYCN, PHOX2B, HAND2, GATA3, ISL1, and TBX2 demonstrated clustered binding across open chromatin regions (as âepicentersâ) of their own genes and interdependent expression. ASCL1 and an auxiliary adapter protein (LMO1) further expanded this adrenergic transcriptional regulatory circuitry by co-binding to target regions (Fig. 3d) [63]. Both ASCL1 and LMO1 maintained proliferative potential of neuroblastoma cells. ASCL1 overexpression also suppressed pathways involved in the development of sympathetic nervous system and noradrenergic lineage commitment.
Remarkably, MYCN and TBX2 are frequently targeted by somatic genomic alterations in high-risk neuroblastoma samples. Genomic amplification of MYCN rendered its function as an exceptional CRC member whose overexpression surpassed the regulatory impact from many other core TFs [32]. In contrast, in case of cells with single copy of MYCN (e.g., CLB-GA), MYCN expression was dependent on TBX2 [62]. Segmental gains and rare focal amplification of chromosome 17q coincided with elevated expression of TBX2 [62]. Germline single nucleotide polymorphism (rs2168101 G>T) within intron 1 of LMO1 has also been associated with predisposition to neuroblastoma [64]. This risk allele increased LMO1 expression by creating a GATA3 binding site within its SE element. Hence, both somatic and germline genomic changes contribute to neuroblastoma development via aberrant activation of adrenergic transcriptional regulatory module.
Several attempts to disrupt the adrenergic transcriptional regulatory circuitry in neuroblastoma have been made, including combined treatment of CDK7 inhibitor (THZ1) with either BET protein inhibitor (JQ1) or HDAC inhibitor (Panobinostat) [32, 62]. Combination of THZ1 and JQ1 demonstrated synergistic repressive effects on both cell viability and CRC gene expression. Since plasticity of CRCs in neuroblastomas has been observed during chemotherapeutic treatment [28], incorporation of CRC-disrupting agents into current therapeutic scheme may provide additional benefit to neuroblastoma patients.
Rhabdomyosarcoma (RMS)
RMS is an aggressive soft-tissue cancer that arises from skeletal muscle precursors. Fusion-positive rhabdomyosarcoma (FP-RMS) is defined by translocations involving chimeric TFs either PAX3-FOXO1 (P3F) or PAX7-FOXO1 [65]. Amplification of MYCN is also frequently detected in FP-RMS [66]. Four core TF modules have been identified in RMS samples: (1) a normal- and tissue-specific module including NR4A1 and MEF2D; (2) a RMS-general module including MYOD1 and MYOG; (3) a fusion-negative (FN)-RMS module including PAX7 and AP-1 members; and (4) a FP-RMS-specific module including MYCN and the oncofusion TFs [34]. In FN-RMS with RAS mutations, aberrant MAPK activity enhanced cell proliferation, whereas blocked myogenic differentiation [35]. Targeted inhibition of RAS pathway with MEK inhibitor (trametinib) rewired substantially the core TF connectivity (modeled by âColtronâ) via MYOG-dependent chromatin remodeling and SE formation at genes required for late myogenic differentiation [35]. Although more biological verification is required to support the subtype-specific CRC in FN-RMS, this study implicates a mechanism that signaling driver mutations integrate with core TF circuitry to promote cancer development.
By far, FP-RMS represents an insightful example of CRC driven by oncogenic fusion TF. Gryder et al. found that P3F reprogramed the cis-regulatory landscape of RMS and established a myogenic SE circuitry together with additional master TFs (MYOG, MYOD, and MYCN) [29]. HiChIP analysis of H3K27ac verified the interactions between core TF genes and their SEs [34]. These core TFs were overexpressed in FP-RMS compared to normal tissues, and collaborated on a myogenic transcriptional program. Ablation of MYOG, MYOD, or MYCN attenuated the expression of all core TFs, while silencing of P3F reduced MYOD and MYCN, but not MYOG. In addition, P3F, MYOG, and MYOD enhanced expression of SOX8, which was a marker for muscle satellite cells and a repressor of transcriptional program associated with differentiation from myoblasts to myotubes [34]. SOX8 also co-localized with other core TFs across the majority of SEs, coordinating the transcriptional chromatin topologies in FP-RMS [67]. The SEs controlling P3F (at FOXO1 locus), MYOD, MYCN, and SOX8 were co-occupied by all core TFs (Fig. 3e). However, MYOG SE was bound by all core TFs except P3F, which was thought to be consistent with the sequential activation of TFs during normal myogenesis. Disruption of SOX8 paradoxically elevated expression of P3F, MYOD1, and MYOG, indicative of a potential negative feedback to block terminal differentiation in FP-RMS. Hence, SOX8 is more likely a co-regulatory TF, which acts as a downstream target of CRC and provides either positive or negative feedback to distinct subsets of core TFs. Importantly, functional screening of SE-associated TFs using a DNA-binding domain-focused CRISPR knockout library affirmed the oncogenic dependency of FP-RMS on SOX8 and the rest of core TFs [34]. Together, P3F and MYCN calibrate the balance between myogenic and anti-myogenic core oncogenic TF circuitries to maintain dedifferentiated state of FP-RMS cells.
Biochemical studies revealed that P3F functioned as a pioneering factor to open chromatin, and subsequently recruited mediator proteins/co-activators (e.g., MED1, p300, BRD4) to enable enhancer looping and transcriptional activation [29]. Functional interplay between P3F and BRD4 yielded a significant susceptibility of FP-RMS cells to BRD inhibition by JQ1. JQ1 destabilized P3F protein and demonstrated a remarkable, selective suppression of P3F targets including core TFs and SE-driven kinases (e.g., ALK). Interestingly, both p300 and HDAC showed extensive co-occupancy with P3F, core TFs and BRD4 across the âepicentersâ within SE regions [34, 67]. A JQ1-like SE selectivity was recapitulated by small molecules targeting histone-acetylation writers (p300/CBP) and erasers (HDAC1/2/3), but not general transcriptional modulators (e.g., triptolide, α-Amanitin, and flavopiridol). Triple inhibition of HDAC1/2/3 with either Entinostat or combination of a HDAC1/2 inhibitor Merck60 and a HDAC3 inhibitor LW3 disrupted profoundly the expression of core TFs in FP-RMS cells. Such a SE selectivity of HDAC inhibitors was achieved through excessive increase in the chromatin accessibility of genomic loci, which were co-loaded with both HDAC and core TFs. In contrast, HDAC binding sites without co-binding of core TFs showed a weaker change. As a result of HDAC inhibition, local concentration of acetylated histones was increased around âepicenters.â Aberrant increase in histone acetylation enhanced H3K27ac spreading toward distal boundaries of H3K27ac domains, especially for shorter TF genes (e.g., SOX8, MYOG, MYOD1, and MYCN). Meanwhile, hyper-acetylation of histones also triggered new chromatin contacts at the expense of endogenous, transcription-supportive SE loops within CTCF-defined domains. For instance, Entinostat induced enhancer spreading and aberrant new interactions across 40-kb gap between two MYOD1 SEs, while it diminished their initial interaction. Furthermore, quantitative ChIP-seq analysis (by ChIP-Rx assay) upon HDAC inhibition revealed a general reduction in occupancy of RNA polymerase II, P3F, SOX8, and MYOD1 at regulatory elements of core TFs. In the meantime, genomic binding of MYOG, p300, HDAC2, and HDAC3 remained stable, whereas loading of YY1, RAD21, and BRD4 was significantly increased. The unloading of RNA polymerase II from chromatin may act through two mechanisms: (1) inhibition of pause release at the global scale and (2) alteration of phase separation at core TFs. Notably, HDAC inhibition rapidly dissolved RNA polymerase II condensates, which have been reported to dynamically touch SEs [36,37,38], while phase-separated BRD4 resisted chromatin hyper-acetylation. Therefore, P3F-driven CRC in FP-RMS is vulnerable to agents targeting the histone-acetylation axis.
Dedifferentiated liposarcoma (DDLPS)
DDLPS represents a high-grade subtype of adipocytic mesenchymal tumors. Enhancer profiling of both established cell lines and primary DDLPS tumors enabled discovery of SE-driven FOSL2, MYC, and RUNX1 as top co-operative core TFs (Fig. 3f) [68]. Genomic occupancy analysis revealed a substantial binding and co-loading of BRD2, BRD3, BRD4, FOSL2, and RUNX proteins across active enhancers, especially SE regions of core TFs. Moreover, FOSL2, MYC, and RUNX1 demonstrated inter-dependent expression in DDLPS cells, which was further supported by their robust co-expression in primary tumor specimens. The prominent function of FOSL2 suggests high activity of AP-1 complex in DDLPS. As genomic studies have identified frequent amplification of another AP-1 member JUN in this disease [69], JUN amplification likely contributes to the DDLPS-specific regulatory circuitry. In search of key downstream targets of CRC, SNAI2 was identified as a leading hit whose expression was elevated in DDLPS tumors and was associated with adverse prognosis. Of note, all of the above-mentioned BET bromodomain proteins, core TFs and SNAI2 were essential for both proliferation and tumorigenicity of DDLPS cells. BET protein degraders (e.g., ARV-825 and dBET6) inhibited preferentially SE-associated genes including core TFs and SNAI2, and exerted stronger anti-DDLPS activities than conventional BET bromodomain inhibitors. Therefore, BET protein degraders are promising modalities to subvert oncogenic dependence on disease-driving CRC.
Esophageal cancer
Esophageal cancer ranks globally among the top ten most common cancers. This disease can be sub-grouped into esophageal adenocarcinoma (EAC) and esophageal squamous cell carcinoma (ESCC), two entities showing many contrasting features at epidemiological, clinical, and molecular levels [70]. In line with the significant disparities in transcriptome, cancer genome, DNA methylatome, and chromatin profiles, EAC and ESCC display subtype-specific enhancer topography and master TF connectivity [71, 72].
By analyzing EAC-specific SE engagement and co-expression, Chen et al. identified a clique of four master TFs (i.e., ELF3, KLF5, GATA6, and EHF) forming an interconnected circuitry to dictate EAC transcriptome [71]. Overexpression of these four TFs in EAC cells was mutually dependent on the expression of each other. ChIP-seq analysis of ELF3, KLF5, and GATA6 revealed co-occupancy of these TFs at the SE regions of their own and EHF, except for a weak signal of ELF3 binding at GATA6 enhancers (Fig. 3g). Based on integrative analysis of H3K27ac ChIP-seq and circular chromosome conformation capture sequencing (4C-seq), five constituent enhancers (E1âE5) of ELF3 were identified within SE. Among these ELF3 enhancers, E1 demonstrated a strong EAC-specific enhancer activity and was concurrently bound by ELF3, KLF5, and GATA6. Blocking chromatin accessibility of E1 by CRISPR/dCas9-KRAB was able to decrease expression of ELF3, as well as the rest core TFs. Furthermore, ELF3, KLF5, and GATA6 localized in close proximity to each other across the EAC genome. Genes with their enhancers bound by one or more core TFs showed higher expression than those with no binding. Consistent with substantial involvement of these core TFs in EAC transcriptome, they played positive roles to maintain survival and proliferation of EAC cells. SE-driven LIF, LIFR and HNF4A represented leading candidates for CRC downstream targets which exhibited both disease-specificity and therapeutic venerability [71, 73].
In ESCC cells, TP63, SOX2, and KLF5 established a disease-driving CRC to maintain chromatin accessibility and oncogenic transcription (Fig. 3h) [72]. TP63, SOX2, and KLF5 were reported to interact directly with each other and have overlapping occupancy at the SE elements including those of their own. Knockdown of individual core TF abolished expression of all three TFs and downstream targets (e.g., ALDH3A1 and BRD4), which was accompanied by a marked decrease in chromatin accessibility at target enhancers. Moreover, TP63, SOX2, and KLF5 maintained transcription of enhancer RNAs from various TP63 enhancers (e.g., E2, E7, and E8). Strikingly, genomic deletion of these TP63 enhancer elements was sufficient to abolish the expression of all three core TFs and inhibit colony forming ability of ESCC cells. Interestingly, BRD4 was one of the downstream targets of ESCC-specific CRC, as knockdown of any core TF disrupted both transcription and protein expression of BRD4. BET protein degraders (e.g., ARV-771, MZ1, and ARV-825) and HDAC inhibitors (Romidepsin, Panobinostat and JNJ-26481585) were identified as potent anti-ESCC agents. Single agent treatment of Romidepsin degraded TP63 and abolished the expression of all core TFs in a proteasome-dependent manner. Further, combination of Romidepsin and ARV-771 induced substantial chromatin remodeling in ESCC cells, yielding a strong synergistic anti-ESCC effect both in vitro and in vivo.
Interestingly, KLF5 appears as a common core TFs in both EAC and ESCC, albeit the co-operative master TFs are very different. In addition, KLF5 engaged a mutual crosstalk with collaborating factors GATA4 and GATA6 to maintain oncogenic transcriptional regulatory network in gastric cancer [74]. These studies suggest that rewiring connectivity of master TFs is fundamental to disease/subtype identity.
Of note, amplification of SOX2 and TP63 is common in ESCC [75], representing a genomic driver for ESCC-specific CRC. Genomic amplification and somatic SE duplication of KLF5 have also been reported in squamous cell carcinoma, EAC, and gastric cancer [74, 76, 77]. In addition, CCAT1, a SE-driven long noncoding RNA was reported to mediate the binding affinity of SOX2-TP63 complex on target SE (i.e., enhancer of EGFR) [78]. Engagement of noncoding components in CRC may further extend the connectivity and function of TFs.
Implementation of circuitry analysis in cancer research
Based on findings from biologically verified CRCs, core TFs are vital for cell identity and transcriptional homeostasis under both physiological and pathological conditions. Many other studies have also implemented CRC analysis in uncovering critical transcriptional programs in various cancer types (Table 1) [27, 30, 79,80,81,82,83,84]. For instance, by analyzing publically available ChIP-seq data from the ENCODE project [85], Fournier et al. modeled core TF connectivity including ESR1, FOXA1, FOSL2, and JUND in MCF7 breast cancer cells; HNF4A, FOXA1, FOXA2, and CEBPB in HEPG2 liver cancer cells; as well as FOXA1, FOXA2, FOSL2, JUND, and ATF3 in A549 lung carcinoma cells [83]. Depletion of FOXA effectively inhibited the viability and clonogenicity of all three cell lines. With information of active chromatin architecture, computational reconstruction of enhancer/NFR-centric CRCs represents an insightful approach to discover TFs with principal functions in a disease/development/subtype-specific transcriptional regulatory network. TF circuitry analysis implicated SOX9, RFX2, SOX2, and ZBTB16 as generally essential factors for ependymoma cells [30]. Top-scored CRC of chronic lymphocytic leukemia predicted PAX5, ETV6, and IRF2 as centralized regulators [27]. Other examples of enhancer/NFR-centric CRC network construction include gastrointestinal stromal tumors [82, 86], multiple myeloma [44], and keratinocyte stem cells [46]. In addition, dbCoRC serves as a valuable database to explore H3K27ac/enhancer-centric core circuitries in over 230 samples including cervical adenocarcinoma, chronic myelogenous leukemia, colorectal cancer, diffuse large B-cell lymphoma, gastric cancer, small cell lung cancer, pancreatic cancer, and prostate cancer.
Conclusions
The above examples demonstrate the successful application of CRC models to address scientific questions related to cell identity, cancer biology, and therapeutic responses, highlighting the prevalence and importance of CRC in human cancers.
Current paradigm of CRC in human cancers involves four principles: (1) enrolment of master TFs showing high expression and substantial chromatin loading, (2) existence of autoregulation and mutual regulation of core TFs based on either ChIP analysis or motif detection among their regulatory elements, (3) evidence of co-expression of core TFs and transcriptional co-dependencies on each other, and (4) essential function of core TFs and downstream targets in maintaining malignant characteristics of cancer cells.
Computational reconstruction, ChIP-seq, and biologic verification represent the main technical route for CRC studies. Computational algorithms are highly dependent on TF motif analysis. As TFs function in concert with additional co-factors and partners, deviation of actual TF occupancy from motif-based binding sites could compromise the power of computational modeling. Update and refinement of TF motif database warrant more accurate and comprehensive prediction of TF connectivity and network of TFs. Additional efforts can include incorporation of either publically available TF ChIP-seq data or TF ChIP-seq signals from cells of similar identity [87]. In supplement to the enhancer/NFR-centric CRC prediction, integrative analysis of genomic profiling, DNA methylome, chromatin compartmentalization, and enhancer-promoter interactions will extend and clarify regulatory loops. Further improvements in detection and assignment of open chromatin regions to target genes (e.g., 4C-seq) will benefit CRC construction. Since cancer cells are vulnerable to genetic inactivation of core TFs, functional genomics data from genome-wide CRISPR or shRNA loss-of-function viability screening will be useful to interrogate functional importance of candidate core TFs. To this end, the Cancer Dependency Map project (https://depmap.org/portal/), which hosts functional genomics data of over 700 cell lines, serves as a valuable resource to identify TF dependency and cancer vulnerabilities.
Remarkably, both germline polymorphisms and genomic abnormalities can lead to either overexpression or de novo fusion of TFs. These genetically activated TFs are further integrated into the aberrant oncogenic transcriptional programs via CRC formation in collaboration with additional core factors. Driver mutations in signaling pathways can also fuel tumor-promoting CRC via activating their downstream executive TFs. Therefore, CRC operates as a nexus of genomic, signaling, and epigenetic dysregulations in oncogenesis (Fig. 1b). In addition, cancer-associated CRCs often hijack a stem/progenitor cell TF network, which prevents transformed cells from terminal differentiation and in turn conveys information of cell-of-origin. Systematic survey of TF aberrations across human cancer genomes will provide valuable insights for cancer-associated CRC; cancers driven by fusion TFs are fertile grounds of CRC study.
Knowledge of the biologic basis of CRC is growing. CRC frequently enrolls master TFs with pioneer factor activity and/or cancer-specific genomic lesion. Core TFs recruit chromatin/histone modifiers, mediators, and cohesin proteins to reshape epigenetic landscape and establish an oncogenic state [83]. Broad chromatin domain and clustered binding of core TFs at âepicentersâ concentrate further co-factors and co-regulators with intrinsically disordered regions (e.g., MED1 and BRD4) to form phase-separated condensates. After compartmentalization, mediator droplets are able to enrich transcription apparatus. Therefore, these condensates in turn provide high local concentrations of trans-activating TFs, chromatin regulators, and transcription machineries for robust expression of target genes (Fig. 1c). Liquidâliquid phase separation emerges as a biophysical mechanism to form dynamic, multivalent condensates of TFs, mediators, and RNA polymerase II to activate SE-associated transcription, especially the expression of core TFs [34, 36,37,38]. However, regulation and structural basis of the functional dynamics and cooperativity among interacting components within condensates await further exploration. Besides, negative feedback mechanisms that calibrate the strength of feed-forward regulation remain poorly explored.
Therapeutic targeting of histone-acetylation axis and transcriptional apparatus represents an auspicious strategy to disrupt cancer-associated CRC (Table 2). Since BET proteins, CDK7, CDK9, and HDACs, serve as actionable targets in many cancers addicted to CRC, combination of CRC-disrupting agents with either current treatment modalities or novel antitumor compounds may improve therapeutic response. Of interest to note, condensates of RNA polymerase II are sensitive to transcriptional stress, while those of BRD4 and mediators are more resilient [34, 36]. Selective blockage of phase separation or co-inhibition of multiple components in phase-separated condensates may have synergistic effects to break transcriptional circuitry in tumor cells. Although many aspects remain unexplored, plasticity and heterogeneity of CRC have been implicated in human cancers [28, 35]. CRC-guided tumor stratification and therapeutic development are promising fields for future investigation.
References
Lambert SA, Jolma A, Campitelli LF, Das PK, Yin Y, Albu M, et al. The human transcription factors. Cell. 2018;172:650â65.
Neph S, Stergachis AB, Reynolds A, Sandstrom R, Borenstein E, Stamatoyannopoulos JA. Circuitry and dynamics of human transcription factor regulatory networks. Cell. 2012;150:1274â86.
Zhang S, Tian D, Tran NH, Choi KP, Zhang L. Profiling the transcription factor regulatory networks of human cell types. Nucleic Acids Res. 2014;42:12380â7.
Whyte WA, Orlando DA, Hnisz D, Abraham BJ, Lin CY, Kagey MH, et al. Master transcription factors and mediator establish super-enhancers at key cell identity genes. Cell. 2013;153:307â19.
Boyer LA, Lee TI, Cole MF, Johnstone SE, Levine SS, Zucker JP, et al. Core transcriptional regulatory circuitry in human embryonic stem cells. Cell. 2005;122:947â56.
Jaenisch R, Young R. Stem cells, the molecular circuitry of pluripotency and nuclear reprogramming. Cell. 2008;132:567â82.
Cole MF, Young RA. Mapping key features of transcriptional regulatory circuitry in embryonic stem cells. Cold Spring Harb Symp Quant Biol. 2008;73:183â93.
Chen X, Xu H, Yuan P, Fang F, Huss M, Vega VB, et al. Integration of external signaling pathways with the core transcriptional network in embryonic stem cells. Cell. 2008;133:1106â17.
Endoh M, Endo TA, Endoh T, Fujimura Y, Ohara O, Toyoda T, et al. Polycomb group proteins Ring1A/B are functionally linked to the core transcriptional regulatory circuitry to maintain ES cell identity. Development. 2008;135:1513â24.
Li X, Li L, Pandey R, Byun JS, Gardner K, Qin Z, et al. The histone acetyltransferase MOF is a key regulator of the embryonic stem cell core transcriptional network. Cell Stem Cell. 2012;11:163â78.
Ang YS, Tsai SY, Lee DF, Monk J, Su J, Ratnakumar K, et al. Wdr5 mediates self-renewal and reprogramming via the embryonic stem cell core transcriptional network. Cell. 2011;145:183â97.
Pijnappel WW, Esch D, Baltissen MP, Wu G, Mischerikow N, Bergsma AJ, et al. A central role for TFIID in the pluripotent transcription circuitry. Nature. 2013;495:516â9.
Xie W, Song C, Young NL, Sperling AS, Xu F, Sridharan R, et al. Histone h3 lysine 56 acetylation is linked to the core transcriptional network in human embryonic stem cells. Mol Cell. 2009;33:417â27.
Marson A, Levine SS, Cole MF, Frampton GM, Brambrink T, Johnstone S, et al. Connecting microRNA genes to the core transcriptional regulatory circuitry of embryonic stem cells. Cell. 2008;134:521â33.
Guttman M, Donaghey J, Carey BW, Garber M, Grenier JK, Munson G, et al. lincRNAs act in the circuitry controlling pluripotency and differentiation. Nature. 2011;477:295â300.
Kunarso G, Chia NY, Jeyakani J, Hwang C, Lu X, Chan YS, et al. Transposable elements have rewired the core regulatory network of human embryonic stem cells. Nat Genet. 2010;42:631â4.
Wontakal SN, Guo X, Smith C, MacCarthy T, Bresnick EH, Bergman A, et al. A core erythroid transcriptional network is repressed by a master regulator of myelo-lymphoid differentiation. Proc Natl Acad Sci USA. 2012;109:3832â7.
Odom DT, Dowell RD, Jacobsen ES, Nekludova L, Rolfe PA, Danford TW, et al. Core transcriptional regulatory circuitry in human hepatocytes. Mol Syst Biol. 2006;2:2006.0017.
Aldiri I, Xu B, Wang L, Chen X, Hiler D, Griffiths L, et al. The dynamic epigenetic landscape of the retina during development, reprogramming, and tumorigenesis. Neuron. 2017;94:550â68.e10.
Lim CY, Tam WL, Zhang J, Ang HS, Jia H, Lipovich L, et al. Sall4 regulates distinct transcription circuitries in different blastocyst-derived stem cell lineages. Cell Stem Cell. 2008;3:543â54.
Sanda T, Lawton LN, Barrasa MI, Fan ZP, Kohlhammer H, Gutierrez A, et al. Core transcriptional regulatory circuit controlled by the TAL1 complex in human T cell acute lymphoblastic leukemia. Cancer Cell. 2012;22:209â21.
Joo MS, Koo JH, Kim TH, Kim YS, Kim SG. LRH1-driven transcription factor circuitry for hepatocyte identity: super-enhancer cistromic analysis. EBioMedicine. 2019;40:488â503.
Saint-Andre V, Federation AJ, Lin CY, Abraham BJ, Reddy J, Lee TI, et al. Models of human core transcriptional regulatory circuitries. Genome Res. 2016;26:385â96.
Garraway LA, Lander ES. Lessons from the cancer genome. Cell. 2013;153:17â37.
Bradner JE, Hnisz D, Young RA. Transcriptional addiction in Cancer. Cell. 2017;168:629â43.
Lin CY, Erkek S, Tong Y, Yin L, Federation AJ, Zapatka M, et al. Active medulloblastoma enhancers reveal subgroup-specific cellular origins. Nature. 2016;530:57â62.
Ott CJ, Federation AJ, Schwartz LS, Kasar S, Klitgaard JL, Lenci R, et al. Enhancer architecture and essential core regulatory circuitry of chronic lymphocytic leukemia. Cancer Cell. 2018;34:982â95.e7.
Boeva V, Louis-Brennetot C, Peltier A, Durand S, Pierre-Eugene C, Raynal V, et al. Heterogeneity of neuroblastoma cell identity defined by transcriptional circuitries. Nat Genet. 2017;49:1408â13.
Gryder BE, Yohe ME, Chou HC, Zhang X, Marques J, Wachtel M, et al. PAX3-FOXO1 establishes myogenic super enhancers and confers BET bromodomain vulnerability. Cancer Discov. 2017;7:884â99.
Mack SC, Pajtler KW, Chavez L, Okonechnikov K, Bertrand KC, Wang X, et al. Therapeutic targeting of ependymoma as informed by oncogenic enhancer profiling. Nature. 2018;553:101â5.
Pott S, Lieb JD. What are super-enhancers? Nat Genet. 2015;47:8â12.
Durbin AD, Zimmerman MW, Dharia NV, Abraham BJ, Iniguez AB, Weichert-Leahey N, et al. Selective gene dependencies in MYCN-amplified neuroblastoma include the core transcriptional regulatory circuitry. Nat Genet. 2018;50:1240â6.
Boulay G, Sandoval GJ, Riggi N, Iyer S, Buisson R, Naigles B, et al. Cancer-specific retargeting of BAF complexes by a prion-like domain. Cell. 2017;171:163â78.e19.
Gryder BE, Pomella S, Sayers C, Wu XS, Song Y, Chiarella AM, et al. Histone hyperacetylation disrupts core gene regulatory architecture in rhabdomyosarcoma. Nat Genet. 2019;51:1714â22.
Yohe ME, Gryder BE, Shern JF, Song YK, Chou HC, Sindiri S, et al. MEK inhibition induces MYOG and remodels super-enhancers in RAS-driven rhabdomyosarcoma. Sci Transl Med. 2018;10:eaan4470.
Cho WK, Spille JH, Hecht M, Lee C, Li C, Grube V, et al. Mediator and RNA polymerase II clusters associate in transcription-dependent condensates. Science. 2018;361:412â5.
Sabari BR, DallâAgnese A, Boija A, Klein IA, Coffey EL, Shrinivas K, et al. Coactivator condensation at super-enhancers links phase separation and gene control. Science. 2018;361:eaar3958.
Chong S, Dugast-Darzacq C, Liu Z, Dong P, Dailey GM, Cattoglio C, et al. Imaging dynamic and selective low-complexity domain interactions that control gene transcription. Science. 2018;361:eaar2555.
Grant CE, Bailey TL, Noble WS. FIMO: scanning for occurrences of a given motif. Bioinformatics. 2011;27:1017â8.
Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, et al. MEME SUITE: tools for motif discovery and searching. Nucleic Acids Res. 2009;37:W202â208.
Huang M, Chen Y, Yang M, Guo A, Xu Y, Xu L, et al. dbCoRC: a database of core transcriptional regulatory circuitries modeled by H3K27ac ChIP-seq signals. Nucleic Acids Res. 2018;46:D71â7.
Wang L, Hiler D, Xu B, AlDiri I, Chen X, Zhou X, et al. Retinal cell type DNA methylation and histone modifications predict reprogramming efficiency and retinogenesis in 3D organoid cultures. Cell Rep. 2018;22:2601â14.
Ramsey SA, Knijnenburg TA, Kennedy KA, Zak DE, Gilchrist M, Gold ES, et al. Genome-wide histone acetylation data improve prediction of mammalian transcription factor binding sites. Bioinformatics. 2010;26:2071â5.
Jin Y, Chen K, De Paepe A, Hellqvist E, Krstic AD, Metang L, et al. Active enhancer and chromatin accessibility landscapes chart the regulatory network of primary multiple myeloma. Blood. 2018;131:2138â50.
Assi SA, Imperato MR, Coleman DJL, Pickin A, Potluri S, Ptasinska A, et al. Subtype-specific regulatory network rewiring in acute myeloid leukemia. Nat Genet. 2019;51:151â62.
Mercado N, Schutzius G, Kolter C, Estoppey D, Bergling S, Roma G, et al. IRF2 is a master regulator of human keratinocyte stem cell fate. Nat Commun. 2019;10:4676.
Girardi T, Vicente C, Cools J, De Keersmaecker K. The genetics and molecular biology of T-ALL. Blood. 2017;129:1113â23.
Sanda T, Leong WZ. TAL1 as a master oncogenic transcription factor in T-cell acute lymphoblastic leukemia. Exp Hematol. 2017;53:7â15.
Mansour MR, Abraham BJ, Anders L, Berezovskaya A, Gutierrez A, Durbin AD, et al. Oncogene regulation. An oncogenic super-enhancer formed through somatic mutation of a noncoding intergenic element. Science. 2014;346:1373â7.
Tan SH, Yam AW, Lawton LN, Wong RW, Young RA, Look AT, et al. TRIB2 reinforces the oncogenic transcriptional program controlled by the TAL1 complex in T-cell acute lymphoblastic leukemia. Leukemia. 2016;30:959â62.
Leong WZ, Tan SH, Ngoc PCT, Amanda S, Yam AWY, Liau WS, et al. ARID5B as a critical downstream target of the TAL1 complex that activates the oncogenic transcriptional program and promotes T-cell leukemogenesis. Genes Dev. 2017;31:2343â60.
Ngoc PCT, Tan SH, Tan TK, Chan MM, Li Z, Yeoh AEJ, et al. Identification of novel lncRNAs regulated by the TAL1 complex in T-cell acute lymphoblastic leukemia. Leukemia. 2018;32:2138â51.
Choi A, Illendula A, Pulikkan JA, Roderick JE, Tesell J, Yu J, et al. RUNX1 is required for oncogenic Myb and Myc enhancer activity in T-cell acute lymphoblastic leukemia. Blood. 2017;130:1722â33.
Kwiatkowski N, Zhang T, Rahl PB, Abraham BJ, Reddy J, Ficarro SB, et al. Targeting transcription regulation in cancer with a covalent CDK7 inhibitor. Nature. 2014;511:616â20.
Olson CM, Jiang B, Erb MA, Liang Y, Doctor ZM, Zhang Z, et al. Pharmacological perturbation of CDK9 using selective CDK9 inhibition or degradation. Nat Chem Biol. 2018;14:163â70.
Winter GE, Mayer A, Buckley DL, Erb MA, Roderick JE, Vittori S, et al. BET bromodomain proteins function as master transcription elongation factors independent of CDK9 recruitment. Mol Cell. 2017;67:5â18.e19.
Northcott PA, Korshunov A, Witt H, Hielscher T, Eberhart CG, Mack S, et al. Medulloblastoma comprises four distinct molecular variants. J Clin Oncol. 2011;29:1408â14.
Garancher A, Lin CY, Morabito M, Richer W, Rocques N, Larcher M, et al. NRL and CRX define photoreceptor identity and reveal subgroup-specific dependencies in medulloblastoma. Cancer Cell. 2018;33:435â49.e6.
Boon K, Eberhart CG, Riggins GJ. Genomic amplification of orthodenticle homologue 2 in medulloblastomas. Cancer Res. 2005;65:703â7.
Adamson DC, Shi Q, Wortham M, Northcott PA, Di C, Duncan CG, et al. OTX2 is critical for the maintenance and progression of Shh-independent medulloblastomas. Cancer Res. 2010;70:181â91.
Boulay G, Awad ME, Riggi N, Archer TC, Iyer S, Boonseng WE, et al. OTX2 activity at distal regulatory elements shapes the chromatin landscape of group 3 medulloblastoma. Cancer Discov. 2017;7:288â301.
Decaesteker B, Denecker G, Van Neste C, Dolman EM, Van Loocke W, Gartlgruber M, et al. TBX2 is a neuroblastoma core regulatory circuitry component enhancing MYCN/FOXM1 reactivation of DREAM targets. Nat Commun. 2018;9:4866.
Wang L, Tan TK, Durbin AD, Zimmerman MW, Abraham BJ, Tan SH, et al. ASCL1 is a MYCN- and LMO1-dependent member of the adrenergic neuroblastoma core regulatory circuitry. Nat Commun. 2019;10:5622.
Oldridge DA, Wood AC, Weichert-Leahey N, Crimmins I, Sussman R, Winter C, et al. Genetic predisposition to neuroblastoma mediated by a LMO1 super-enhancer polymorphism. Nature. 2015;528:418â21.
Skapek SX, Ferrari A, Gupta AA, Lupo PJ, Butler E, Shipley J, et al. Rhabdomyosarcoma. Nat Rev Dis Prim. 2019;5:1.
Driman D, Thorner PS, Greenberg ML, Chilton-MacNeill S, Squire J. MYCN gene amplification in rhabdomyosarcoma. Cancer. 1994;73:2231â7.
Gryder BE, Wu L, Woldemichael GM, Pomella S, Quinn TR, Park PMC, et al. Chemical genomics reveals histone deacetylases are required for core regulatory transcription. Nat Commun. 2019;10:3004.
Chen Y, Xu L, Mayakonda A, Huang ML, Kanojia D, Tan TZ, et al. Bromodomain and extraterminal proteins foster the core transcriptional regulatory programs and confer vulnerability in liposarcoma. Nat Commun. 2019;10:1353.
Cancer Genome Atlas Research Network.Comprehensive and integrated genomic characterization of adult soft tissue sarcomas. Cell. 2017;171:950â65.e28.
Lin DC, Dinh HQ, Xie JJ, Mayakonda A, Silva TC, Jiang YY, et al. Identification of distinct mutational patterns and new driver genes in oesophageal squamous cell carcinomas and adenocarcinomas. Gut. 2018;67:1769â79.
Chen L, Huang M, Plummer J, Pan J, Jiang YY, Yang Q, et al. Master transcription factors form interconnected circuitry and orchestrate transcriptional networks in oesophageal adenocarcinoma. Gut. 2020;69:630â40.
Jiang YY, Jiang Y, Li CQ, Zhang Y, Dakle P, Kaur H, et al. TP63, SOX2, and KLF5 establish a core regulatory circuitry that controls epigenetic and transcription patterns in esophageal squamous cell carcinoma cell lines. Gastroenterology. 2020. In press. https://doi.org/10.1053/j.gastro.2020.06.050.
Pan J, Silva TC, Gull N, Yang Q, Plummer JT, Chen S, et al. Lineage-specific epigenomic and genomic activation of oncogene HNF4A promotes gastrointestinal adenocarcinomas. Cancer Res. 2020;80:2722â36.
Chia NY, Deng N, Das K, Huang D, Hu L, Zhu Y, et al. Regulatory crosstalk between lineage-survival oncogenes KLF5, GATA4 and GATA6 cooperatively promotes gastric cancer development. Gut. 2015;64:707â19.
Lin DC, Hao JJ, Nagata Y, Xu L, Shang L, Meng X, et al. Genomic and molecular characterization of esophageal squamous cell carcinoma. Nat Genet. 2014;46:467â73.
Zhang X, Choi PS, Francis JM, Imielinski M, Watanabe H, Cherniack AD, et al. Identification of focally amplified lineage-specific super-enhancers in human epithelial cancers. Nat Genet. 2016;48:176â82.
Zhang X, Choi PS, Francis JM, Gao GF, Campbell JD, Ramachandran A, et al. Somatic superenhancer duplications and hotspot mutations lead to oncogenic activation of the KLF5 transcription factor. Cancer Discov. 2018;8:108â25.
Jiang Y, Jiang YY, Xie JJ, Mayakonda A, Hazawa M, Chen L, et al. Co-activation of super-enhancer-driven CCAT1 by TP63 and SOX2 promotes squamous cancer progression. Nat Commun. 2018;9:3619.
Bleu M, Gaulis S, Lopes R, Sprouffske K, Apfel V, Holwerda S, et al. PAX8 activates metabolic genes via enhancer elements in Renal Cell Carcinoma. Nat Commun. 2019;10:3739.
van Groningen T, Akogul N, Westerhout EM, Chan A, Hasselt NE, Zwijnenburg DA, et al. A NOTCH feed-forward loop drives reprogramming from adrenergic to mesenchymal state in neuroblastoma. Nat Commun. 2019;10:1530.
Riddick G, Kotliarova S, Rodriguez V, Kim HS, Linkous A, Storaska AJ, et al. A core regulatory circuit in glioblastoma stem cells links MAPK activation to a transcriptional program of neural stem cell identity. Sci Rep. 2017;7:43605.
Ran L, Chen Y, Sher J, Wong EWP, Murphy D, Zhang JQ, et al. FOXF1 defines the core-regulatory circuitry in gastrointestinal stromal tumor. Cancer Discov. 2018;8:234â51.
Fournier M, Bourriquen G, Lamaze FC, Cote MC, Fournier E, Joly-Beauparlant C, et al. FOXA and master transcription factors recruit Mediator and Cohesin to the core transcriptional regulatory circuitry of cancer cells. Sci Rep. 2016;6:34962.
Tsuzuki S, Yasuda T, Kojima S, Kawazu M, Akahane K, Inukai T, et al. Targeting MEF2D-fusion oncogenic transcriptional circuitries in B-cell precursor acute lymphoblastic leukemia. Blood Cancer Discov. 2020. https://doi.org/10.1158/2643-3230.BCD-19-0080.
Consortium EP. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489:57â74.
Hemming ML, Lawlor MA, Zeid R, Lesluyes T, Fletcher JA, Raut CP, et al. Gastrointestinal stromal tumor enhancers support a transcription factor network predictive of clinical outcome. Proc Natl Acad Sci USA. 2018;115:E5746â55.
Gheorghe M, Sandve GK, Khan A, Cheneby J, Ballester B, Mathelier A. A map of direct TF-DNA interactions in the human genome. Nucleic Acids Res. 2019;47:e21.
Acknowledgements
We thank Dr Tuan Zea Tan for kind help with computational analysis and Dr Takaomi Sanda for kind discussion.
Funding
This work is funded by the National Institute of Health (R01-CA200992-03 to HPK; R35CA197628, R01CA137060, R01CA157644, R01CA172558, and R01CA213138 to MM), the Howard Hughes Medical Institute (HHMI-55108547 to MM), National Research Foundation Singapore under its Singapore Translational Research (STaR) Investigator Award (NMRC/STaR/0021/2014 to HPK), the Singapore Ministry of Education Academic Research Fund Tier 2 (MOE2017-T2-1-033 to HPK), the Singapore Ministry of Healthâs National Medical Research Council (NMRC) Centre Grant awarded to National University Cancer Institute of Singapore (NCIS), the National Research Foundation Singapore and the Singapore Ministry of Education under its Research Centres of Excellence initiatives, and is additionally supported by NMRC Open Fund Young Individual Research Grant (MOH-OFYIRG18May-0001 to LX), a Seed Funding Program within the NCIS Centre Grant, a NCIS Yong Siew Yoon Research grant through donations from the Yong Loo Lin Trust, and the Wendy Walk Foundation. MM is also supported by a Wellcome Trust Senior Investigator Award (WT-101880), a Leukemia and Lymphoma Scholar award, the Norman and Sadie Lee Foundation (for Pediatric Cancer), the Falk Trust through a Falk Medical Research Trust Catalyst Award, the Pediatric Cancer Research Foundation, the Cancer Research Institute through a Clinic and Laboratory Integration Program grant, and the California Institute for Regenerative Medicine through DISC2-10061. MM is a Howard Hughes Medical Institute (HHMI) Faculty Scholar.
Author information
Authors and Affiliations
Contributions
Conceptualization, YC, LX, and HPK; visualization, YC, LX, and RY-TL; writingâoriginal draft, YC and LX; writingâreview and editing, YC, LX, MM, and HPK; funding acquisition, LX, MM, and HPK; supervision, MM and HPK.
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisherâs note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the articleâs Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the articleâs Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Chen, Y., Xu, L., Lin, R.YT. et al. Core transcriptional regulatory circuitries in cancer. Oncogene 39, 6633â6646 (2020). https://doi.org/10.1038/s41388-020-01459-w
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41388-020-01459-w