New Research In
Physical Sciences
Social Sciences
Featured Portals
Articles by Topic
Biological Sciences
Featured Portals
Articles by Topic
- Agricultural Sciences
- Anthropology
- Applied Biological Sciences
- Biochemistry
- Biophysics and Computational Biology
- Cell Biology
- Developmental Biology
- Ecology
- Environmental Sciences
- Evolution
- Genetics
- Immunology and Inflammation
- Medical Sciences
- Microbiology
- Neuroscience
- Pharmacology
- Physiology
- Plant Biology
- Population Biology
- Psychological and Cognitive Sciences
- Sustainability Science
- Systems Biology
ORChestrating the human DNA replication program

In humans, more than 6 billion bp of DNA need to be copied accurately and completely every cell division. Given the size of human chromosomes, it would take a pair of bidirectional DNA replication forks, emanating from a single replication origin, more than 40 days to copy just chromosome 1. To duplicate the genome completely within the confines of S-phase, multiple start sites of DNA replication need to be distributed across each of the chromosomes. Understanding how start sites of DNA replication are selected and regulated across a diverse array of cell types has been a major question in the field since the discovery of the double helix. In PNAS, Miotto et al. (1) describe the genome-wide localization of a key replication initiation factor, the origin recognition complex (ORC), and provide insight into the genomic features that define the location and distribution of human replication origins.
ORC is a conserved and essential heterohexameric protein complex that is essential for the initiation of DNA replication (2). In G1 of the cell cycle, ORC, together with Cdt1 and Cdc6, directs the loading of the replicative helicase Mcm2–7 onto DNA to form the prereplicative complex (pre-RC). The assembly of the pre-RC in G1 serves to “license” potential origins for activation in the subsequent S-phase (3). Cyclin-dependent kinase and Dbf4-dependent kinase activities during S-phase result in the recruitment of additional proteins to the pre-RC to form the preinitiation complex, unwinding of DNA, and initiation of DNA synthesis. The transactivating factors required for initiation of DNA replication are conserved from yeast to mammals, and the regulated initiation of DNA replication has recently been reconstituted from a minimal complement of ∼42 proteins (4). Despite an in-depth understanding of the transacting factors required for the initiation of DNA replication, considerably less is known about cis-acting genomic features that define replication origins in the context of mammalian chromosomes.
In Saccharomyces cerevisiae, origins were first identified as short (∼200 bp) autonomously replicating sequences (ARSs) required for the maintenance and propagation of episomal DNA (5). S. cerevisiae ORC binds to a cis-acting sequence element termed the ARS consensus sequence that is necessary, but not sufficient, for origin function. In contrast, ORC purified from higher eukaryotes exhibits little if any preference for binding specific sequences (6, 7). The lack of sequence specificity observed for ORC in higher eukaryotes suggests that additional chromatin features are likely involved in specifying replication origins.
Chromatin immunoprecipitation (ChIP) coupled with genomic microarrays (ChIP-chip) or, more recently, next-generation sequencing allows for the comprehensive identification of genomic loci associated with specific DNA binding factors. In S. cerevisiae, ORC and the Mcm2–7 complex were among the first eukaryotic DNA binding factors to be mapped by ChIP-chip and provided the first genome-wide map of potential replication origins (8). The distribution of ORC was subsequently mapped by ChIP-chip in Drosophila melanogaster (9). However, despite these early successes in characterizing the genome-wide distribution of ORC, the identification of mammalian ORC binding sites remained a challenge. In the 15 years since the advent of genome-wide ChIP, numerous laboratories have attempted to map ORC binding sites precisely in mammalian cell lines with limited success. Anecdotally, the reasons for failure include low correlation among replicates, low signal-to-noise ratio, minimal overlap between different subunits, and little concordance with other genomic approaches for mapping replication initiation events. The failures were initially attributed to the lack of availability of ChIP-grade ORC antibodies; however, attempts to use tagged ORC subunits were also largely ineffective. Alternatively, due to the lack of observed sequence specificity for metazoan ORC, ORC may associate transiently or promiscuously with any accessible sequence in a stochastic manner. In 2013, Dellino et al. (10) overcame many of these challenges by “ChIPing” ORC from low-density chromatin associated with promoters and enhancers. In this partial map of human sequences associated with ORC, they identified ∼8,000 binding sites throughout the human genome that were concordant between replicates. These 8,000 sites were far fewer than the estimated 50,000–80,000 replication origins required to duplicate the genome.
The Struhl group (1) now provides a more complete view of ORC localization throughout the human genome via the identification of nearly 50,000 sequences associated with ORC in K562 cells (Fig. 1). ORC localizes to open chromatin accessible by DNase I and is frequently found associated with promoters and enhancers. Not surprisingly, active chromatin marks, including H3K27Ac, H3K9Ac, and H3K4 methylation, are also enriched. No cis-acting sequence motifs were identified, but ORC binding sites were frequently co-occupied by transcription factors (including E2F, Myc, NF-κB, GATA, and AP-1). Despite the frequent colocalization with transcription factor binding sites, no predictive patterns of colocalization were identified. The chromatin states and joint occupancy with multiple transcription factors are similar to the observed occupancy of Drosophila ORC at highly occupied target regions of the fly genome (11) and are consistent with ORC localizing to accessible chromatin in a promiscuous manner independent of primary sequence. Given the colocalization with various DNA binding factors in these regions, the overall frequency of ORC occupancy across the population for a given locus remains to be determined.
ORC density is a determinant of replication timing and genome stability. Schematic of ORC localization in early- and late-replicating domains. ORC localizes to open and accessible chromatin marked by DNase I accessibility, transcription factor (TF) occupancy, and activating chromatin modifications. Late-replicating domains have decreased ORC density and frequently give rise to chromosomal fragile sites.
The comprehensive identification of ORC binding sites provides a map of potential replication initiation sites throughout the human genome. Prior genome-wide experiments have identified replication origins based on the enrichment of specific DNA structures that arise during replication initiation. The identification of replication bubbles and short nascent strands and the distribution of Okazaki fragments have all been used to localize mammalian replication origins (12). A common theme to emerge is that replication origins are enriched at regulatory elements (promoters, enhancers, and CpG islands) in open and active chromatin. Despite gross similarities in the genomic features associated with origin activity, there has been only limited agreement and concordance between approaches at the nucleotide level. This lack of agreement is also true for the ORC mapping data presented here: Only about 12–15% of the origins mapped by short nascent strands are within 1–2 kb of an ORC binding site (1). Although these discrepancies may be due to technical limitations of either the ChIP (13) or short nascent strand (14) assay, they could also be due to initiation events being uncoupled from the site of ORC binding. For example, there is a significant excess of the Mcm2–7 helicase complex loaded onto DNA relative to ORC (15). Elegant in vitro and in vivo experiments from the Remus laboratory suggest that in S. cerevisiae, the Mcm2–7 complex can translocate along DNA and be displaced by transcription, resulting in de novo initiation sites (16). Similarly, in Drosophila, the genome-wide distribution of the Mcm2–7 complex as cells enter S-phase is distinct from ORC binding sites and shaped by active transcription (17). Thus, ORC may only pinpoint sites of helicase loading and not ultimately where initiation occurs.
The temporal order by which specific DNA sequences are copied during S-phase is referred to as the DNA replication timing program (18). Mammalian genomes are partitioned into discrete early- and late-replicating domains that span hundreds to thousands of kilobases. The time at which a sequence replicates is Miotto et al. describe the genome-wide localization of a key replication initiation factor, the origin recognition complex (ORC), and provide insight into the genomic features that define the location and distribution of human replication origins.
dependent on the density of replication origins and their time of activation during S-phase. The establishment of the replication timing program has been linked to chromatin environment (19), nuclear architecture (20) and the availability of limiting replication initiation factors (21, 22). In PNAS, Miotto et al. (1) provide evidence that ORC density is also a determinant of replication timing. Indeed, instead of simply reporting correlations and the density of ORC binding sites as a function of replication timing, they use mathematical modeling to predict replication timing accurately solely from the density of ORC binding sites and presumed initiation events. They also find a paucity of ORC binding in the proximity of chromosomal fragile sites and deletions frequently occurring in cancer cell lines. Together, these results suggest that the distribution and density of ORC are determinants of genome stability.
The comprehensive identification of ORC binding sites throughout the human genome provides insight into the mechanisms that specify potential replication origins. The sequence-independent localization of ORC to promoters and enhancers marked by active and accessible chromatin in both Drosophila and human cells suggests a conserved chromatin-based mechanism to specify ORC binding sites and potential replication origins in higher eukaryotes. Defining ORC binding sites by chromatin-based mechanisms provides a flexible system to specify the DNA replication program in different cell types, and the frequent colocalization at active promoters may decrease the likelihood of head-to-head collisions between RNA and DNA polymerases (23). The field is beginning to identify the genomic and chromatin features correlated with ORC binding and origin function; however, the precise chromatin-based rules that govern origin selection and activation at specific loci remain an enigma. Recent and rapid advances in CRISPR/Cas9 technologies to edit the genome precisely and modify chromatin structure (24) will hopefully allow for the rigorous testing of models predicted from the observed correlations between chromatin state and the DNA replication program.
Acknowledgments
This research is supported by the NIH (Grant R01-GM104097).
References
- ↵.
- Miotto B,
- Ji Z,
- Stuhl K
- ↵.
- Bell SP,
- Kaguni JM
- ↵.
- Siddiqui K,
- On KF,
- Diffley JFX
- ↵
- ↵.
- Leonard AC,
- Méchali M
- ↵.
- Vashee S, et al.
- ↵.
- Remus D,
- Beall EL,
- Botchan MR
- ↵.
- Wyrick JJ, et al.
- ↵.
- MacAlpine HK,
- Gordân R,
- Powell SK,
- Hartemink AJ,
- MacAlpine DM
- ↵.
- Dellino GI, et al.
- ↵.
- modEncode Consortium,
- et al.
- ↵.
- Hyrien O
- ↵.
- Teytelman L,
- Thurtle DM,
- Rine J,
- van Oudenaarden A
- ↵.
- Foulk MS,
- Urban JM,
- Casella C,
- Gerbi SA
- ↵.
- Edwards MC, et al.
- ↵
- ↵.
- Powell SK, et al.
- ↵
- ↵.
- Birney E, et al., ENCODE Project Consortium, NISC Comparative Sequencing Program, Baylor College of Medicine Human Genome Sequencing Center, Washington University Genome Sequencing Center, Broad Institute, Children’s Hospital Oakland Research Institute
- ↵
- ↵.
- Mantiero D,
- Mackenzie A,
- Donaldson A,
- Zegerman P
- ↵
- ↵
- ↵
Citation Manager Formats
Sign up for Article Alerts
Article Classifications
- Biological Sciences
- Cell Biology
See related content: