( comparative genomics |
conserved noncoding element )
Contributed by Eric S. Lander, March 3, 2007 (sent for review January 26, 2007) Conserved noncoding elements (CNEs) constitute the majority of sequences under purifying selection in the human genome, yet their function remains largely unknown. Experimental evidence suggests that many of these elements play regulatory roles, but little is known about regulatory motifs contained within them. Here we describe a systematic approach to discover and characterize regulatory motifs within mammalian CNEs by searching for long motifs (12-22 nt) with significant enrichment in CNEs and studying their biochemical and genomic properties. Our analysis identifies 233 long motifs (LMs), matching a total of
Genetics
Systematic discovery of regulatory motifs in conserved regions of the human genome, including thousands of CTCF insulator sites
, 
,
,
, 
,
¶||
Broad Institute of MIT and Harvard, Massachusetts Institute of Technology and Harvard Medical School, Cambridge, MA 02142;
Division of Health Sciences and Technology,
Computer Science and Artificial Intelligence Laboratory, and ||Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139; and ¶Whitehead Institute for Biomedical Research, Cambridge, MA 02142
60,000 conserved instances across the human genome. These motifs include 16 previously known regulatory elements, such as the histone 3'-UTR motif and the neuron-restrictive silencer element, as well as striking examples of novel functional elements. The most highly enriched motif (LM1) corresponds to the X-box motif known from yeast and nematode. We show that it is bound by the RFX1 protein and identify thousands of conserved motif instances, suggesting a broad role for the RFX family in gene regulation. A second group of motifs (LM2*) does not match any previously known motif. We demonstrate by biochemical and computational methods that it defines a binding site for the CTCF protein, which is involved in insulator function to limit the spread of gene activation. We identify nearly 15,000 conserved sites that likely serve as insulators, and we show that nearby genes separated by predicted CTCF sites show markedly reduced correlation in gene expression. These sites may thus partition the human genome into domains of expression.
Author contributions: X.X. and E.S.L. designed research; X.X. and E.S.L. performed research; X.X., T.S.M., A.G., K.L.-T., M.K., and E.S.L. contributed new reagents/analytic tools; X.X., T.S.M., A.G., K.L.-T., M.K., and E.S.L. analyzed data; and X.X., T.S.M., A.G., K.L.-T., M.K., and E.S.L. wrote the paper.
The authors declare no conflict of interest.
Freely available online through the PNAS open access option.

To whom correspondence should be addressed.
www.pnas.org/cgi/doi/10.1073/pnas.0701811104
![]()
CiteULike
Complore
Connotea
Del.icio.us
Digg What's this?
This article has been cited by other articles in HighWire Press-hosted journals:
![]() |
A. Williams and R. A. Flavell The role of CTCF in regulating nuclear organization J. Exp. Med., April 14, 2008; 205(4): 747 - 750. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. N. Levasseur, J. Wang, M. O. Dorschner, J. A. Stamatoyannopoulos, and S. H. Orkin Oct4 dependence of chromatin structure within the extended Nanog locus in ES cells Genes & Dev., March 1, 2008; 22(5): 575 - 580. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Bao, M. Zhou, and Y. Cui CTCFBSDB: a CTCF-binding site database for characterization of vertebrate genomic insulators Nucleic Acids Res., January 11, 2008; 36(suppl_1): D83 - D87. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Bryne, E. Valen, M.-H. E. Tang, T. Marstrand, O. Winther, I. da Piedade, A. Krogh, B. Lenhard, and A. Sandelin JASPAR, the open access database of transcription factor-binding profiles: new content and tools in the 2008 update Nucleic Acids Res., January 11, 2008; 36(suppl_1): D102 - D106. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. G. Engstrom, S. J. Ho Sui, O. Drivenes, T. S. Becker, and B. Lenhard Genomic regulatory blocks underlie extensive microsynteny conservation in insects Genome Res., December 1, 2007; 17(12): 1898 - 1908. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Siepel, M. Diekhans, B. Brejova, L. Langton, M. Stevens, C. L.G. Comstock, C. Davis, B. Ewing, S. Oommen, C. Lau, et al. Targeted discovery of novel human exons by comparative genomics Genome Res., December 1, 2007; 17(12): 1763 - 1773. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Renda, I. Baglivo, B. Burgess-Beusse, S. Esposito, R. Fattorusso, G. Felsenfeld, and P. V. Pedone Critical DNA Binding Interactions of the Insulator Protein CTCF: A SMALL NUMBER OF ZINC FINGERS MEDIATE STRONG BINDING, AND A SINGLE FINGER-DNA INTERACTION CONTROLS BINDING AT IMPRINTED LOCI J. Biol. Chem., November 16, 2007; 282(46): 33336 - 33345. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Duan, R. E. Person, H.-H. Lee, S. Huang, J. Donadieu, R. Badolato, H. L. Grimes, T. Papayannopoulou, and M. S. Horwitz Epigenetic Regulation of Protein-Coding and MicroRNA Genes by the Gfi1-Interacting Tumor Suppressor PRDM5 Mol. Cell. Biol., October 1, 2007; 27(19): 6889 - 6902. [Abstract] [Full Text] [PDF] |
||||