Sign up for PNAS Online eTocs
Link: Info for AuthorsLink: Editorial BoardLink: AboutLink: SubscribeLink: AdvertiseLink: ContactLink: Sitemap Link: PNAS Home
Proceedings of the National Academy of Sciences
Link: Current Issue "" Link: Archives "" Link: Online Submission ""  Link: Advanced Search

Published online on February 14, 2007, 10.1073/pnas.0611574104

This Article
Right arrow Full Text (PDF)
Right arrow Supporting Information
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a colleague
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My File Cabinet
Right arrow Download to citation manager
Right arrow Request Copyright Permission
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via CrossRef
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Thomas, B. C.
Right arrow Articles by Freeling, M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Thomas, B. C.
Right arrow Articles by Freeling, M.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg  
What's this?

Genetics
Arabidopsis intragenomic conserved noncoding sequence

( gene regulation | small RNA | transcription factor )

Brian C. Thomas *, Lakshmi Rapaka {dagger}, Eric Lyons {dagger}, Brent Pedersen *, and Michael Freeling {dagger}{ddagger}

*College of Natural Resources and {dagger}Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720

Contributed by Michael Freeling, December 28, 2006 (sent for review October 11, 2006)

After the most recent tetraploidy in the Arabidopsis lineage, most gene pairs lost one, but not both, of their duplicates. We manually inspected the 3,179 retained gene pairs and their surrounding gene space still present in the genome using a custom-made viewer application. The display of these pairs allowed us to define intragenic conserved noncoding sequences (CNSs), identify exon annotation errors, and discover potentially new genes. Using a strict algorithm to sort high-scoring pair sequences from the bl2seq data, we created a database of 14,944 intragenomic Arabidopsis CNSs. The mean CNS length is 31 bp, ranging from 15 to 285 bp. There are {approx}1.7 CNSs associated with a typical gene, and Arabidopsis CNSs are found in all areas around exons, most frequently in the 5' upstream region. Gene ontology classifications related to transcription, regulation, or "response to . . ." external or endogenous stimuli, especially hormones, tend to be significantly overrepresented among genes containing a large number of CNSs, whereas protein localization, transport, and metabolism are common among genes with no CNSs. There is a 1.5% overlap between these CNSs and the 218,982 putative RNAs in the Arabidopsis Small RNA Project database, allowing for two mismatches. These CNSs provide a unique set of noncoding sequences enriched for function. CNS function is implied by evolutionary conservation and independently supported because CNS-richness predicts regulatory gene ontology categories.


Author contributions: B.C.T. and M.F. designed research; B.C.T., L.R., B.P., and M.F. performed research; B.C.T., E.L., and B.P. contributed new reagents/analytic tools; B.C.T. and M.F. analyzed data; and B.C.T. wrote the paper.

The authors declare no conflict of interest.

{ddagger}To whom correspondence should be addressed.

Michael Freeling, E-mail: freeling{at}nature.berkeley.edu

www.pnas.org/cgi/doi/10.1073/pnas.0611574104
Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg    What's this?


This article has been cited by other articles in HighWire Press-hosted journals:


Home page
Nucleic Acids ResHome page
J. Duvick, A. Fu, U. Muppirala, M. Sabharwal, M. D. Wilkerson, C. J. Lawrence, C. Lushbough, and V. Brendel
PlantGDB: a resource for comparative plant genomics
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D959 - D965.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
N. Uchida, B. Townsley, K.-H. Chung, and N. Sinha
Regulation of SHOOT MERISTEMLESS genes via an upstream-conserved noncoding sequence coordinates leaf development
PNAS, October 2, 2007; 104(40): 15953 - 15958.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
S. Salvi, G. Sponza, M. Morgante, D. Tomes, X. Niu, K. A. Fengler, R. Meeley, E. V. Ananiev, S. Svitashev, E. Bruggemann, et al.
Conserved noncoding genomic sequences associated with a flowering-time quantitative trait locus in maize
PNAS, July 3, 2007; 104(27): 11376 - 11381.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
M. Freeling, L. Rapaka, E. Lyons, B. Pedersen, and B. C. Thomas
G-Boxes, Bigfoot Genes, and Environmental Response: Characterization of Intragenomic Conserved Noncoding Sequences in Arabidopsis
PLANT CELL, May 1, 2007; 19(5): 1441 - 1457.
[Abstract] [Full Text] [PDF]