Folio Bioscience, clinical sample procurement  Sign up for PNAS Online eTocs
Link: Info for AuthorsLink: Editorial BoardLink: AboutLink: SubscribeLink: AdvertiseLink: ContactLink: Sitemap Link: PNAS Home
Proceedings of the National Academy of Sciences
Link: Current Issue "" Link: Archives "" Link: Online Submission ""  Link: Advanced Search

Published online on April 24, 2006, 10.1073/pnas.0601688103 OPEN ACCESS ARTICLE


This Article
Free via Open Access: OA
Right arrow Full Text (PDF)
Right arrow Supporting Information
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a colleague
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My File Cabinet
Right arrow Download to citation manager
Right arrow Request Copyright Permission
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via CrossRef
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Rigoutsos, I.
Right arrow Articles by Platt, D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Rigoutsos, I.
Right arrow Articles by Platt, D.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg  
What's this?

Genetics
Short blocks from the noncoding parts of the human genome have instances within nearly all known genes and relate to biological processes

( junk DNA | pattern discovery | posttranscriptional gene silencing | pyknons | RNA interference )

Isidore Rigoutsos *, Tien Huynh, Kevin Miranda, Aristotelis Tsirigos, Alice McHardy, and Daniel Platt

IBM Thomas J. Watson Research Center, P.O. Box 218, Yorktown Heights, NY 10598

Communicated by Thomas E. Shenk, Princeton University, Princeton, NJ, March 4, 2006 (received for review November 16, 2005)

Using an unsupervised pattern-discovery method, we processed the human intergenic and intronic regions and catalogued all variable-length patterns with identically conserved copies and multiplicities above what is expected by chance. Among the millions of discovered patterns, we found a subset of 127,998 patterns, termed pyknons, which have additional nonoverlapping instances in the untranslated and protein-coding regions of 30,675 transcripts from 20,059 human genes. The pyknons arrange combinatorially in the untranslated and coding regions of numerous human genes where they form mosaics. Consecutive instances of pyknons in these regions show a strong bias in their relative placement, favoring distances of {approx}22 nucleotides. We also found pyknons to be enriched in a statistically significant manner in genes involved in specific processes, e.g., cell communication, transcription, regulation of transcription, signaling, transport, etc. For {approx}1/3 of the pyknons, the intergenic/intronic instances of their reverse complement lie within 380,084 nonoverlapping regions, typically 60-80 nucleotides long, which are predicted to form double-stranded, energetically stable, hairpin-shaped RNA secondary structures; additionally, the pyknons subsume {approx}40% of the known microRNA sequences, thus suggesting a possible link with posttranscriptional gene silencing and RNA interference. Cross-genome comparisons reveal that many of the pyknons have instances in the 3' UTRs of genes from other vertebrates and invertebrates where they are overrepresented in similar biological processes, as in the human genome. These unexpected findings suggest potential unique functional connections between the coding and noncoding parts of the human genome.


Author contributions: I.R. designed research; I.R., T.H., A.T., and A.M. performed research; I.R., T.H., K.M., A.T., A.M., and D.P. analyzed data; and I.R. wrote the paper.

Conflict of interest statement: No conflicts declared.

Freely available online through the PNAS open access option.

*To whom correspondence should be addressed.

Isidore Rigoutsos, E-mail: rigoutso{at}us.ibm.com

www.pnas.org/cgi/doi/10.1073/pnas.0601688103
Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg    What's this?


This article has been cited by other articles in HighWire Press-hosted journals:


Home page
Nucleic Acids ResHome page
A. Tsirigos and I. Rigoutsos
Human and mouse introns are linked to the same processes and functions through each genome's most frequent non-conserved motifs
Nucleic Acids Res., May 1, 2008; (2008) gkn155v1.
[Abstract] [Full Text] [PDF]


Home page
Evid Based Complement Alternat MedHome page
F. Chiappelli and O. S. Cajulis
Transitioning Toward Evidence-Based Research in the Health Sciences for the XXI Century
Evid. Based Complement. Altern. Med., December 4, 2007; (2007) nem123v1.
[Abstract] [Full Text] [PDF]