Keystone Symposia 2008 Conference Schedule  Sign up for PNAS Online eTocs
Link: Info for AuthorsLink: Editorial BoardLink: AboutLink: SubscribeLink: AdvertiseLink: ContactLink: Sitemap Link: PNAS Home
Proceedings of the National Academy of Sciences
Link: Current Issue "" Link: Archives "" Link: Online Submission ""  Link: Advanced Search

Published online on January 27, 2003, 10.1073/pnas.0337561100
PNAS | February 4, 2003 | vol. 100 | no. 3 | 1140-1145


This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a colleague
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My File Cabinet
Right arrow Download to citation manager
Right arrow Request Copyright Permission
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via CrossRef
Right arrow Citing Articles via ISI Web of Science (44)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Guigó, R.
Right arrow Articles by Brent, M. R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Guigó, R.
Right arrow Articles by Brent, M. R.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg  
What's this?

 Previous Article  | Table of Contents |  Next Article 

Genetics
Comparison of mouse and human genomes followed by experimental verification yields an estimated 1,019 additional genes

Roderic Guigó*,dagger , Emmanouil T. Dermitzakisdagger ,Dagger , Pankaj Agarwal§, Chris P. Ponting, Genís Parra*, Alexandre ReymondDagger , Josep F. Abril*, Evan Keibler||, Robert LyleDagger , Catherine UclaDagger , Stylianos E. AntonarakisDagger , and Michael R. Brent||,**

* Research Group in Biomedical Informatics, Institut Municipal d'Investigació Mèdica/Universitat Pompeu Fabra/Centre de Regulació Genòmica, E08003 Barcelona, Catalonia, Spain; Dagger  Division of Medical Genetics, University of Geneva Medical School and University Hospitals, 1211 Geneva, Switzerland; § GlaxoSmithKline, UW2230, 709 Swedeland Road, King of Prussia, PA 19406;  Medical Research Council Functional Genetics Unit, Department of Human Anatomy and Genetics, University of Oxford, South Parks Road, Oxford OX1 3QX, United Kingdom; and || Department of Computer Science, Washington University, One Brookings Drive, St. Louis, MO 63130

Communicated by Robert H. Waterston, Washington University School of Medicine, St. Louis, MO, December 11, 2002 (received for review October 21, 2002)

A primary motivation for sequencing the mouse genome was to accelerate the discovery of mammalian genes by using sequence conservation between mouse and human to identify coding exons. Achieving this goal proved challenging because of the large proportion of the mouse and human genomes that is apparently conserved but apparently does not code for protein. We developed a two-stage procedure that exploits the mouse and human genome sequences to produce a set of genes with a much higher rate of experimental verification than previously reported prediction methods. RT-PCR amplification and direct sequencing applied to an initial sample of mouse predictions that do not overlap previously known genes verified the regions flanking one intron in 139 predictions, with verification rates reaching 76%. On average, the confirmed predictions show more restricted expression patterns than the mouse orthologs of known human genes, and two-thirds lack homologs in fish genomes, demonstrating the sensitivity of this dual-genome approach to hard-to-find genes. We verified 112 previously unknown homologs of known proteins, including two homeobox proteins relevant to developmental biology, an aquaporin, and a homolog of dystrophin. We estimate that transcription and splicing can be verified for >1,000 gene predictions identified by this method that do not overlap known genes. This is likely to constitute a significant fraction of the previously unknown, multiexon mammalian genes.


dagger R.G. and E.T.D. contributed equally to this work.

** To whom correspondence should be addressed. E-mail: brent{at}cse.wustl.edu.

www.pnas.org/cgi/doi/10.1073/pnas.0337561100
Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg    What's this?


This article has been cited by other articles in HighWire Press-hosted journals:


Home page
Brief Funct Genomic ProteomicHome page
C. Ansong, S. O. Purvine, J. N. Adkins, M. S. Lipton, and R. D. Smith
Proteogenomics: needs and roles to be filled by proteomics in genome annotation
Brief Funct Genomic Proteomic, March 10, 2008; (2008) eln010v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. J. Fullwood, J. J. S. Tan, P. W. P. Ng, K. P. Chiu, J. Liu, C. L. Wei, and Y. Ruan
The use of multiple displacement amplification to amplify complex DNA libraries
Nucleic Acids Res., March 1, 2008; 36(5): e32 - e32.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
A. Siepel, M. Diekhans, B. Brejova, L. Langton, M. Stevens, C. L.G. Comstock, C. Davis, B. Ewing, S. Oommen, C. Lau, et al.
Targeted discovery of novel human exons by comparative genomics
Genome Res., December 1, 2007; 17(12): 1763 - 1773.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
A. M. Andres, C. de Hemptinne, and J. Bertranpetit
Heterogeneous Rate of Protein Evolution in Serotonin Genes
Mol. Biol. Evol., December 1, 2007; 24(12): 2707 - 2715.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
O. Emanuelsson, U. Nagalakshmi, D. Zheng, J. S. Rozowsky, A. E. Urban, J. Du, Z. Lian, V. Stolc, S. Weissman, M. Snyder, et al.
Assessing the performance of different high-density tiling microarray strategies for mapping transcribed regions of the human genome
Genome Res., June 1, 2007; 17(6): 886 - 897.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
G. Parra, A. Reymond, N. Dabbouseh, E. T. Dermitzakis, R. Castelo, T. M. Thomson, S. E. Antonarakis, and R. Guigo
Tandem chimerism as a means to increase protein complexity in the human genome
Genome Res., January 1, 2006; 16(1): 37 - 44.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
J. L. Guenet
The mouse genome
Genome Res., December 1, 2005; 15(12): 1729 - 1740.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
M. R. Brent
Genome annotation past, present, and future: How to define an ORF at each locus
Genome Res., December 1, 2005; 15(12): 1777 - 1786.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. Castelo, A. Reymond, C. Wyss, F. Camara, G. Parra, S. E. Antonarakis, R. Guigo, and E. Eyras
Comparative gene finding in chicken indicates that we are closing in on the set of multi-exonic widely expressed human genes
Nucleic Acids Res., April 4, 2005; 33(6): 1935 - 1939.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
M. Ayele, B. J. Haas, N. Kumar, H. Wu, Y. Xiao, S. Van Aken, T. R. Utterback, J. R. Wortman, O. R. White, and C. D. Town
Whole genome shotgun sequencing of Brassica oleracea and its application to gene discovery and annotation in Arabidopsis
Genome Res., April 1, 2005; 15(4): 487 - 495.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
C. Wei, P. Lamesch, M. Arumugam, J. Rosenberg, P. Hu, M. Vidal, and M. R. Brent
Closing in on the C. elegans ORFeome by cloning TWINSCAN predictions
Genome Res., April 1, 2005; 15(4): 577 - 582.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
M. Yandell, A. M. Bailey, S. Misra, S. Shu, C. Wiel, M. Evans-Holm, S. E. Celniker, and G. M. Rubin
A computational and experimental approach to validating annotations and gene predictions in the Drosophila melanogaster genome
PNAS, February 1, 2005; 102(5): 1566 - 1571.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
M. Premzl, J. E. Gready, L. S. Jermiin, T. Simonic, and J. A. Marshall Graves
Evolution of Vertebrate Genes Related to Prion and Shadoo Proteins--Clues from Comparative Genomic Analysis
Mol. Biol. Evol., December 1, 2004; 21(12): 2210 - 2231.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
E. T. Dermitzakis, E. Kirkness, S. Schwarz, E. Birney, A. Reymond, and S. E. Antonarakis
Comparison of Human Chromosome 21 Conserved Nongenic Sequences (CNGs) With the Mouse and Dog Genomes Shows That Their Selective Constraint Is Independent of Their Genic Environment
Genome Res., May 1, 2004; 14(5): 852 - 859.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
C. Dewey, J. Q. Wu, S. Cawley, M. Alexandersson, R. Gibbs, and L. Pachter
Accurate Identification of Novel Human Genes Through Simultaneous Gene Prediction in Human, Mouse, and Rat
Genome Res., April 1, 2004; 14(4): 661 - 664.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
J. Q. Wu, D. Shteynberg, M. Arumugam, R. A. Gibbs, and M. R. Brent
Identification of Rat Genes by TWINSCAN Gene Prediction, RT-PCR, and Direct Sequencing
Genome Res., April 1, 2004; 14(4): 665 - 671.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
D. Kampa, J. Cheng, P. Kapranov, M. Yamanaka, S. Brubaker, S. Cawley, J. Drenkow, A. Piccolboni, S. Bekiranov, G. Helt, et al.
Novel RNAs Identified From an In-Depth Analysis of the Transcriptome of Human Chromosomes 21 and 22
Genome Res., March 1, 2004; 14(3): 331 - 342.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
B. M. Porcel, O. Delfour, V. Castelli, V. De Berardinis, L. Friedlander, C. Cruaud, A. Ureta-Vidal, C. Scarpelli, P. Wincker, V. Schachter, et al.
Numerous Novel Annotations of the Human Genome Sequence Supported by a 5'-End-Enriched cDNA Collection
Genome Res., March 1, 2004; 14(3): 463 - 471.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
I. M. Meyer and R. Durbin
Gene structure conservation aids similarity based gene prediction
Nucleic Acids Res., February 4, 2004; 32(2): 776 - 783.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Nekrutenko, W.-Y. Chung, and W.-H. Li
ETOPE: evolutionary test of predicted exons
Nucleic Acids Res., July 1, 2003; 31(13): 3564 - 3567.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
W. H. Majoros, M. Pertea, C. Antonescu, and S. L. Salzberg
GlimmerM, Exonomy and Unveil: three ab initio eukaryotic genefinders
Nucleic Acids Res., July 1, 2003; 31(13): 3601 - 3604.
[Abstract] [Full Text] [PDF]