Previous Article |
Table of Contents
| Next Article
* Research Group in Biomedical Informatics, Institut
Municipal d'Investigació Mèdica/Universitat Pompeu
Fabra/Centre de Regulació Genòmica, E08003
Barcelona, Catalonia, Spain; Communicated by Robert H. Waterston, Washington
University School of Medicine, St. Louis, MO, December 11, 2002 (received for review October 21, 2002)
A primary motivation for sequencing the mouse genome was to
accelerate the discovery of mammalian genes by using sequence conservation between mouse and human to identify coding exons. Achieving this goal proved challenging because of the large
proportion of the mouse and human genomes that is apparently conserved
but apparently does not code for protein. We developed a two-stage procedure that exploits the mouse and human genome sequences to produce
a set of genes with a much higher rate of experimental verification
than previously reported prediction methods. RT-PCR amplification and
direct sequencing applied to an initial sample of mouse predictions
that do not overlap previously known genes verified the regions
flanking one intron in 139 predictions, with verification rates
reaching 76%. On average, the confirmed predictions show more
restricted expression patterns than the mouse orthologs of known human
genes, and two-thirds lack homologs in fish genomes, demonstrating the
sensitivity of this dual-genome approach to hard-to-find genes. We
verified 112 previously unknown homologs of known proteins,
including two homeobox proteins relevant to developmental biology, an
aquaporin, and a homolog of dystrophin. We estimate that transcription
and splicing can be verified for >1,000 gene predictions identified by
this method that do not overlap known genes. This is likely to
constitute a significant fraction of the previously unknown, multiexon
mammalian genes.
Genetics
Comparison of mouse and human genomes followed by experimental
verification yields an estimated 1,019 additional genes
,
,
,
,
,
,
,
, and
,**
Division of
Medical Genetics, University of Geneva Medical School and University
Hospitals, 1211 Geneva, Switzerland;
§ GlaxoSmithKline, UW2230, 709 Swedeland Road, King of
Prussia, PA 19406; ¶ Medical Research Council Functional
Genetics Unit, Department of Human Anatomy and Genetics, University
of Oxford, South Parks Road, Oxford OX1 3QX, United Kingdom; and
Department of Computer Science, Washington
University, One Brookings Drive, St. Louis, MO
63130
R.G. and E.T.D. contributed equally to this
work.
www.pnas.org/cgi/doi/10.1073/pnas.0337561100
![]()
CiteULike
Complore
Connotea
Del.icio.us
Digg What's this?
This article has been cited by other articles in HighWire Press-hosted journals:
![]() |
C. Ansong, S. O. Purvine, J. N. Adkins, M. S. Lipton, and R. D. Smith Proteogenomics: needs and roles to be filled by proteomics in genome annotation Brief Funct Genomic Proteomic, March 10, 2008; (2008) eln010v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Fullwood, J. J. S. Tan, P. W. P. Ng, K. P. Chiu, J. Liu, C. L. Wei, and Y. Ruan The use of multiple displacement amplification to amplify complex DNA libraries Nucleic Acids Res., March 1, 2008; 36(5): e32 - e32. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Siepel, M. Diekhans, B. Brejova, L. Langton, M. Stevens, C. L.G. Comstock, C. Davis, B. Ewing, S. Oommen, C. Lau, et al. Targeted discovery of novel human exons by comparative genomics Genome Res., December 1, 2007; 17(12): 1763 - 1773. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. M. Andres, C. de Hemptinne, and J. Bertranpetit Heterogeneous Rate of Protein Evolution in Serotonin Genes Mol. Biol. Evol., December 1, 2007; 24(12): 2707 - 2715. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Emanuelsson, U. Nagalakshmi, D. Zheng, J. S. Rozowsky, A. E. Urban, J. Du, Z. Lian, V. Stolc, S. Weissman, M. Snyder, et al. Assessing the performance of different high-density tiling microarray strategies for mapping transcribed regions of the human genome Genome Res., June 1, 2007; 17(6): 886 - 897. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Parra, A. Reymond, N. Dabbouseh, E. T. Dermitzakis, R. Castelo, T. M. Thomson, S. E. Antonarakis, and R. Guigo Tandem chimerism as a means to increase protein complexity in the human genome Genome Res., January 1, 2006; 16(1): 37 - 44. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. L. Guenet The mouse genome Genome Res., December 1, 2005; 15(12): 1729 - 1740. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. R. Brent Genome annotation past, present, and future: How to define an ORF at each locus Genome Res., December 1, 2005; 15(12): 1777 - 1786. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Castelo, A. Reymond, C. Wyss, F. Camara, G. Parra, S. E. Antonarakis, R. Guigo, and E. Eyras Comparative gene finding in chicken indicates that we are closing in on the set of multi-exonic widely expressed human genes Nucleic Acids Res., April 4, 2005; 33(6): 1935 - 1939. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Ayele, B. J. Haas, N. Kumar, H. Wu, Y. Xiao, S. Van Aken, T. R. Utterback, J. R. Wortman, O. R. White, and C. D. Town Whole genome shotgun sequencing of Brassica oleracea and its application to gene discovery and annotation in Arabidopsis Genome Res., April 1, 2005; 15(4): 487 - 495. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Wei, P. Lamesch, M. Arumugam, J. Rosenberg, P. Hu, M. Vidal, and M. R. Brent Closing in on the C. elegans ORFeome by cloning TWINSCAN predictions Genome Res., April 1, 2005; 15(4): 577 - 582. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Yandell, A. M. Bailey, S. Misra, S. Shu, C. Wiel, M. Evans-Holm, S. E. Celniker, and G. M. Rubin A computational and experimental approach to validating annotations and gene predictions in the Drosophila melanogaster genome PNAS, February 1, 2005; 102(5): 1566 - 1571. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Premzl, J. E. Gready, L. S. Jermiin, T. Simonic, and J. A. Marshall Graves Evolution of Vertebrate Genes Related to Prion and Shadoo Proteins--Clues from Comparative Genomic Analysis Mol. Biol. Evol., December 1, 2004; 21(12): 2210 - 2231. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. T. Dermitzakis, E. Kirkness, S. Schwarz, E. Birney, A. Reymond, and S. E. Antonarakis Comparison of Human Chromosome 21 Conserved Nongenic Sequences (CNGs) With the Mouse and Dog Genomes Shows That Their Selective Constraint Is Independent of Their Genic Environment Genome Res., May 1, 2004; 14(5): 852 - 859. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Dewey, J. Q. Wu, S. Cawley, M. Alexandersson, R. Gibbs, and L. Pachter Accurate Identification of Novel Human Genes Through Simultaneous Gene Prediction in Human, Mouse, and Rat Genome Res., April 1, 2004; 14(4): 661 - 664. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Q. Wu, D. Shteynberg, M. Arumugam, R. A. Gibbs, and M. R. Brent Identification of Rat Genes by TWINSCAN Gene Prediction, RT-PCR, and Direct Sequencing Genome Res., April 1, 2004; 14(4): 665 - 671. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Kampa, J. Cheng, P. Kapranov, M. Yamanaka, S. Brubaker, S. Cawley, J. Drenkow, A. Piccolboni, S. Bekiranov, G. Helt, et al. Novel RNAs Identified From an In-Depth Analysis of the Transcriptome of Human Chromosomes 21 and 22 Genome Res., March 1, 2004; 14(3): 331 - 342. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. M. Porcel, O. Delfour, V. Castelli, V. De Berardinis, L. Friedlander, C. Cruaud, A. Ureta-Vidal, C. Scarpelli, P. Wincker, V. Schachter, et al. Numerous Novel Annotations of the Human Genome Sequence Supported by a 5'-End-Enriched cDNA Collection Genome Res., March 1, 2004; 14(3): 463 - 471. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. M. Meyer and R. Durbin Gene structure conservation aids similarity based gene prediction Nucleic Acids Res., February 4, 2004; 32(2): 776 - 783. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Nekrutenko, W.-Y. Chung, and W.-H. Li ETOPE: evolutionary test of predicted exons Nucleic Acids Res., July 1, 2003; 31(13): 3564 - 3567. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. H. Majoros, M. Pertea, C. Antonescu, and S. L. Salzberg GlimmerM, Exonomy and Unveil: three ab initio eukaryotic genefinders Nucleic Acids Res., July 1, 2003; 31(13): 3601 - 3604. [Abstract] [Full Text] [PDF] |
||||