Previous Article |
Table of Contents
| Next Article
EVOLUTION
Evolution's cauldron: Duplication, deletion, and rearrangement in the mouse and human genomes



*Center for Biomolecular Science and Engineering and
Howard Hughes Medical Institute, Department of Computer Science, University of California, Santa Cruz, CA 95064; and
Department of Computer Science and Engineering, Pennsylvania State University, University Park, PA 16802
Edited by Michael S. Waterman, University of Southern California, Los Angeles, CA, and approved July 11, 2003 (received for review April 9, 2003)
This study examines genomic duplications, deletions, and rearrangements that have happened at scales ranging from a single base to complete chromosomes by comparing the mouse and human genomes. From whole-genome sequence alignments, 344 large (>100-kb) blocks of conserved synteny are evident, but these are further fragmented by smaller-scale evolutionary events. Excluding transposon insertions, on average in each megabase of genomic alignment we observe two inversions, 17 duplications (five tandem or nearly tandem), seven transpositions, and 200 deletions of 100 bases or more. This includes 160 inversions and 75 duplications or transpositions of length >100 kb. The frequencies of these smaller events are not substantially higher in finished portions in the assembly. Many of the smaller transpositions are processed pseudogenes; we define a "syntenic" subset of the alignments that excludes these and other small-scale transpositions. These alignments provide evidence that
2% of the genes in the human/mouse common ancestor have been deleted or partially deleted in the mouse. There also appears to be slightly less nontransposon-induced genome duplication in the mouse than in the human lineage. Although some of the events we detect are possibly due to misassemblies or missing data in the current genome sequence or to the limitations of our methods, most are likely to represent genuine evolutionary events. To make these observations, we developed new alignment techniques that can handle large gaps in a robust fashion and discriminate between orthologous and paralogous alignments.
comparative genomics | cross-species alignments | synteny | chromosomal inversion | breakpoints
See commentary on page 11188.
To whom correspondence should be addressed. E-mail: kent{at}biology.ucsc.edu.
![]()
CiteULike
Complore
Connotea
Del.icio.us
Digg What's this?
Related Commentary in PNAS:
This article has been cited by other articles in HighWire Press-hosted journals:
![]() |
B. Hooghe, P. Hulpiau, F. van Roy, and P. De Bleser ConTra: a promoter alignment analysis tool for identification of transcription factor binding sites across species Nucleic Acids Res., May 3, 2008; (2008) gkn195v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. A. Glazov, S. McWilliam, W. C. Barris, and B. P. Dalrymple Origin, Evolution, and Biological Role of miRNA Cluster in DLK-DIO3 Genomic Region in Placental Mammals Mol. Biol. Evol., May 1, 2008; 25(5): 939 - 948. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Lu, Y. Fu, S. Kumar, Y. Shen, K. Zeng, A. Xu, R. Carthew, and C.-I Wu Adaptive Evolution of Newly Emerged Micro-RNA Genes in Drosophila Mol. Biol. Evol., May 1, 2008; 25(5): 929 - 938. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Stanke, M. Diekhans, R. Baertsch, and D. Haussler Using native and syntenically mapped cDNA alignments to improve de novo gene finding Bioinformatics, March 1, 2008; 24(5): 637 - 644. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Bao, M. Zhou, and Y. Cui CTCFBSDB: a CTCF-binding site database for characterization of vertebrate genomic insulators Nucleic Acids Res., January 11, 2008; 36(suppl_1): D83 - D87. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Courcelle, Y. Beausse, S. Letort, O. Stahl, R. Fremez, C. Ngom-Bru, J. Gouzy, and T. Faraut Narcisse: a mirror view of conserved syntenies Nucleic Acids Res., January 11, 2008; 36(suppl_1): D485 - D490. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Miller, K. Rosenbloom, R. C. Hardison, M. Hou, J. Taylor, B. Raney, R. Burhans, D. C. King, R. Baertsch, D. Blankenberg, et al. 28-Way vertebrate alignment and conservation track in the UCSC Genome Browser Genome Res., December 1, 2007; 17(12): 1797 - 1808. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. G. Engstrom, S. J. Ho Sui, O. Drivenes, T. S. Becker, and B. Lenhard Genomic regulatory blocks underlie extensive microsynteny conservation in insects Genome Res., December 1, 2007; 17(12): 1898 - 1908. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. S. McBride, J. R. Arguello, and B. C. O'Meara Five Drosophila Genomes Reveal Nonneutral Evolution and the Signature of Host Specialization in the Chemoreceptor Superfamily Genetics, November 1, 2007; 177(3): 1395 - 1416. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. H. Margulies, G. M. Cooper, G. Asimenos, D. J. Thomas, C. N. Dewey, A. Siepel, E. Birney, D. Keefe, A. S. Schwartz, M. Hou, et al. Analyses of deep mammalian sequence alignments and constraint predictions for 1% of the human genome Genome Res., June 1, 2007; 17(6): 760 - 774. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Piriyapongsa, L. Marino-Ramirez, and I. K. Jordan Origin and Evolution of Human microRNAs From Transposable Elements Genetics, June 1, 2007; 176(2): 1323 - 1337. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Kikuta, M. Laplante, P. Navratilova, A. Z. Komisarczuk, P. G. Engstrom, D. Fredman, A. Akalin, M. Caccamo, I. Sealy, K. Howe, et al. Genomic regulatory blocks encompass multiple neighboring genes and maintain conserved synteny in vertebrates Genome Res., May 1, 2007; 17(5): 545 - 555. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. W. Messer and P. F. Arndt The Majority of Recent Short DNA Insertions in the Human Genome Are Tandem Duplications Mol. Biol. Evol., May 1, 2007; 24(5): 1190 - 1197. [Abstract] [Full Text] [PDF] |
||||
![]() |
Rhesus Macaque Genome Sequencing and Analysis Cons, R. A. Gibbs, J. Rogers, M. G. Katze, R. Bumgarner, G. M. Weinstock, E. R. Mardis, K. A. Remington, R. L. Strausberg, J. C. Venter, et al. Evolutionary and Biomedical Insights from the Rhesus Macaque Genome Science, April 13, 2007; 316(5822): 222 - 234. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Harris, J. Rogers, and A. Milosavljevic Human-Specific Changes of Genome Structure Detected by Genomic Triangulation Science, April 13, 2007; 316(5822): 235 - 237. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. M. Kuhn, D. Karolchik, A. S. Zweig, H. Trumbower, D. J. Thomas, A. Thakkapallayil, C. W. Sugnet, M. Stanke, K. E. Smith, A. Siepel, et al. The UCSC genome browser database: update 2007 Nucleic Acids Res., January 12, 2007; 35(suppl_1): D668 - D673. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. J. P. Hubbard, B. L. Aken, K. Beal, B. Ballester, M. Caccamo, Y. Chen, L. Clarke, G. Coates, F. Cunningham, T. Cutts, et al. Ensembl 2007 Nucleic Acids Res., January 12, 2007; 35(suppl_1): D610 - D617. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. J. Thomas, H. Trumbower, A. D. Kern, B. L. Rhead, R. M. Kuhn, D. Haussler, and W. J. Kent Variation resources at UC Santa Cruz Nucleic Acids Res., January 12, 2007; 35(suppl_1): D716 - D720. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Coulombe-Huntington and J. Majewski Characterization of intron loss events in mammals Genome Res., January 1, 2007; 17(1): 23 - 32. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Chaisson, B. J. Raphael, and P. A. Pevzner Microinversions in mammalian evolution PNAS, December 26, 2006; 103(52): 19824 - 19829. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Ma, L. Zhang, B. B. Suh, B. J. Raney, R. C. Burhans, W. J. Kent, M. Blanchette, D. Haussler, and W. Miller Reconstructing contiguous regions of an ancestral genome Genome Res., December 1, 2006; 16(12): 1557 - 1565. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. D. Wilson, J. Cheung, D. W. Martindale, S. W. Scherer, and B. F. Koop Comparative analysis of the paired immunoglobulin-like receptor (PILR) locus in six mammalian genomes: duplication, conversion, and the birth of new genes Physiol Genomics, November 21, 2006; 27(3): 201 - 218. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Soderlund, W. Nelson, A. Shoemaker, and A. Paterson SyMAP: A system for discovering and viewing syntenic regions of FPC maps. Genome Res., September 1, 2006; 16(9): 1159 - 1168. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Kayser, E. J. Vowles, D. Kappei, and W. Amos Microsatellite Length Differences Between Humans and Chimpanzees at Autosomal Loci Are Not Found at Equivalent Haploid Y Chromosomal Loci Genetics, August 1, 2006; 173(4): 2179 - 2186. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Li, X. Duan, H. Jiang, Y. Sun, Y. Tang, Z. Yuan, J. Guo, W. Liang, L. Chen, J. Yin, et al. Genome-Wide Analysis of Basic/Helix-Loop-Helix Transcription Factor Family in Rice and Arabidopsis Plant Physiology, August 1, 2006; 141(4): 1167 - 1184. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Xing, Q. Wang, and C. Lee Evolutionary Divergence of Exon Flanks: A Dissection of Mutability and Selection Genetics, July 1, 2006; 173(3): 1787 - 1791. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. N. Dewey and L. Pachter Evolution at the nucleotide level: the problem of multiple whole-genome alignment. Hum. Mol. Genet., April 15, 2006; 15(suppl_1): R51 - R56. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. J. Vowles and W. Amos Quantifying Ascertainment Bias and Species-Specific Length Differences in Human and Chimpanzee Microsatellites Using Genome Sequences Mol. Biol. Evol., March 1, 2006; 23(3): 598 - 607. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Simons, M. Pheasant, I. V. Makunin, and J. S. Mattick Transposon-free regions in mammalian genomes Genome Res., February 1, 2006; 16(2): 164 - 172. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Elango, J. W. Thomas, NISC Comparative Sequencing Program, and S. V. Yi Variable molecular clocks in hominoids PNAS, January 31, 2006; 103(5): 1370 - 1375. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. SCHATTNER, S. BARBERAN-SOLER, and T. M. LOWE A computational screen for mammalian pseudouridylation guide H/ACA RNAs RNA, January 1, 2006; 12(1): 15 - 25. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. S. Hinrichs, D. Karolchik, R. Baertsch, G. P. Barber, G. Bejerano, H. Clawson, M. Diekhans, T. S. Furey, R. A. Harte, F. Hsu, et al. The UCSC Genome Browser Database: update 2006 Nucleic Acids Res., January 1, 2006; 34(suppl_1): D590 - D598. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Kawaji, T. Kasukawa, S. Fukuda, S. Katayama, C. Kai, J. Kawai, P. Carninci, and Y. Hayashizaki CAGE Basic/Analysis Databases: the CAGE resource for comprehensive promoter analysis Nucleic Acids Res., January 1, 2006; 34(suppl_1): D632 - D636. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Yancopoulos, O. Attie, and R. Friedberg Efficient sorting of genomic permutations by translocation, inversion and block interchange Bioinformatics, August 15, 2005; 21(16): 3340 - 3346. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Flannick and S. Batzoglou Using multiple alignments to improve seeded local alignment algorithms Nucleic Acids Res., August 12, 2005; 33(14): 4563 - 4577. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Siepel, G. Bejerano, J. S. Pedersen, A. S. Hinrichs, M. Hou, K. Rosenbloom, H. Clawson, J. Spieth, L. W. Hillier, S. Richards, et al. Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes Genome Res., August 1, 2005; 15(8): 1034 - 1050. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. A. Glazov, M. Pheasant, E. A. McGraw, G. Bejerano, and J. S. Mattick Ultraconserved elements in insect genomes: A highly conserved intronic sequence implicated in the control of homothorax mRNA splicing Genome Res., June 1, 2005; 15(6): 800 - 808. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. E. Hampson, B. S. Gaut, and P. Baldi Statistical detection of chromosomal homology using shared-gene density alone Bioinformatics, April 15, 2005; 21(8): 1339 - 1348. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Dahary, O. Elroy-Stein, and R. Sorek Naturally occurring antisense: Transcriptional leakage or real overlap? Genome Res., March 1, 2005; 15(3): 364 - 368. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Milosavljevic, R. A. Harris, E. J. Sodergren, A. R. Jackson, K. J. Kalafus, A. Hodgson, A. Cree, W. Dai, M. Csuros, B. Zhu, et al. Pooled genomic indexing of rhesus macaque Genome Res., February 1, 2005; 15(2): 292 - 301. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Bourque, E. M. Zdobnov, P. Bork, P. A. Pevzner, and G. Tesler Comparative architectures of mammalian and chicken genomes reveal highly variable rates of genomic rearrangements across different lineages Genome Res., January 1, 2005; 15(1): 98 - 110. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Ovcharenko, G. G. Loots, B. M. Giardine, M. Hou, J. Ma, R. C. Hardison, L. Stubbs, and W. Miller Mulan: Multiple-sequence local alignment and visualization for studying function and evolution Genome Res., January 1, 2005; 15(1): 184 - 194. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Blanchette, E. D. Green, W. Miller, and D. Haussler Reconstructing large regions of an ancestral mammalian genome in silico Genome Res., December 1, 2004; 14(12): 2412 - 2423. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Valverde-Garduno, B. Guyot, E. Anguita, I. Hamlett, C. Porcher, and P. Vyas Differences in the chromatin structure and cis-element organization of the human and mouse GATA1 loci: implications for cis-element identification Blood, November 15, 2004; 104(10): 3106 - 3116. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Stankiewicz, C. J. Shaw, M. Withers, K. Inoue, and J. R. Lupski Serial segmental duplications during primate evolution result in complex human genome architecture Genome Res., November 1, 2004; 14(11): 2209 - 2220. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Zhao, J. Shetty, L. Hou, A. Delcher, B. Zhu, K. Osoegawa, P. de Jong, W. C. Nierman, R. L. Strausberg, and C. M. Fraser Human, Mouse, and Rat Genome Large-Scale Rearrangements: Stability Versus Speciation Genome Res., October 1, 2004; 14(10a): 1851 - 1860. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Whitsett, C. J. Bachurski, K. C. Barnes, P. A. Bunn Jr., L. M. Case, D. N. Cook, D. Crooks, M. W. Duncan, L. Dwyer-Nield, R. C. Elston, et al. Functional Genomics of Lung Disease Am. J. Respir. Cell Mol. Biol., August 1, 2004; 31(2/S1): S1 - S81. [Full Text] [PDF] |
||||
![]() |
C. Muller, M. Denis, L. Gentzbittel, and T. Faraut The Iccare web server: an attempt to merge sequence and mapping information for plant and animal species Nucleic Acids Res., July 1, 2004; 32(suppl_2): W429 - W434. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Dowton Assessing the Relative Rate of (Mitochondrial) Genomic Change Genetics, June 1, 2004; 167(2): 1027 - 1030. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. I. Su, T. Wiltshire, S. Batalov, H. Lapp, K. A. Ching, D. Block, J. Zhang, R. Soden, M. Hayakawa, G. Kreiman, et al. A gene atlas of the mouse and human protein-encoding transcriptomes PNAS, April 20, 2004; 101(16): 6062 - 6067. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Bourque, P. A. Pevzner, and G. Tesler Reconstructing the Genomic Architecture of Ancestral Mammals: Lessons From Human, Mouse, and Rat Genomes Genome Res., April 1, 2004; 14(4): 507 - 516. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Brudno, A. Poliakov, A. Salamov, G. M. Cooper, A. Sidow, E. M. Rubin, V. Solovyev, S. Batzoglou, and I. Dubchak Automated Whole-Genome Multiple Alignment of Rat, Mouse, and Human Genome Res., April 1, 2004; 14(4): 685 - 692. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Blanchette, W. J. Kent, C. Riemer, L. Elnitski, A. F.A. Smit, K. M. Roskin, R. Baertsch, K. Rosenbloom, H. Clawson, E. D. Green, et al. Aligning Multiple Genomic Sequences With the Threaded Blockset Aligner Genome Res., April 1, 2004; 14(4): 708 - 715. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Karolchik, A. S. Hinrichs, T. S. Furey, K. M. Roskin, C. W. Sugnet, D. Haussler, and W. J. Kent The UCSC Table Browser data retrieval tool Nucleic Acids Res., January 1, 2004; 32(90001): D493 - 496. [Abstract] [Full Text] [PDF] |