Vol. 93, Issue 19, 10268-10273, September 17, 1996
National Center for Biotechnology Information, National Library of
Medicine, National Institutes of Health, Bethesda, MD 20894
Communicated by Clyde Hutchison, University of North Carolina,
Chapel Hill, NC, May 17, 1996 (received for review March 11, 1996)
The recently sequenced genome of the parasitic bacterium
Mycoplasma genitalium contains only 468 identified
protein-coding genes that have been dubbed a minimal gene complement
[Fraser, C. M., Gocayne, J. D., White, O., Adams, M. D., Clayton, R. A., et al. (1995) Science 270,
397-403]. Although the M. genitalium gene complement
is indeed the smallest among known cellular life forms, there is no
evidence that it is the minimal self-sufficient gene set. To derive
such a set, we compared the 468 predicted M. genitalium
protein sequences with the 1703 protein sequences encoded by the other
completely sequenced small bacterial genome, that of Haemophilus
influenzae. M. genitalium and H. influenzae belong to two ancient bacterial lineages, i.e., Gram-positive and
Gram-negative bacteria, respectively. Therefore, the genes that are
conserved in these two bacteria are almost certainly essential for
cellular function. It is this category of genes that is most likely to
approximate the minimal gene set. We found that 240 M.
genitalium genes have orthologs among the genes of H.
influenzae. This collection of genes falls short of comprising the
minimal set because some enzymes responsible for intermediate steps in
essential pathways are missing. The apparent reason for this is the
phenomenon that we call nonorthologous gene displacement, in which the same
function is fulfilled by nonorthologous proteins in two organisms. We
identified 22 nonorthologous displacements and supplemented the set of
orthologs with the respective M. genitalium genes. After
examining the resulting list of 262 genes for possible functional
redundancy and for the presence of apparently parasite-specific genes,
we removed 6 genes. We suggest that the remaining 256 genes are close
to the minimal gene set that is necessary and sufficient to sustain the
existence of a modern-type cell. Most of the proteins encoded by the
genes from the minimal set have eukaryotic or archaeal homologs, but
seven key proteins of DNA replication do not. We speculate that the
last common ancestor of the three primary kingdoms had an RNA genome.
We explore possibilities for further reducing the minimal set to model a
primitive cell that might have existed at a very early stage of the
evolution of life.
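
The counting in the derivation above can be checked with a few set operations. The following is only an illustrative sketch: the gene identifiers are placeholders invented for this example (the abstract does not list gene names), the function name `derive_minimal_set` is hypothetical, and which 6 genes are excluded is chosen arbitrarily. Only the counts (240 orthologs, 22 nonorthologous displacements, 6 removals) come from the text.

```python
def derive_minimal_set(orthologs, displacements, excluded):
    """Union the ortholog set with the displacement genes, then drop excluded genes.

    Returns both the intermediate candidate list and the final set.
    """
    candidates = set(orthologs) | set(displacements)
    return candidates, candidates - set(excluded)


# Placeholder identifiers only; the real gene names are not given in the abstract.
orthologs = {f"ortholog_{i:03d}" for i in range(240)}
displacements = {f"displaced_{i:02d}" for i in range(22)}
# Arbitrary stand-ins for the 6 genes removed as redundant or parasite-specific.
excluded = set(sorted(orthologs)[:6])

candidates, minimal_set = derive_minimal_set(orthologs, displacements, excluded)
print(len(candidates), len(minimal_set))  # 262 256
```

The sketch assumes the displacement genes are disjoint from the ortholog set, which is consistent with the totals reported (240 + 22 = 262, and 262 - 6 = 256).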
Evolution
A minimal gene set for cellular life derived by comparison of
complete bacterial genomes
* To whom reprint requests should be addressed at: National Center
for Biotechnology Information, National Library of Medicine, Building
38A, National Institutes of Health, Bethesda, MD 20894. e-mail:
koonin{at}ncbi.nlm.nih.gov.