Skip to main content
  • Submit
  • About
    • Editorial Board
    • PNAS Staff
    • FAQ
    • Rights and Permissions
  • Contact
  • Journal Club
  • Subscribe
    • Subscription Rates
    • Subscriptions FAQ
    • Open Access
    • Recommend PNAS to Your Librarian
  • Log in
  • My Cart

Main menu

  • Home
  • Articles
    • Current
    • Latest Articles
    • Special Features
    • Colloquia
    • Collected Articles
    • PNAS Classics
    • Archive
  • Front Matter
  • News
    • For the Press
    • Highlights from Latest Articles
    • PNAS in the News
  • Podcasts
  • Authors
    • Purpose and Scope
    • Editorial and Journal Policies
    • Submission Procedures
    • For Reviewers
    • Author FAQ
  • Submit
  • About
    • Editorial Board
    • PNAS Staff
    • FAQ
    • Rights and Permissions
  • Contact
  • Journal Club
  • Subscribe
    • Subscription Rates
    • Subscriptions FAQ
    • Open Access
    • Recommend PNAS to Your Librarian

User menu

  • Log in
  • My Cart

Search

  • Advanced search
Home
Home

Advanced Search

  • Home
  • Articles
    • Current
    • Latest Articles
    • Special Features
    • Colloquia
    • Collected Articles
    • PNAS Classics
    • Archive
  • Front Matter
  • News
    • For the Press
    • Highlights from Latest Articles
    • PNAS in the News
  • Podcasts
  • Authors
    • Purpose and Scope
    • Editorial and Journal Policies
    • Submission Procedures
    • For Reviewers
    • Author FAQ

New Research In

Physical Sciences

Featured Portals

  • Physics
  • Chemistry
  • Sustainability Science

Articles by Topic

  • Applied Mathematics
  • Applied Physical Sciences
  • Astronomy
  • Computer Sciences
  • Earth, Atmospheric, and Planetary Sciences
  • Engineering
  • Environmental Sciences
  • Mathematics
  • Statistics

Social Sciences

Featured Portals

  • Anthropology
  • Sustainability Science

Articles by Topic

  • Economic Sciences
  • Environmental Sciences
  • Political Sciences
  • Psychological and Cognitive Sciences
  • Social Sciences

Biological Sciences

Featured Portals

  • Sustainability Science

Articles by Topic

  • Agricultural Sciences
  • Anthropology
  • Applied Biological Sciences
  • Biochemistry
  • Biophysics and Computational Biology
  • Cell Biology
  • Developmental Biology
  • Ecology
  • Environmental Sciences
  • Evolution
  • Genetics
  • Immunology and Inflammation
  • Medical Sciences
  • Microbiology
  • Neuroscience
  • Pharmacology
  • Physiology
  • Plant Biology
  • Population Biology
  • Psychological and Cognitive Sciences
  • Sustainability Science
  • Systems Biology

Functional analysis of gapped microbial genomes: Amino acid metabolism of Thiobacillus ferrooxidans

Evgeni Selkov, Ross Overbeek, Yakov Kogan, Lien Chu, Veronika Vonstein, David Holmes, Simon Silver, Robert Haselkorn and Michael Fonstein
PNAS March 28, 2000. 97 (7) 3509-3514; https://doi.org/10.1073/pnas.97.7.3509
Evgeni Selkov
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ross Overbeek
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yakov Kogan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lien Chu
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Veronika Vonstein
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
David Holmes
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Simon Silver
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Robert Haselkorn
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Michael Fonstein
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  1. Contributed by Robert Haselkorn

  • Article
  • Authors & Info
  • PDF
Loading

Abstract

A gapped genome sequence of the biomining bacterium Thiobacillus ferrooxidans strain ATCC23270 was assembled from sheared DNA fragments (3.2-times coverage) into 1,912 contigs. A total of 2,712 potential genes (ORFs) were identified in 2.6 Mbp (megabase pairs) of Thiobacillus genomic sequence. Of these genes, 2,159 could be assigned functions by using the WIT-Pro/EMP genome analysis system, most with a high degree of certainty. Nine hundred of the genes have been assigned roles in metabolic pathways, producing an overview of cellular biosynthesis, bioenergetics, and catabolism. Sequence similarities, relative gene positions on the chromosome, and metabolic reconstruction (placement of gene products in metabolic pathways) were all used to aid gene assignments and for development of a functional overview. Amino acid biosynthesis was chosen to demonstrate the analytical capabilities of this approach. Only 10 expected enzymatic activities, of the nearly 150 involved in the biosynthesis of all 20 amino acids, are currently unassigned in the Thiobacillus genome. This result compares favorably with 10 missing genes for amino acid biosynthesis in the complete Escherichia coli genome. Gapped genome analysis can therefore give a decent picture of the central metabolism of a microorganism, equivalent to that of a complete sequence, at significantly lower cost.

  • genome analysis
  • metabolic reconstruction
  • gapped genomes

Starting with Haemophilus influenzae in 1995 (1), the total genomic DNA of >20 microbial organisms has been sequenced (http://www.tigr.org/tdb/mdb/mdb.html). Computational tools for genome analysis have been developed along with expanding genome sequence data and merged into shared analytical environments, such as http://WIT.mcs.anl.gov/WIT2/CGI/index.cgi and http://igweb.integratedgenomics.com/IGwit/, GeneQuiz (http://columba.ebi.ac.uk:8765/), Pedant (http://pedant.mips.biochem.mpg.de/), and Entrez Genomes (http://www.ncbi.nlm.nih.gov/Entrez/Genome/org.html). These tools have increased gene recognition rates from <50% (2, 3) to >70% (4, 5). Today, the remaining 30% of “hypothetical” genes separate us from a complete description of the genomic content of an organism. New experimental approaches, including various types of probe arrays (6) and systematic gene knock-outs (7, 8) assist the annotation of these unassigned parts of the genome.

Computational approaches based on various types of clustering of potential genes, whether with regard to positions on the chromosome (such as in operons; ref. 9) or clusters of orthologous genes from different organisms (COGs; ref. 10) in what might be called “phylogenetic space,” permit further gene assignments. The next stage of genome analysis is the integration of gene assignments into an overview of metabolism via metabolic reconstruction, the conceptual assembly of functional multi-component systems such as metabolic pathways, transport units, and signal transduction pathways (11). The identification of functional subsystems goes beyond what is achieved solely by similarity based assignment of functions to ORFs. It allows resolution of inconsistencies between different types of analysis and often results in changes from the initial gene function assignments based on similarity scoring (11). By using the WIT-Pro/EMP genome analysis system, a major part of the central metabolism of a microbe can be reconstructed in silico without input of direct experimental data.

Two strategies have been used for bacterial genome sequencing: whole-genome shot-gun (1) and ordered assembly (3, 12). Both of these approaches were intended to produce a highly accurate complete sequence without gaps or ambiguous areas. The complete sequence usually requires an 8- to 10-fold average sequence coverage with extensive effort needed to bridge the final gaps. A major purpose of the present study is to determine whether a gapped genomic sequence, in which 95–98% of the total genome sequence is represented as a set of unlinked contigs, resulting from a lower redundancy of coverage, can provide a useful way to study microbial genomes. Current sets of bioinformatics tools and data from an increasing number of available genomes appear to have shifted the balance from a need for complete genomes to a situation in which gapped genomes are sufficient for many purposes.

The genomic sequence of T. ferrooxidans strain ATCC23270 was used to demonstrate the utility of gapped genome data. T. ferrooxidans is a major component in the consortia of microorganisms used in biomining (13–16) and a contributor to acid mine runoff, which results in pollution near metal and coal mines and other related environments. Microbial biomining has become an established and major part of copper ore extraction worldwide. Recently, microbes have been successfully exploited for the recovery of gold by bioleaching from arsenopyrite ores and also zinc bioleaching. Microbial gold biomining has grown from nothing 10 years ago to perhaps 30% of the world's gold production today (14, 15). T. ferrooxidans generally functions in a consortium of several microbes (14, 16). As the first biomining microbe used for detailed laboratory molecular biology studies (17), T. ferrooxidans was an ideal target microbe for sequencing and analysis (http://igweb.integratedgenomics.com/IGwit/). T. ferrooxidans is a chemoautotroph that gains energy by oxidative phosphorylation, nitrogen from N2 in the air, and carbon exclusively from CO2 fixation via ribulose-1,5-bisphosphate carboxylase and a classic Calvin cycle (14, 15, 17). It derives energy from oxidation of reduced sulfur to H2SO4 and oxidation of Fe2+ to Fe3+ [which precipitates as insoluble Fe(OH)3]. T. ferrooxidans strains have single chromosomes, which may range in genome size from 2.2 Mbp to 2.9 Mbp (18).

Reconstruction of the biosynthesis of the 20 standard amino acids by T. ferrooxidans was chosen as a test of the ability to predict the properties of an organism based on its gapped-genome sequence analyzed in the WIT environment. Although most genes needed for the enzymes of biosynthesis for all 20 amino acids could be identified, 10 genes are currently missing from the list of 150 genes coding for the metabolic transformations necessary for amino acid biosynthesis from glucose, sulfide, and ammonia. The 150 polypeptides catalyze 114 separate reactions because there are many multi-heterosubunit enzymes. An important question as this work progresses is whether completion of the predicted missing 100 kbp (3.7%) of DNA sequence (sequencing of the same T. ferrooxidans strain has begun at The Institute for Genome Research) will yield these 10 missing genes or whether our inability to recognize them is due to nonorthologous alternative gene products carrying out the same or equivalent reactions (11, 19).

Materials and Methods

Chromosomal DNA Extraction and Cloning.

Chromosomal DNA from a culture of T. ferrooxidans strain ATCC23270 (provided by Robert C. Blake II, Xavier University, New Orleans) was prepared as described (20). DNA (10 mg) were digested by using Sau3A endonuclease under conditions of partial cleavage. Approximately 40-kbp fragments were size-selected by using sucrose gradient centrifugation and cloned into dephosphorylated cosmid vector Lorist 6 (21) using cosmid arm cloning (22). Hybrid cosmids were individually harvested in 96-well plates and stored at −70°C. Plasmid cloning was done similarly except DNA fragmentation for approximately one-half of the clones was done by using the HydroShear device (GeneMachines, San Carlos, CA). Size-selected 4-kbp DNA fragments were separated by agarose gel electrophoresis and inserted into the dephosphorylated plasmid vector pGEM3. Transformant colonies were tested for the lack of α-complementation of defective β-galactosidase in Escherichia coli strain DH5, individually collected, and stored.

Plasmid DNA Extraction and Sequencing.

Plasmid and cosmid DNA were extracted using the REAL method (Qiagen, Valencia, CA) with modifications. A first round of sequences was generated from the ends of these subclones using specifically designed “forward” and “reverse” primers located close to the cloning site. Sequencing reactions used a Perkin–Elmer (Foster City, CA) Big Dye kit for fluorescent sequencing. Products of the reactions were analyzed by using an Applied Biosystems model 377 automated sequencer. Average read lengths used were 534 bases. After this step, 864 primer-walking sequencing reactions were added. Primers were designed with the help of the PrimerSelect module of the lasergene (DNA Star, Madison WI) program and by an automated primer-selecting module for sequencher (GeneCodes, Ann Arbor, MI) developed by Randal Cox (University of Chicago). Primers were synthesized by Genosys (The Woodlands, TX) and Operon Technologies (Alameda, CA).

Results

Genome Cloning, Sequencing, ORF Identification, and Primary Functional Assignment.

Genomic DNA of T. ferrooxidans strain ATCC23270 was digested and cloned as described above. Then 9,600 plasmid and 960 cosmid clones were individually transferred to 96-well plates and their DNA was extracted and subjected to dye-terminator sequencing by using two insert-flanking oligonucleotide primers. The success rate (measured as the number of sequencing reactions included in the assembly process) was 91% and the average reading length used was 534 bases, which provided 3.2-times coverage for the 2.7-Mbp predicted genome size of T. ferrooxidans (18).

Sequence assembly was performed first using the sequencher 3.0 program (GeneCodes, Ann Arbor, MI); this was substituted by the phred/phrapp system (http://genome.washington.edu/) subsequently. Primary contigs (1,912) were generated by this assembly process. Primers (864) complementary to the ends of the primary contigs were used for two rounds of primer walking, which closed 559 gaps, leaving the final 1,353 contigs. The size distribution of these contigs is shown in Table 1. With 1,353 contigs, there are 1,353 gaps remaining in the expected single circular chromosome of T. ferrooxidans. With an expected 100 kbp of sequence remaining to be completed, by comparing the 2,611 kbp total in Table 1 with the expected total of 2.7 Mbp (ref. 18), we conclude that the average remaining gap is ≈75 bp, consistent with our experience with other gapped sequences, in which the coverage of genes is nearly complete and the gaps surprisingly small (analysis not shown).

View this table:
  • View inline
  • View popup
Table 1

Size distribution of the assembled contigs

A gene-searching program called critica (23) was used to translate the DNA sequences of each contig in all six reading frames and to tentatively identify the initial, most reliable subset of potential genes based on their high similarity to proteins in the nonredundant database. These ORFs were used as a learning set to reveal codon biases in coding sequences, which, together with potential ribosome-binding sequences (24), were then used to detect all potential genes in the T. ferrooxidans DNA sequence.

Potential coding regions recognized in the DNA contigs were subjected to a fasta search against the nonredundant database of assigned genes and loaded into the WIT-Pro system together with the computed tables of best hits. WIT-Pro is a derivative of the WIT-Pro/EMP genome analysis system (available on http://igweb.integratedgenomics.com/IGwit/), which provides an organized set of tools for the characterization of gene structures and functions. One can formulate queries based on the metabolic content predicted for the sequenced organism by looking for specific functions missing from the metabolic pathways or by separating alternative gene functions derived from similarities found for a putative gene. Chromosomal clustering of functionally related genes is another component of the system (9).

An Overview of the T. ferrooxidans Genome.

The 1,353 contigs comprising 2,611,378 bp were assembled for the T. ferrooxidans genome sequence (Table 1). A comparison of genome statistics for T. ferrooxidans, E. coli, and Bacillus subtilis is shown in Table 2. Then, 2,712 potential genes were recognized in the T. ferrooxidans sequence, which gives about the same gene density (1 ORF per 963 bp) as in the completely sequenced E. coli and B. subtilis genomes with 4,289 and 4,083 genes, respectively. However, some of the T. ferrooxidans ORFs are in fact only gene fragments separated by stop codons introduced by sequencing errors or by contig ends. Our study of amino acid biosynthesis found ≈10% of such disrupted ORFs.

View this table:
  • View inline
  • View popup
Table 2

Comparison of the T. ferrooxidans ATCC 23270 genome with those of E. coli K-12 and B. subtilis 168

The 2,159 (80%) of the T. ferrooxidans genes could be given a functional assignment, based on an assigned similar ORF in the data base. This ratio of assigned genes to the total number of potential genes is close to that seen for the complete genomes of E. coli (82%) and B. subtilis (73%). These high values reflect the power of the iterative WIT-Pro analytical process together with the rapidly growing database. Approximately 900 of the assigned genes were connected into functional pathways. The remaining identified genes include those coding for structural proteins, for proteins involved in macromolecular biosynthesis and degradation, and some membrane transporters. The 900 genes coding for metabolic enzymes are mostly for pathways that are well defined. The assigned genes could be connected into 305 functional pathways. A linear pathway is a sequence of biochemical transformations from one metabolic branching point to another. A branch point occurs when a metabolite is produced by or consumed in more than a single biochemical transformation. The 95 steps in the required predicted functions for T. ferrooxidans do not have assigned ORFs, which could encode the corresponding enzyme or structural gene. This number is less than that for E. coli or B. subtilis (Table 2) and about the same relative to genome size. This comparison suggests that most of these missing genes are not likely to result from genome gaps. The largest number of missing genes probably results from nonorthologous gene displacements (19), so that functionally active genes are present but cannot be recognized. Another small group of missing genes marked as “no sequences” in Table 2 encode metabolic functions without associated sequence data in any organism; these represent cases where the biochemical activity has been demonstrated or predicted, but for which no corresponding sequence information is available. These last two classes constitute an unrecoverable hole in the current metabolic reconstruction process.

An overview of amino acid metabolism in T. ferrooxidans is shown in Fig. 1. This view can also be found on our web site, together with connections to each pathway of metabolism (by clicking on each box in Fig. 1) and, through these, to each of the 167 individual ORFs involved in amino acid biosynthesis. Each ORF name is listed along with its corresponding EC number and descriptive name, the DNA sequence, and the translated amino acid sequence. Fig. 1 displays the biosynthesis of all 20 amino acids from glucose, ammonia, CO2, and sulfide. Amino acid metabolism is split into 35 pathways (boxes in Fig. 1), according to the definition given above. Biosynthesis of the 20 amino acids requires 114 biochemical reactions (i.e., EC numbers), catalyzed by 150 polypeptides. Each box in Fig. 1 contains three numbers: to the left are the numbers of ORFs currently assigned to each pathway and available on http://www.integratedgenomics.com/FE; in the middle are the number of steps in the pathway (which can also be seen on the web site, by clicking first on the box and then on “diagram picture” in the upper left of the frame). The 167 ORFs determine 150 distinct polypeptides due to the presence of isozymes and split genes. Approximately 20 of the 167 potential genes recognized in T. ferrooxidans to be involved in amino acid metabolism are fragments of genes split by sequencing errors, which generate false stops. Ten genes are missing in our reconstruction of the amino acid metabolism: EC 2.7.1.11 in glycolysis; EC 1.8.1.4 in pyruvate dehydrogenase; EC 2.7.1.71 in chorismate biosynthesis; EC 2.6.1.57 and 1.3.1.43 in tyrosine biosynthesis; EC 2.7.1.39 in threonine biosynthesis; EC 1.2.1.38 in ornithine biosynthesis; EC 4.3.1.12 and EC 1.5.1.2 for proline biosynthesis from ornithine and from glutamate respectively, and EC 6.3.5.4 for asparagine synthetase.

Figure 1
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 1

Block diagram of amino acid metabolism of T. ferrooxidans. Numbers on top of each pathway box are: total number of related ORFs in the T. ferrooxidans genome/number of steps in the pathway/number of currently missing ORFs. Yellow boxes represent metabolic blocks with one or more missing ORFs. Green boxes represent metabolic blocks with all required ORFs present. Each box, displayed at the IG Internet site, can be selected to show detailed data.

Each box in Fig. 1 can be converted (by selecting the corresponding box displayed on the Internet site) to a more detailed picture such as those shown in Figs. 2 and 3, in which the biosynthesis of tryptophan and histidine are displayed. The metabolic pathway diagrams in Figs. 2A and 3A are also available on the Internet site, as are the EC enzyme assignments, gene functions, and ORF identification numbers. In some cases, the corresponding gene in E. coli (EC) or B. subtilis (BS) is listed in Figs. 2B and 3B. The asterisk on ORF RFE00518* in Fig. 2B has no meaning within the context of this paper. The gene clustering patterns for 22 (Fig. 2C) or 28 additional microbes (Fig. 3C) are not currently linked in these diagrams, but they are available on the WIT/EMP site (http://igweb.integratedgenomics.com/IGwit/). For the example of tryptophan biosynthesis in T. ferrooxidans, the assignments remained the same after a second metabolic reconstruction step. The process of refining assignments is ongoing. However, even some weak initial hits by similarity searches were confirmed both by chromosomal clustering of the genes and by the position of their products in the metabolic pathway. The data in this report, on the http://www.integratedgenomics.com/FE site, and access to the remainder of the T. ferrooxidans data have been made available to T. ferrooxidans researchers. Interested parties are encouraged to contact the corresponding author.

Figure 2
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 2

Tryptophan synthesis in T. ferrooxidans. (A) The substrates and products of different enzymatic reactions together with the EC numbers for the enzymes involved. (B) The EC enzyme numbers, names of the enzymes, and “RFE” ORF assignment number for the gene product in the Thiobacillus database. Any item in blue can be selected on the IG Internet site to reveal deeper information. (C) Functional coupling of genes for tryptophan biosynthesis in different microorganisms. Identical colors indicate genes in corresponding clusters in each organism; the colors are arbitrary. The dashes indicate corresponding assignments not clustered with others in that organism. Note also that the diagram of A and ORF assignments from B can be brought up by way of Fig. 1. DR, Dienococcus radiodurans; CT, cyanobacterium Synechocystis sp.; ST, Streptococcus pyogenes; PN, Streptococcus pneumoniae; BS, Bacillus subtilis; CA, Clostridium acetobutylicum; MT, Mycobacterium tuberculosis; CJ, Campylobacter jejuni; NG, Neisseria gonorrhoeae; NM, Neisseria meningitidis; HP, Helicobacter pylori; RC, Rhodobacter capsulatus; ZM, Zymomonas mobilis; PA, Pseudomonas aeruginosa; FE, Thiobacillus ferrooxidans; EC, Escherichia coli; YP, Yersinia pestis; HI, Haemophilus influenzae; AG, Archaeoglobus fulgidus; TH, Methanobacterium thermoautotrophicum; MJ, Methanococcus jannaschii; PF, Pyrococcus furiosus; and SS, Sulfolobus solfataricus.

Figure 3
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 3

Histidine synthesis in T. ferrooxidans. Legend as in Fig. 2. Additional microorganism data bases used were BA, Buchnera aphidicola; BP, Bordetella pertussis; CL, Cervus elaphus;; KL, Kluyveromyces lactis; LL, Lactococcus lactis; MB, Mycobacterium bovis; MI, Mycobacterium smegmatis; MN, Streptococcus mutans; SA, Staphylococcus aureus; SX, Staphylococcus xylosus; SY, Synechococcus sp.; TY, Salmonella typhimurium; and XA, Xanthomonas campestris.

Discussion

A gapped genome sequence was used to reconstruct some of the metabolic pathways of T. ferrooxidans, with the biosynthesis of the 20 amino acids of proteins shown in detail. The value of gapped genomes, which can be obtained at a fraction of the cost of complete genomes, has been debated. An equivalent discussion is occurring for the human genome sequence and it has recently been recognized that the first 10% of dollars spent can generate ≈90% of the total data (25, 26). The conclusion from these discussions can be different for academic, government, and industrial genome communities because the needs and goals are different. Clearly, there are specific problems, such as the characterization of phylogenetic diversity amongst microbial types, for which a large number of sequenced genomes is critical. However, the cost of complete and error-free genome sequencing is unnecessarily high for some of these applications. The emphasis on complete genome sequences was important at the beginning of these studies 5 years ago because it produced an accurate reference data set, which provides the basis for analysis of gapped genomes today.

The current paper shows that gapped genome sequencing provides an effective approach to microbial genome analysis. It is clear that such sequencing, here with 3.2-times coverage, produces more errors than complete genome sequencing with 8- to 10-fold coverage. The current error rate is estimated to be 1 per 1,000–2,000 base pairs vs. 1 in 10,000 base pairs for complete sequencing. These numbers are based on our calculations, our experience with recently corrected errors, and comparisons of comparable but redundant data for partially sequenced genomes from different groups (analysis not shown). The sequencing errors that produce frameshifts and truncated ORFs are most readily identified by similarity analysis, which includes consideration of predicted product length.

The gapped sequence contains part of almost every ORF, which allows the assignment of functions to almost all ORFs and the accurate reconstruction of the metabolism of the organism. Given the pattern of short intervals between genes in other microbial genomes and in the current T. ferrooxidans sequence, most gaps occur within genes. With only 3.6% of the DNA remaining to be sequenced, one expects <10% of the gaps to be found between genes. Of course, considerations of the distribution of intergenic spacings and of gaps associated with localized cloning and sequencing difficulties make a more precise calculation based on random statistics inappropriate. The inter-gene regions of complete genomes have been subject to quantitatively greater reinterpretation, due to mis-callings of start sites, than have intra-gene sequences (compare refs. 2 and 5).

The functional reconstruction of the metabolism of T. ferrooxidans from its genome sequence provides a foundation for research aimed at the modification of metabolism for more efficient use of T. ferrooxidans in the mining industry. It may be possible to predict experimental approaches to improving cell growth, based on the understanding of central metabolism, which in turn would be based on the reconstruction of metabolism from the genome. Predicted growth rate-limiting metabolic pathways and steps might be altered by transferring mutant genes or increasing gene number, given the possibility of better methods for introducing genes into T. ferrooxidans. Improving cell growth by altering amino acid metabolism, as analyzed in this report, or carbon flow and hence the yield of substances produced via secondary pathways, could alleviate limited growth and bio-leaching associated with wild-type T. ferrooxidans. The data in this report, the Internet site http://www.integratedgenomics.com/FE, and the remainder of this database have already been made available to researchers with interests in T. ferrooxidans.

Footnotes

    • ↵* To whom correspondence and reprint requests should be addressed. E-mail: michael{at}integratedgenomics.com.

    • Data deposition: The sequences reported in this paper have been deposited in the GenBank database (accession nos. AF246306–AF246444). Additional detailed information about these sequences is available at http://www.integratedgenomics.com/FE.

    Abbreviation

    Mbp,
    megabase pairs
    • Accepted December 22, 1999.
    • Copyright © 2000, The National Academy of Sciences

    References

    1. ↵
      1. Fleischmann R D,
      2. Adams M D,
      3. White O,
      4. Clayton R A,
      5. Kirkness E F,
      6. Kerlavage A R,
      7. Bult C J,
      8. Tomb J F,
      9. Dougherty B A,
      10. Merrick J M,
      11. et al.
      (1995) Science 269:496–512, pmid:7542800.
      OpenUrlAbstract/FREE Full Text
    2. ↵
      1. Bult C J,
      2. White O,
      3. Olsen G J,
      4. Zhou L,
      5. Fleischmann R D,
      6. Sutton G G,
      7. Blake J A,
      8. FitzGerald L M,
      9. Clayton R A,
      10. Gocayne J D,
      11. et al.
      (1996) Science 273:1058–1073, pmid:8688087.
      OpenUrlAbstract
    3. ↵
      1. Kaneko T,
      2. Sato S,
      3. Kotani H,
      4. Tanaka A,
      5. Asamizu E,
      6. Nakamura Y,
      7. Miyajima N,
      8. Hirosawa M,
      9. Sugiura M,
      10. Sasamoto S,
      11. et al.
      (1996) DNA Res 3:185–209, pmid:8905238.
      OpenUrlCrossRefPubMed
    4. ↵
      1. Vlcek C,
      2. Paces V,
      3. Maltsev N,
      4. Haselkorn R,
      5. Fonstein M
      (1997) Proc Natl Acad Sci USA 94:9384–9388, pmid:9256491.
      OpenUrlAbstract/FREE Full Text
    5. ↵
      1. Selkov E,
      2. Maltsev N,
      3. Olsen G J,
      4. Overbeek R,
      5. Whitman W B
      (1997) Gene 197:GC11–GC26, pmid:9332394.
      OpenUrlCrossRefPubMed
    6. ↵
      1. Winzeler E A,
      2. Richards D R,
      3. Conway A R,
      4. Goldstein A L,
      5. Kalman S,
      6. McCullough M J,
      7. McCusker J H,
      8. Stevens D A,
      9. Wodicka L,
      10. Lockhart D J,
      11. et al.
      (1998) Science 281:1194–1197, pmid:9712584.
      OpenUrlAbstract/FREE Full Text
    7. ↵
      1. Kumar V,
      2. Fonstein M,
      3. Haselkorn R
      (1996) Nature (London) 381:653–654, pmid:8649508.
      OpenUrlCrossRefPubMed
    8. ↵
      1. Oliver S G
      (1997) Curr Opin Genet Dev 7:405–409, pmid:9229118.
      OpenUrlCrossRefPubMed
    9. ↵
      1. Overbeek R,
      2. Fonstein M,
      3. D'Souza M,
      4. Pusch G D,
      5. Maltsev N
      (1999) Proc Natl Acad Sci USA 96:2896–2901, pmid:10077608.
      OpenUrlAbstract/FREE Full Text
    10. ↵
      1. Tatusov R L,
      2. Koonin E V,
      3. Lipman D J
      (1997) Science 278:631–637, pmid:9381173.
      OpenUrlAbstract/FREE Full Text
    11. ↵
      1. Selkov E,
      2. Basmanova S,
      3. Gaasterland T,
      4. Goryanin I,
      5. Gretchkin Y,
      6. Maltsev N,
      7. Nenashev V,
      8. Overbeek R,
      9. Panyushkina E,
      10. Pronevitch L,
      11. et al.
      (1996) Nucleic Acids Res 24:26–28, pmid:8594593.
      OpenUrlAbstract/FREE Full Text
    12. ↵
      1. Blattner F R,
      2. Plunkett G III,
      3. Bloch C A,
      4. Perna N T,
      5. Burland V,
      6. Riley M,
      7. Collado-Vides J,
      8. Glasner J D,
      9. Rode C K,
      10. Mayhew G F,
      11. et al.
      (1997) Science 277:1453–1474, pmid:9278503.
      OpenUrlAbstract/FREE Full Text
    13. ↵
      1. Rawlings D E,
      2. Silver S
      (1995) Nat Biotechnol 13:773–778.
      OpenUrlCrossRef
    14. ↵
      1. Rawlings D E
      , ed(1997) Biomining: Theory, Microbes and Industrial Processes (Springer, Berlin).
    15. ↵
      1. Martin A M
      1. Holmes D S
      (1998) in Bioconversion of Waste Materials to Industrial Products, ed Martin A M(Blackie Academic and Professional, London), 2nd Ed. pp 517–545.
    16. ↵
      1. Rawlings D E,
      2. Tributsch H,
      3. Hansford G S
      (1999) Microbiology 145:5–13, pmid:10206710.
      OpenUrlFREE Full Text
    17. ↵
      1. Rawlings D E,
      2. Kusano T
      (1994) Microbiol Rev 58:39–55, pmid:8177170.
      OpenUrlAbstract/FREE Full Text
    18. ↵
      1. Amils R,
      2. Irazabal N,
      3. Moreira D,
      4. Abad J P,
      5. Marvin I
      (1998) Biochimie 80:911–921, pmid:9893951.
      OpenUrlPubMed
    19. ↵
      1. Koonin E V,
      2. Mushegian A R,
      3. Bork P
      (1996) Trends Genet 12:334–336, pmid:8855656.
      OpenUrlPubMed
    20. ↵
      1. Sambrook J,
      2. Fritch E F,
      3. Maniatis T
      (1989) Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Lab. Press, Plainview, NY), 2nd Ed..
    21. ↵
      1. Gibson T J,
      2. Rosenthal A,
      3. Waterston R H
      (1987) Gene 53:283–286, pmid:3038694.
      OpenUrlCrossRefPubMed
    22. ↵
      1. Fonstein M,
      2. Zheng S,
      3. Haselkorn R
      (1992) J Bacteriol 174:4070–4077, pmid:1597421.
      OpenUrlAbstract/FREE Full Text
    23. ↵
      1. Badger J H,
      2. Olsen G J
      (1999) Mol Biol Evol 16:512–524, pmid:10331277.
      OpenUrlAbstract
    24. ↵
      1. Shine J,
      2. Dalgarno L
      (1974) Proc Natl Acad Sci USA 71:1342–1346, pmid:4598299.
      OpenUrlAbstract/FREE Full Text
    25. ↵
      1. Venter J C,
      2. Adams M D,
      3. Sutton G G,
      4. Kerlavage A R,
      5. Smith H O,
      6. Hunkapiller M
      (1998) Science 280:1540–1542, pmid:9644018.
      OpenUrlAbstract/FREE Full Text
    26. ↵
      1. Collins F S,
      2. Patrinos A,
      3. Jordan E,
      4. Chakravarti A,
      5. Gesteland R,
      6. Walters L
      (1998) Science 282:682–689, pmid:9784121.
      OpenUrlAbstract/FREE Full Text
    View Abstract
    PreviousNext
    Back to top
    Article Alerts
    Email Article

    Thank you for your interest in spreading the word on PNAS.

    NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

    Enter multiple addresses on separate lines or separate them with commas.
    Functional analysis of gapped microbial genomes: Amino acid metabolism of Thiobacillus ferrooxidans
    (Your Name) has sent you a message from PNAS
    (Your Name) thought you would like to see the PNAS web site.
    Citation Tools
    Functional analysis of gapped microbial genomes: Amino acid metabolism of Thiobacillus ferrooxidans
    Evgeni Selkov, Ross Overbeek, Yakov Kogan, Lien Chu, Veronika Vonstein, David Holmes, Simon Silver, Robert Haselkorn, Michael Fonstein
    Proceedings of the National Academy of Sciences Mar 2000, 97 (7) 3509-3514; DOI: 10.1073/pnas.97.7.3509

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
    Request Permissions
    Share
    Functional analysis of gapped microbial genomes: Amino acid metabolism of Thiobacillus ferrooxidans
    Evgeni Selkov, Ross Overbeek, Yakov Kogan, Lien Chu, Veronika Vonstein, David Holmes, Simon Silver, Robert Haselkorn, Michael Fonstein
    Proceedings of the National Academy of Sciences Mar 2000, 97 (7) 3509-3514; DOI: 10.1073/pnas.97.7.3509
    del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
    • Tweet Widget
    • Facebook Like
    • Mendeley logo Mendeley

    More Articles of This Classification

    Biological Sciences

    • Druggable negative allosteric site of P2X3 receptors
    • Rapid selection of a pyrethroid metabolic enzyme CYP9K1 by operational malaria control activities
    • Inferring genetic connectivity in real populations, exemplified by coastal and oceanic Atlantic cod
    Show more

    Microbiology

    • Origin, antigenicity, and function of a secreted form of ORF2 in hepatitis E virus infection
    • Flavivirus internalization is regulated by a size-dependent endocytic pathway
    • A small-molecule fragment that emulates binding of receptor and broadly neutralizing antibodies to influenza A hemagglutinin
    Show more

    Related Content

    • No related articles found.
    • Scopus
    • PubMed
    • Google Scholar

    Cited by...

    • Navigating Microbiological Food Safety in the Era of Whole-Genome Sequencing
    • Draft Genome Sequence of the Extremely Acidophilic Bacterium Acidithiobacillus caldus ATCC 51756 Reveals Metabolic Versatility in the Genus Acidithiobacillus
    • Genome Analysis of F. nucleatum sub spp vincentii and Its Comparison With the Genome of F. nucleatum ATCC 25586
    • Genome-Scale Reconstruction of the Saccharomyces cerevisiae Metabolic Network
    • The Value of Complete Microbial Genome Sequencing (You Get What You Pay For)
    • Draft Sequencing and Comparative Genomics of Xylella fastidiosa Strains Reveal Novel Biological Insights
    • Whole-genome comparative analysis of three phytopathogenic Xylella fastidiosa strains
    • Genome Sequence and Analysis of the Oral Bacterium Fusobacterium nucleatum Strain ATCC 25586
    • Characterization of the petI and res Operons of Acidithiobacillus ferrooxidans
    • ISAfe1, an ISL3 Family Insertion Sequence from Acidithiobacillus ferrooxidans ATCC 19859
    • Scopus (46)
    • Google Scholar

    Similar Articles

    You May Also be Interested in

    Recent flooding events highlight why flood-risk governance in the United States needs a major overhaul. They also suggest why the necessary refocus on shared responsibility will not be easy.
    Opinion: How to achieve better flood-risk governance in the United States
    Recent flooding events highlight why flood-risk governance in the United States needs a major overhaul. They also suggest why the necessary refocus on shared responsibility will not be easy.
    Image courtesy of Shutterstock/michelmond.
    Bridget Scanlon discusses the use of global hydrologic models for studying changes in water storage worldwide.
    Global hydrologic models and water storage
    Bridget Scanlon discusses the use of global hydrologic models for studying changes in water storage worldwide.
    Listen
    Past PodcastsSubscribe
    PNAS Profile of Dorothy L. Cheney and Robert M. Seyfarth.
    PNAS Profile
    PNAS Profile of Dorothy L. Cheney and Robert M. Seyfarth.
    Researchers estimate the risk of infectious disease transmission on board transcontinental airline flights.
    Infectious disease transmission on airplanes
    Researchers estimate the risk of infectious disease transmission on board transcontinental airline flights.
    Image courtesy of Pixabay/PublicDomainPictures.
    Researchers report early evidence of Maya animal management.
    Early Maya animal rearing and trade
    Researchers report early evidence of Maya animal management.
    Proceedings of the National Academy of Sciences: 115 (16)
    Current Issue

    Submit

    Sign up for Article Alerts

    Jump to section

    • Article
      • Abstract
      • Materials and Methods
      • Results
      • Discussion
      • Footnotes
      • Abbreviation
      • References
    • Authors & Info
    • PDF
    Site Logo
    Powered by HighWire
    • Submit Manuscript
    • Twitter
    • Facebook
    • RSS Feeds
    • Email Alerts

    Articles

    • Current Issue
    • Latest Articles
    • Archive

    PNAS Portals

    • Classics
    • Front Matter
    • Teaching Resources
    • Anthropology
    • Chemistry
    • Physics
    • Sustainability Science

    Information for

    • Authors
    • Reviewers
    • Press

    Feedback    Privacy/Legal

    Copyright © 2018 National Academy of Sciences.