A hidden metabolic pathway exposed

  1. Andrei Osterman*
  1. Burnham Institute for Medical Research, La Jolla, CA 92037; and Fellowship for Interpretation of Genomes, Burr Ridge, IL 60527

Before reading the report by Sydney Kustu’s research group published in a recent issue of PNAS (1), I would have seriously doubted that discovery of a novel pathway for pyrimidine catabolism in Escherichia coli K12 was something that was still possible today. After all of the years of exhaustive genetic and biochemical studies, almost a decade past completion of the genome (2), after innumerable annotation and reannotation efforts, metabolic modeling (3), and systematic functional genomics studies (46), a completely novel pathway was found? Although functional roles of many E. coli genes (at least 20%) are still unknown, these remaining y genes are largely perceived as potential off-mainstream players in some obscure areas of metabolism or, even more likely, in nonmetabolic processes. Therefore, a string of eight y genes encoding an entire pathway in the very center of the E. coli metabolic map felt like it was too big a skeleton to be hidden in a closet.

How could not only the specific genes, but the very existence of this pathway have eluded earlier studies? One possible explanation is that “pyrimidine rings can be used as sole N source at room temperature but not at 37°C” (1). The question still remains: why among the many successful hunts for missing genes (7) did all of the powerful bioinformatics techniques fail to spot this missing pathway? A likely explanation (in one of its many versions) is that “to find a black cat in the dark room we should at least know that it is hiding there.” Indeed, early physiological evidence (8), as long as it was not captured by public archives, such as the Kyoto Encyclopedia of Genes and Genomes (KEGG) (www.genome.ad.jp/kegg) or EcoCyc, the Encyclopedia of Escherichia coli K-12 Genes and Metabolism (http://ecocyc.org), remained inaccessible to genome annotators, preventing them from asking the right questions. On the other hand, experts in a field are often unaware of bioinformatics tools that would help them address this question. A tale of a hidden pyrimidine catabolism pathway (1) illustrates the power of combining both worlds. I will return to this topic after a brief discussion of specific implications of this study.

An elaborate machinery of pyrimidine catabolism revealed by Kustu and coworkers (1) is an important addition to the N-utilization network in E. coli. This machinery, a classical operon (now termed rut operon, see Fig. 1 A), is under regulation of a new repressor RutR (former ycdC gene) competing with NtrC, whose role as a global “nitrogen dispatcher” was elucidated by Kustu’s group earlier (9). The main physiological role of this pathway is not obvious, at least to members of that group, who suspect that it may contribute more to global cellular regulation than to mere nutrition (1). Nevertheless, in the best catabolic traditions, the rut operon is equipped with a transporter RutG (former ycdG gene). A final product of this pathway, hydroxypropionate (or 2-methyl-hydroxypropionate) is another novelty, distinguishing it from both “legacy” pathways described in other species and captured in the KEGG map (www.genome.jp/dbget-bin/www_bget?path:rn00240). The distinction from the reductive pathway is also supported by gene comparison, whereas no genetic data are available for the oxidative pathway described in early studies of Corynebacteria and other species (1012).

Fig. 1.

Clustering on the chromosome of the genes involved in the novel pathway (A, rut genes) and in the previously characterized reductive pathway (B, pyd genes) of pyrimidine catabolism. Gene names are as in Table 1. Matching colors (within each panel) correspond to gene orthologs. A unique feature of the pyd operon in P. aeruginosa (but not in other Pseudomonadales) is the presence of genes encoding cytosine permease (codB, a homolog of PydP) and cytosine deaminase (codA), which would feed cytosine to the same utilization pathway.


View this table:
Table 1.

Occurrence of genes involved in pyrimidine catabolism in completely sequenced genomes


Comparative genome analysis allowed Kustu and coworkers (1) to expand the impact of their findings beyond the model system of E. coli, as illustrated in Table 1, a condensed form of a pyrimidine utilization subsystem compiled with the seed platform (http://theseed.uchicago.edu/FIG/index.cgi). The subsystem approach to genome annotation and pathway analysis allows us to capture the existing knowledge of biological processes and reliably project it across the entire collection of diverse species with completely sequenced genomes (13). In addition to establishing which organisms implement one or the other functional variants of a subsystem (e.g., reductive pathway or rut pathway), this approach helps to reveal gaps in knowledge (missing genes) and potential new players (predicted genes).

Genes of the rut pathway are clustered on the chromosome in all species (see Table 1 and Fig. 1 A), providing strong evidence of their functional coupling (14). Regulator RutR and most enzymatic components are conserved, except for putative reductase RutE, a possible subject of nonorthologous gene displacement in several species beyond Enterobacteria. An absence of RutG orthologs in some species may reflect an alternative transport system or an exclusive role for the rut pathway in controlling the internal pyrimidine pool.

The core of the reductive pathway is comprised of three enzymatic steps: (i) dihydropyrimidine dehydrogenase, (ii) dihydropyrimidinase, and (iii) β-ureidopropionase, historically characterized in a context of human metabolic disorders (15). Bacterial genes encoding functional homologs of these enzymes [pydA, pydB, and pydC, characterized in Brevibacillus agri (16)] form a chromosomal cluster conserved in a variety of species (see Fig. 1 B). The most striking observation in the comparison of the two pathways is that the only gene shared by them is the regulator rutR (17), which is conserved and tightly clustered with both operons. Although this conservation may be rationalized by sharing of effectors (pyrimidines), an evolutionary scenario explaining this observation is not obvious. A single copy of rutR in Agrobacterium tumefaciens carrying both operons is clustered with pyd operon (see Fig. 1), which is also the only example of an incomplete reductive pathway. Although the absence of pydA in this organism could be explained by a nonorthologous displacement, the unique coexistence of the two pathways may invoke other explanations.

Transport mechanisms vary between pathways and between groups of species. Two types of transport systems can be inferred for the reductive pathway from chromosomal clustering (see Fig. 1 B). A permease PydP characteristic of β- and γ-proteobacteria (e.g., PA0443 in Pseudomonas aeruginosa) appears to be functionally replaced by an ABC cassette in α-proteobacteria. An additional component of reductive pathway, termed here PydX (e.g., PA0440 in P. aeruginosa) can be inferred based on chromosomal clustering (14) and protein fusion (18) evidence. The latter is provided by an observed homology between PydX and the N-terminal domain of the human dihydropyrimidine dehydrogenase known to supply an electron transfer chain for a catalytic C-terminal domain (19), a homolog of bacterial PydA.

Notwithstanding the importance of specific findings discussed above, an impact of the published study (1) extends beyond this interesting subsystem providing us with a remarkable illustration of the power of combining functional genomics techniques, such as microarray analysis and phenotype screening, with comparative genomics for mapping of completely novel pathways. Analyzing the rut pathway in retrospect, one may notice a number of important clues provided just by bioinformatics: (i) chromosomal clustering and co-occurrence of rut genes indicated that they could belong to the same pathway, (ii) a homology-based annotation of RutG as nucleobase transporter hinted at a possible role of this pathway in utilization of pyrimidines or purines, and (iii) sharing of RutR orthologs between rut and pyd operons pointed to a possible involvement with pyrimidine catabolism. These clues, combined with sparse evidence in the literature, could induce a hypothesis amenable to focused experimental testing. This analysis (no matter how speculative it is with respect to an already accomplished study) is aimed to emphasize the tremendous opportunities provided by comparative analysis of multiple sequenced genomes. A growing number of successful examples (among which the tale of the hidden rut pathway is one of the most striking) and a growing awareness of comparative genomics tools will prompt many research groups to explore these opportunities, thus accelerating the pace of gene and pathway discovery.

Acknowledgments

I thank Dr. Sydney Kustu for her kind permission to include the results for the pyrimidine utilization subsystem into the SEED database (http://theseed.uchicago.edu/FIG/subsys.cgi?ssa_name=Pyrimidine_utilization&request=show_ssa) before their publication. I am grateful to Dr. Veronika Vonstein for discussion and help with this manuscript.

Footnotes

  • E-mail: osterman{at}burnham.org
  • See companion article on page 5114 in issue 13 of volume 103.

  • Author contributions: A.O. wrote the paper.

  • Conflict of interest statement: No conflicts declared.

References

« Previous | Next Article »Table of Contents

Navigate This Article