Previous Article |
Table of Contents
| Next Article
Microbiology
Extensive mosaic structure revealed by the complete genome sequence of uropathogenic Escherichia coli



||
**





, 

, 





*Department of Medical Microbiology and Immunology,
Laboratory of Genetics, 
Genome Center
of Wisconsin, and 
Animal Health and Biological
Sciences, University of Wisconsin,
Madison, WI 53706; and
Department of Microbiology and Immunology,¶
Division of Infectious Diseases, Department of
Medicine, University of Maryland School of
Medicine, Baltimore, MD
21201
Edited by John J. Mekalanos, Harvard Medical School, Boston, MA, and approved October 24, 2002 (received for review August 30, 2002)
| Abstract |
|---|
|
|
|---|
| Materials and Methods |
|---|
|
|
|---|
Sequence Analysis and Annotation. The genome sequence was annotated in the multiuser, web-based annotation environment called MAGPIE (7). This system used GLIMMER to define ORFs (8). Predicted proteins were searched against the nonredundant database by using BLAST (9). MAGPIE assigned automatic annotations for all ORFs, which then were checked individually and corrected. These formed the basis for the GenBank submission. The island annotations contain unique identifiers in the form CI no., for islands of all sizes. Orthology was inferred when matches for CFT073 genes in either the MG1655 or EDL933 database exceeded 90% identity, alignments included at least 90% of both genes, and the MG1655 and EDL933 genes did not have an equivalent match elsewhere in the CFT073 genome. Genome comparisons were carried out by a modification of the method used to compare EDL933 and MG1655 (4).
| Results |
|---|
|
|
|---|
|
|
|
The CFT073-specific islands contain 2,004 genes, of which only 204 also occur among the EDL933-specific genes. Two-thirds of these island genes shared by EDL933 and CFT073 have unknown functions or are associated with phage or insertion sequence elements (10). The remaining shared genes encode putative iron-uptake systems, a complex set of potential fatty acid biosynthetic enzymes, several adhesins, and phosphotransferase system and ATP-binding cassette (ABC)-type transport systems. CFT073 and EDL933 contain, respectively, 60 and 57 islands >4 kb in length. The locations and sizes of these are shown in Fig. 3. Many island locations are at the same relative backbone position in the two pathogens although the island contents are unrelated. Thirteen CFT073 and 10 EDL933 islands are closely associated with known tRNA genes (nine are at the same tRNA in both genomes). Ten other locations also are occupied by unrelated islands in both strains.
|
Potential for New Niches and Different Pathogenic Mechanisms. The ability to inhabit the different niches during an ascending urinary tract infection and cause particular pathologies at each site resides largely in the island genes specific to uropathogenic E. coli. The CFT073 genome sequence has revealed many possible factors that may contribute to colonization of the urinary tract tissues and the disease. The most important examples are mentioned here.
Surface structures known as fimbriae or pili mediate specificity for and attachment to host cells, an essential event for host colonization. We found genes encoding 12 distinct, putative fimbriae in the genome of CFT073, 10 fimbriae of the chaperoneusher family, and two type IV pili. Two pap operons (pylonephritis-associated pilus) encode P fimbriae with PapGII adhesins (17), located in islands at pheV and pheU. These are specific to uropathogens but are not the sole adhesins in CFT073 that are important for virulence. The foc operon encoding F1C fimbria and a chaperoneusher family operon with two chaperone genes both have been linked to urinary tract infections (18). Several of the chaperoneusher pilus operons are common to CFT073, EDL933, and MG1655, including the yad fimbriae (19) and the type I fimbrial operon, which plays an essential role in the pathogenesis of urinary tract infection (20). Type 1 fimbriae are ubiquitous, but they are not all identical. Also common to all three sequenced strains is a pilus similar to Stf of Salmonella enterica serotypes Typhimurium and Typhi and to the Mrp pilus of Proteus mirabilis, a confirmed urovirulence determinant (21). In CFT073, these proteins are highly divergent from those in MG1655 and EDL933, with amino acid sequence identities ranging from 53% to 81%, suggesting that the selective pressure on the expression of this pilus has varied among E. coli lineages. Four other fimbrial operons are shared by two of the three strains or by S. enterica. These have similarly variable amino acid sequence identity. Presumably, the variable sequences of the shared operons allows for the specificity of each adhesin to its individual target tissue.
Type IV pili are assembled by the type II general secretory pathway. They occur in a wide range of species and frequently are associated with diseases. Genes encoding a putative type IV pilin and tip adhesin were found in CFT073. In all three strains, ppdD and hofBC genes may encode type IV prepilin. Although there is no evidence for its expression in MG1655, PpdD can be incorporated into a type IV pilus in a suitable host (22). Genes encoding the putative secretin, a nucleotide-binding protein required for twitching motility, and other type IV pilin-like proteins also are present in all three strains. The type II general secretory pathway secreton for chitinase (23) is found in CFT073 and K-12 in the backbone region between rpsJ and tufA but is absent from the EDL933 genome, although the large plasmid pO157 carries a functional type II secretion system (24).
FimE and FimB recombinases control expression of the fimbriae encoded by the widespread fim operon in a phase-switch system that involves site-specific inversion of a small, 314-bp DNA element. Five different copies of fimBE-like genes were found in the CFT073 genome. Two copies are associated with the type I fimbrial locus present in the same place as other E. coli genomes. There are two divergently transcribed copies linked to the D-serine deaminase locus near argW and a fifth linked to the osmoregulatory cholineglycine betaine locus, betABIT.
E. coli CFT073 encodes at least seven putative
autotransporters, proteins that export a large passenger-domain
cleavage fragment across the outer membrane via a
-barrel pore
formed by the C terminus of the same protein. The secreted polypeptides
often confer virulence (25). For example, in CFT073, Sat, a serine
protease, elicits cytopathic effects on bladder and kidney epithelial
cells (26). Unique examples in the CFT073 genome are similar to
hemagglutinin or diffuse adherence (AIDA)-like adhesins. One is
a homologue of Pic, a mucinase of enteroaggregative E. coli
and Shigella flexneri, that contains within its sequence on
the opposite strand two ORFs >95% identical to SetA and SetB,
the AB subunit enterotoxin (ShET-1) in Shigella (27).
The well-characterized hemolysin genes (hlyCABD) at the pheV island encode a cytolytic toxin and its secretion apparatus (28). An additional member of the type I RTX-like secretion family, upxBDA, is found in the 100-kb island at aspV. The gene order for this member is atypical when compared with the originally characterized RTX determinants, the B and D secretion genes preceding the A gene. It also lacks a C-like gene that typically encodes a fatty acid modification enzyme. There are no notable UpxA sequence features that indicate that it is a member of one of the known RTX family branches (i.e., pore-forming toxin, protease, or lipase). This finding suggests that this locus encodes a unique class of RTX-like secreted protein.
| Discussion |
|---|
|
|
|---|
For uropathogenic strains of E. coli, island acquisition resulted in the capability to infect the urinary tract and bloodstream and evade host defenses without compromising the ability to harmlessly colonize the intestine. For the different intestinal pathogens, acquired genes promote the colonization of specific regions of the intestine and new modes of interaction with the host tissue that produce clinically distinct variations of gastrointestinal disease. Each type of E. coli possesses combinations of island genes that confer its characteristic lifestyle or disease-causing traits. Hacker and colleagues (11) elaborated the pathogenicity island concept based on the genetic behavior, virulence gene linkage relationships, and location of unique inserts near several tRNA genes. Pathogenicity-associated islands were designated based on the presumption that pathogenic traits are present in all inserts and with the assumption that each unique DNA segment has some unifying physical features and similar genetic history and behavior. Our sequence comparisons show that this is not true even for similar uropathogenic strains that have two islands containing some similar genes inserted at the same tRNA site. Comparisons of CFT073 islands with those of other extraintestinal E. coli isolates indicate that similar virulence genes may come into play, but their linkage relationships and chromosomal locations vary considerably. Our observation provides evidence that extraintestinal E. coli may be oligoclonal despite the apparent linkage relationships of a handful of virulence genes and suggests that the uropathogenic E. coli may be as diverse as the intestinal strains. Recent epidemiological analyses lend support to the proposal that specific subsets of genes are characteristic of each of the E. coli uropathogenic subtypes: cystitis, pyelonephritis, and urosepsis (14, 15).
The presence of three extra homologues of the fimBE-like recombinases suggests that the DNA segment-inversion mechanism of genetic phase variation may operate at regions other than the type I fimbrial adhesin in CFT073. The extent of genotypic differences from other E. coli, on a scale larger than previously known, is not altogether surprising given the complexity in the lifestyle of this pathogen, where it colonizes distinct niches including the intestine, perineum, urethra, bladder, and kidney of humans as well as these sites in other mammals such as dogs (29).
The common core chromosome of the E. coli genomes has been preserved throughout its vertical evolution, with very limited intragenomic rearrangement, resulting in the conserved synteny apparent today. The backbone also provides a large, core set of markers for this group, including genes of nutrient synthesis and others that form the signature of Escherichia physiology. No extensive genome reductions have taken place to take advantage of nutrients available in the intestinal environment, and this presumably has remained true despite millions of years of a commensal lifestyle in animals. The presence of "black holes," i.e., deletions that remove genes detrimental to the uropathogenic lifestyle, is difficult to assess at this time because of the large number of genetic differences already observed, the absence of Shigella spp., and additional E. coli genome sequences needed for comparisons (30). The detection of only a relatively small number of pseudogenes in CFT073 stands in contrast to the numbers that have been recently observed, 204 in S. enterica serovar Typhi and 149 in Y. pestis (31, 32). In this respect, CFT073 parallels the broad host range and varied lifestyle of S. enterica serovar Typhimurium (39 pseudogenes) than the more restricted lifestyles of Typhi or Y. pestis. However, the sheer amount of unique DNA in each E. coli strain that can be explained by the frequent gain and loss of accessory genes suggests that careful reconsideration is due for defining species by a few phenotypic traits and low-resolution mapping. The CFT073 and EDL933 genome sequences enable us to design far more discriminating tools for diagnosis of particular E. coli pathotypes that cause such a wide range of intestinal and extraintestinal diseases.
| Acknowledgements |
|---|
| Footnotes |
|---|
To whom correspondence should be addressed at:
Laboratory of Genetics, University of Wisconsin, 445 Henry Mall,
Madison, WI 53706. E-mail: val{at}genome.wisc.edu.
|| Present address: Department of Ecology and
Evolution, State University of New York, Stony Brook, NY 11794. ![]()
** Present address: Department of Biology, University of California
at San Diego, La Jolla, CA 92093. ![]()

Present address: Department of Biology, University
of Iowa, Iowa City, IA 52242. ![]()
This paper was submitted directly (Track II) to the PNAS office.
Data deposition: The sequence reported in this paper has been deposited in the GenBank database (accession no. AE014075).
| References |
|---|
|
|
|---|
This article has been cited by other articles in HighWire Press-hosted journals:
![]() |
T. E. Erova, V. G. Kosykh, A. A. Fadl, J. Sha, A. J. Horneman, and A. K. Chopra Cold Shock Exoribonuclease R (VacB) Is Involved in Aeromonas hydrophila Pathogenesis J. Bacteriol., May 15, 2008; 190(10): 3467 - 3474. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. N. Simms and H. L. T. Mobley Multiple Genes Repress Motility in Uropathogenic Escherichia coli Constitutively Expressing Type 1 Fimbriae J. Bacteriol., May 15, 2008; 190(10): 3747 - 3756. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. J. Johnson, Y. M. Wannemuehler, and L. K. Nolan Evolution of the iss Gene in Escherichia coli Appl. Envir. Microbiol., April 15, 2008; 74(8): 2360 - 2369. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Bhushan, S. Tchatalbachev, J. Klug, M. Fijak, C. Pineau, T. Chakraborty, and A. Meinhardt Uropathogenic Escherichia coli Block MyD88-Dependent and Activate MyD88-Independent Signaling Pathways in Rat Testicular Cells J. Immunol., April 15, 2008; 180(8): 5537 - 5547. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Li, C. Ewers, C. Laturnus, I. Diehl, K. Alt, J. Dai, E.-M. Antao, K. Schnetz, and L. H. Wieler Characterization of a yjjQ mutant of avian pathogenic Escherichia coli (APEC) Microbiology, April 1, 2008; 154(4): 1082 - 1093. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. G. Toyota, C. L. Berthold, A. Gruez, S. Jonsson, Y. Lindqvist, C. Cambillau, and N. G. J. Richards Differential Substrate Specificity and Kinetic Behavior of Escherichia coli YfdW and Oxalobacter formigenes Formyl Coenzyme A Transferase J. Bacteriol., April 1, 2008; 190(7): 2556 - 2564. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. D. Bentley, C. Corton, S. E. Brown, A. Barron, L. Clark, J. Doggett, B. Harris, D. Ormond, M. A. Quail, G. May, et al. Genome of the Actinomycete Plant Pathogen Clavibacter michiganensis subsp. sepedonicus Suggests Recent Niche Adaptation J. Bacteriol., March 15, 2008; 190(6): 2150 - 2160. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Krzeslak, G. Gerritse, R. van Merkerk, R. H. Cool, and W. J. Quax Lipase Expression in Pseudomonas alcaligenes Is Under the Control of a Two-Component Regulatory System Appl. Envir. Microbiol., March 1, 2008; 74(5): 1402 - 1411. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Berlanda Scorza, F. Doro, M. J. Rodriguez-Ortega, M. Stella, S. Liberatori, A. R. Taddei, L. Serino, D. Gomes Moriel, B. Nesta, M. R. Fontana, et al. Proteomics Characterization of Outer Membrane Vesicles from the Extraintestinal Pathogenic Escherichia coli {Delta}tolR IHE3034 Mutant Mol. Cell. Proteomics, March 1, 2008; 7(3): 473 - 485. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. J. Lee, S. J. W. Busby, L. F. Westblade, and B. T. Chait Affinity Isolation and I-DIRT Mass Spectrometric Analysis of the Escherichia coli O157:H7 Sakai RNA Polymerase Complex J. Bacteriol., February 15, 2008; 190(4): 1284 - 1289. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Bronowski, S. L. Smith, K. Yokota, J. E. Corkill, H. M. Martin, B. J. Campbell, J. M. Rhodes, C. A. Hart, and C. Winstanley A subset of mucosa-associated Escherichia coli isolates from patients with colon cancer, but not Crohn's disease, share pathogenicity islands with urinary pathogenic E. coli Microbiology, February 1, 2008; 154(2): 571 - 583. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Sabri, M. Caza, J. Proulx, M. H. Lymberopoulos, A. Bree, M. Moulin-Schouleur, R. Curtiss III, and C. M. Dozois Contribution of the SitABCD, MntH, and FeoB Metal Transporters to the Virulence of Avian Pathogenic Escherichia coli O78 Strain {chi}7122 Infect. Immun., February 1, 2008; 76(2): 601 - 611. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Zdziarski, C. Svanborg, B. Wullt, J. Hacker, and U. Dobrindt Molecular Basis of Commensalism in the Urinary Tract: Low Virulence or Virulence Attenuation? Infect. Immun., February 1, 2008; 76(2): 695 - 703. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Lindberg, Y. Xia, B. Sonden, M. Goransson, J. Hacker, and B. E. Uhlin Regulatory Interactions among Adhesin Gene Systems of Uropathogenic Escherichia coli Infect. Immun., February 1, 2008; 76(2): 771 - 780. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. S. Vernikos and J. Parkhill Resolving the structural features of genomic islands: A machine learning approach Genome Res., February 1, 2008; 18(2): 331 - 342. [Abstract] [Full Text] [PDF] |
||||
![]() |
S.-P. Nuccio and A. J. Baumler Evolution of the Chaperone/Usher Assembly Pathway: Fimbrial Classification Goes Greek Microbiol. Mol. Biol. Rev., December 1, 2007; 71(4): 551 - 575. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. M. Kendall, D. A. Rasko, and V. Sperandio Global Effects of the Cell-to-Cell Signaling Molecules Autoinducer-2, Autoinducer-3, and Epinephrine in a luxS Mutant of Enterohemorrhagic Escherichia coli Infect. Immun., October 1, 2007; 75(10): 4875 - 4884. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Moulin-Schouleur, M. Reperant, S. Laurent, A. Bree, S. Mignon-Grasteau, P. Germon, D. Rasschaert, and C. Schouler Extraintestinal Pathogenic Escherichia coli Strains of Avian and Human Origin: Link between Phylogenetic Relationships and Common Virulence Patterns J. Clin. Microbiol., October 1, 2007; 45(10): 3366 - 3376. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. M. Bergholz, C. L. Tarr, L. M. Christensen, D. J. Betting, and T. S. Whittam Recent Gene Conversions between Duplicated Glutamate Decarboxylase Genes (gadA and gadB) in Pathogenic Escherichia coli Mol. Biol. Evol., October 1, 2007; 24(10): 2323 - 2333. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. A. Hutchison III DNA sequencing: bench to bedside and beyond Nucleic Acids Res., September 25, 2007; 35(18): 6227 - 6237. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Guo, M. Flores, L. Morales, D. Garcia, P. Bustos, V. Gonzalez, R. Palacios, and G. Davila DNA Diversification in Two Sinorhizobium Species J. Bacteriol., September 1, 2007; 189(17): 6474 - 6476. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. R. King, R. P. Vimr, S. M. Steenbergen, L. Spanjaard, G. Plunkett III, F. R. Blattner, and E. R. Vimr Escherichia coli K1-Specific Bacteriophage CUS-3 Distribution and Function in Phase-Variable Capsular Polysialic Acid O Acetylation J. Bacteriol., September 1, 2007; 189(17): 6447 - 6456. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. F. Anjum, M. Mafura, P. Slickers, K. Ballmer, P. Kuhnert, M. J. Woodward, and R. Ehricht Pathotyping Escherichia coli by Using Miniaturized DNA Microarrays Appl. Envir. Microbiol., September 1, 2007; 73(17): 5692 - 5697. [Abstract] [Full Text] [PDF] |
||||