Previous Article |
Table of Contents
| Next Article
BIOLOGICAL SCIENCES / MICROBIOLOGY
Genomic analysis of the Mozambique strain of Vibrio cholerae O1 reveals the origin of El Tor strains carrying classical CTX prophage



,¶

*Molecular Genetics Laboratory, International Centre for Diarrhoeal Disease Research, Bangladesh, Dhaka-1212, Bangladesh;
Department of Microbiology, Faculty of Public Health, Mahidol University, Bangkok 73170, Thailand; ¶School of Medicine and Dentistry, University of Rochester, 601 Elmwood Avenue, Box 672, Rochester, NY 14642; ||The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850; **International Vaccine Institute, Kwanak, P.O. Box 14, Seoul 151-600, Korea; and
Department of Microbiology and Molecular Genetics, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115
Contributed by G. Balakrish Nair, January 17, 2007 (received for review January 5, 2007)
| Abstract |
|---|
|
|
|---|
cholera | CTX phage | TLC element | evolution
The seventh pandemic of cholera reached sub-Saharan West Africa in the early 1970s and caused explosive outbreaks with a high case fatality, mainly because of a lack of background immunity in the population and inadequacies in the health care infrastructure (2). In this epidemic, cholera spread along the coast and into the interior through waterways and further disseminated into the interior of the Sahelian states by land travel fostered by nomadic tribes (1, 2). Cholera outbreaks caused by the El Tor biotype of V. cholerae O1 continue to occur frequently in many sub-Saharan African countries. However, cholera surveillance in Beira, the second largest city of Mozambique, in early 2004, revealed the presence of a distinct El Tor strain associated with incidences of cholera (6). In preliminary analyses, this Mozambique strain was found to display most of the typical traits of the El Tor biotype, but, interestingly, the resident CTX prophage in the strain is of the classical type (CTXClass
). In view of rapidly growing genetic diversity among toxigenic V. cholerae strains with epidemic potential, we conducted detailed analyses of the Mozambique strain and its CTX prophage to understand the origin of this strain. Here, we show that the Mozambique strain represents a distinct branch in the evolutionary pathway of pathogenic V. cholerae strains with epidemic potential.
| Results |
|---|
|
|
|---|
, CTXClass
, and CTXCalc
(4, 5, 7). This diversity of rstR constitutes the molecular basis for heteroimmunity among CTX phages. We examined the CTX prophage in 18 clinical isolates from Mozambique using specific probes for the different repressor genes rstR as well as probes for the ctxA and zot genes [see supporting information (SI) Fig. 5]. All isolates hybridized with the rstRClass probe, and not with other rstR probes, and the restriction patterns derived from respective CTX phage genes were identical for all of the isolates tested. The deduced CTX prophage array structure based on the observed restriction patterns of the ctxA, zot, and rstR genes in the Mozambique strain agreed with a previous report (8), suggesting that the strain carries two copies of the CTXClass prophage, arranged in tandem (SI Fig. 5). We attempted to identify the CTX prophage integration site in the Mozambique strain of V. cholerae O1 by analyzing PCR products containing the junctions between the prophage and chromosomal DNA. Interestingly, we found that primers specific for the TLC region (9), which is found upstream of the CTX prophage in other V. cholerae O1 strains, failed to produce a PCR amplicon, suggesting that the TLC region is absent in the Mozambique strain. This was later confirmed by using genomic microarray and sequence analysis, and both copies of the CTXClass prophage were found to be located on the small chromosome (see below). Most El Tor strains are also known to carry a satellite phage genome originally termed RS1 element, which is related to CTX
(10). RS1 encodes an antirepressor rstC that counteracts the activity of the CTX phage repressor rstR and thus promotes expression of genes required for phage production (11). However, analysis of the Mozambique strain for the presence of RS1 showed that, as in classical biotype strains, the RS1 element was also absent in the Mozambique strain.
Excision of the CTX Prophage.
The production of extrachromosomal CTX
genome (pCTX) by excision of the prophage and subsequent production of phage particles is presumed to depend on the structure of the chromosomal CTX array. Davis and colleagues (4) have proposed that a distinct process, apparently unrelated to phage DNA integration, usually underlies formation of CTX
virions. This process, which does not require homologous recombination, depends on the presence of tandem prophages or of a prophage followed by a related genetic element known as RS1. The CTX prophage is known to reside in the chromosome of El Tor biotype strains forming tandem arrays of two prophages or a prophage and an RS1. On the other hand, most classical strains have either a solitary phage or two fused truncated prophages (5), and classical strains are not known to produce pCTX or infectious virions. Because, in the Mozambique strain, the CTX prophage formed an array of two tandem prophages, we assumed that this array structure might support the excision of the CTX prophage to form pCTX in the Mozambique strain. To verify this, we conducted Southern blot analysis of plasmid and phage preparations from all isolates. Although none of the isolates produced phage particles, at least three isolates produced a plasmid band that hybridized with the CTX probe (data not shown). However, repeated attempts to clone the excised CTX
genome by inserting a kanamycin-resistance (KanR) marker in intergenic regions failed, although the excised phage genome could be cloned into pUC18, suggesting that the excised phage DNA was likely deficient in self-replication. The phage genome from one of these Mozambique isolates designated B-33 was cloned into pUC18 to construct pNSF-1. The insert of pNSF-1 was later sequenced and found to be the complete genome of a classical type CTX
(see below).
Comparative Genomic Analysis. The representative Mozambique strain B-33 was analyzed together with various control V. cholerae strains (SI Table 2) in our collection for their gene content as compared with strain N16961, a typical El Tor strain whose whole genome has been sequenced (12). The results of microarray analyses of various strains included in the study are summarized in Fig. 1. The array was based on the genomic sequence of N16961 and represented a total of 3,890 genes (13).
|
9899% of these genes. In agreement with previous studies, we found that occasionally the test strains were negative for clusters of genes that correspond to multigene insertions or chromosomal "islands" in the N16961 genome. However, the microarray data did not always indicate that every gene of an island was absent. In some cases, PCR and/or Southern blot analysis was conducted to confirm the presence or absence of genes within a suspected island insertion. This finding was particularly true for environmental isolates and supported our previous assumption that the gene clusters might have been derived from genes that are dispersed as individual genes among environmental strains.
Sequence of the CTX Prophage and Phage Integration Sites.
Determination of the nucleotide sequence of the excised CTX
genome in the Mozambique strain B-33 revealed that the phage genome comprised 7,007 nucleotides and contained all of the ORFs and intergenic sequences of a typical CTX
genome. The orientation of the ORFs were also identical to that of previously reported CTX phage sequences. The sequence of rstR gene was identical to that of the CTXClass prophage in strain 569B. There were minor differences among the sequences of other genes of CTX prophage in B-33 and those of CTXClass
carried by strain 569B and CTXET
of strain N16961. However, none of these nucleotide replacements or deletions appear to have resulted in frame-shift mutations in any of the genes.
Analysis of the sequence of the phage integration sites (att) on both chromosomes of the Mozambique strain B-33, revealed interesting differences between typical El Tor strains and strain B-33. The seventh pandemic El Tor strain represented by N16961 carries the CTXET prophage located at the att region between dif core1 and dif core2 of chromosome I, with adjacent elements known as TLC and RS1 (Table 1 and Figs. 2 and 3). Dif sites are located opposite the origin of replication and are binding sites for recombinases XerC and XerD to resolve dimeric chromosomes before cell division (14, 15). Previous studies have shown that CTX
integration requires XerC and XerD, and CTX
integrates into the normally overlapping dif core1 and dif core2 sites (16, 17). CTXET
is integrated into the attRS1 site within dif core1, thus separating dif 1 and dif 2 in N16961 (Fig. 3). Surprisingly, we found that B33 carried only the dif core2 site in chromosome I. Although B33 does not carry a CTX prophage in chromosome I, two copies of the CTXClass prophage in tandem are located on chromosome II. These copies apparently inserted at a second filamentous phage attachment site in B33 that is functional for CTXClass
, such as those carried by the typical classical strain O395. These and other differences in the B33 att and related sequences are summarized in Table 1 and Fig. 3. Interestingly, the CTX-negative strain AM19226 carries two other filamentous phages (VSK and f237), each inserted at the dif core sites on the two chromosomes. The absence of TLC and dif core1 site in chromosome I of B33 could be explained by a recombination event between flanking attRS1-related end repeat (ER) sites (Fig. 4). Based on sequence across the filled and empty dif sites in different strains, it appears that during this recombination event, the precursor to B33 might have lost the TLC element, CTX
, and the dif core1 (Table 1 and Figs. 2 and 3). Without dif core1, which contains an attRS1 site required for integration of CTXET
into chromosome I, the precursor strain was no longer able to acquire CTXET
. Thus, if cholera toxin is required for virulence, this hypothetical precursor strain was "forced" to pick up CTXClass
, which can use the chromosome II attachment site.
|
|
|
|
| Discussion |
|---|
|
|
|---|
We propose a pathway in which multiple recombination events were involved in the origination of the Mozambique strain from a precursor seventh pandemic strain similar to N16961. In this pathway, the Mozambique strain was derived from an El Tor-like strain that lost the El Tor CTX phage, TLC, and RS1, based on a loop-out recombination between related att sites, preventing future El Tor phage acquisition (Fig. 4). Because the remaining attachment site on chromosome II can be used only by a CTXClass
, the strain acquired a CTXClass
to generate B33. The acquisition of CTX phage might have been driven by strong selective pressure to colonize and amplify in the host intestinal environment.
It is not clear whether the progenitor strain of B33 received the classical CTX phage genome through an infectious CTXClass
, particularly because production of CTX
particles by classical V. cholerae strains has not been demonstrated. However, the possibility that infectious CTXClass
particles existed in nature or the phage genome might have been packaged into another phage cannot be ruled out. It is also not clear whether the circular but apparently nonreplicating form of the classical CTX
genome demonstrated in this study is a required intermediate for its transfer between V. cholerae strains. Interestingly, a recent study has suggested that V. cholerae may become naturally competent to uptake DNA from its environment when induced by growing on a chitin substrate (22). Thus, the precursor strain of B33 might have accepted a free CTXClass phage genome or a chromosomal segment of a classical V. cholerae strain carrying the CTXClass prophage from the environment.
It is now generally agreed that there were at least two critical gene transfer events in the evolution of pathogenic V. cholerae from a nonpathogenic precursor strain (9). First, the progenitor strain acquired the TCP pathogencity island (via a mechanism that is not fully understood) and thereby became colonization proficient while simultaneously acquiring the receptor for CTX
(3). Then, perhaps in the intestine, this TCP-positive strain was infected by CTX
. Previously, the finding that all TCP-positive V. cholerae O1 strains are also positive for TLC, irrespective of their CTX status, suggested the possibility that acquisition of the TLC element by TCP-positive V. cholerae strains is an intermediate step preceding the acquisition of CTX
in the evolution of fully pathogenic V. cholerae (9). We examined the sequences surrounding chromosome I dif sites of several non-O1 non-O139 strains that are negative for TCP, TLC, and CTX as well as strain 2740-80, an environmental U.S. Gulf Coast isolate that is clonal with TCP+CTX+ isolates but lacks CTX. From the sequence alignment and phylogenetic analysis (Fig. 2), dif1 sites of strains containing TLC are closely related and have intact attRS1 sites, whereas the three non-O1 non-O139 strains have a different dif 1 site and do not contain an attRS1 site, which is required for CTX
integration. Furthermore, the TLC element is linked to CTX
by an element-intervening ORF (eio) (9); the TLC and eio sequences are duplicated once in N16961 and twice in O395. The number of TLC duplications corresponds to the number of dif1 and attRS1 sites present in the region; explicitly, there are one TLC element and one dif 1/attRS1 in 274080, two in N16961, and three in O395. Thus, acquisition of TLC copies is correlated with duplications of dif 1 sites. Interestingly, the eio genes between the duplicate TLC and dif1/attRS1 sequences are truncated, suggesting that different versions of TLC carrying different eio sequences were acquired through time. Based on the chromosomal location and DNA sequence analysis, TLC may have initially brought in the attRS1 site that allowed the seventh pandemic strain to emerge through lysogenic conversion by the El Tor CTX phage. We predict that the precursor of the seventh pandemic clone needed to pick up the TLC element currently in seventh pandemic strains because this element can potentially generate an attRS1 site upon insertion into the chromosomal I site. TLC encodes a filamentous phage replicase and is likely a satellite phage but has not been studied in this context. The concept that a satellite phage (TLC) can carry only a short sequence that, upon integration into the chromosome, could generate a new functional att site for another phage (CTXET
) that carries a selectable trait (i.e., cholera toxin) is fascinating and deserves further experimental investigation.
The reconstruction of the events that led to the origination of strain B33 can also shed light on the emergence of the seventh pandemic clone. This also provides evidence that the seventh pandemic El Tor genomic backbone remains more fit than other TCP+CTX+ strains, because the Mozambique strain remains a seventh pandemic El Tor strain, despite carrying the classical CTX prophage. Based on molecular markers, the sixth-pandemic, seventh-pandemic, and U.S. Gulf coast isolates have been classified as three distinct clones, apparently evolving independently from environmental, nontoxigenic, non-O1 strains (23). The emergence of the Mozambique strain and its association with incidences of cholera may represent yet another pathway for the evolution of pathogenic strains from environmental Vibrios that have their origin in pathogenic strains that have lost virulence factors. Whether the loss and gain of virulence factors provide strains with increased fitness in the environment or in the host remains to be determined. It will also be interesting to see whether the Mozambique clone of V. cholerae El Tor O1 will eventually replace the typical seventh pandemic strain as a causative agent of cholera in the sub-Saharan African region and elsewhere in the world. If this occurs, we propose that the minor differences in the sequence or regulation of the ctxAB genes on the two copies of the CTXClass phage carried by B33 are the most likely explanation for this improved evolutionary fitness.
| Materials and Methods |
|---|
|
|
|---|
Probes and PCR Assays.
All strains were analyzed for the presence of different virulence genes, as well as distinctive genes carried by the two biotypes of V. cholerae O1. Presence of tcpA genes specific for the classical and El Tor biotypes was determined by using a multiplex PCR assay as described previously (25). Presence of classical, El Tor, and Calcutta type rstR genes of the CTX prophage and the rstC gene of the RS1 element were determined by PCR as well as specific DNA probes as described by us (2527). All probes were labeled by random priming (28) using a random primers DNA labeling kit (Invitrogen, Carlsbad, CA) and [
-32P]deoxycytidine triphosphate [3,000 Ci/mmol (1 Ci = 37 GBq); Amersham Biosciences, Uppsala, Sweden]. Southern blots were hybridized with the labeled probes and washed under stringent conditions, and autoradiographs were developed by using standard methods (29).
Microarray Analysis. The preparation of V. cholerae DNA microarrays and their use in comparative genomic studies have been described (13). Briefly, genomic DNA from each strain was extracted by using the EasyDNA kit (Invitrogen), and 3 µg of DNA from the test strain was used to generate Cy3- and Cy5-labeled samples (13). For each test strain, two independent gDNA preparations were used as template, and both Cy3 and Cy5-dCTP were used in independent labeling and hybridization experiments to account for any differences in DNA preparation or dye incorporation. Hybridized, washed slides were scanned for Cy5 and Cy3 fluorescence intensities by using a ScanArray 5000 (Packard Instruments). The resulting files were analyzed by using GENEPIX 3.0 software (Axon Instruments, Foster City, CA). In a single array experiment, the genomic content from one of the test strains was compared with that of N16961. For each of the test strains, data were compiled from at least two array experiments. Confirmation of absent genes included verification by Southern blot analysis, PCR analysis, or both methods. Standard 50-ml PCRs using TaqDNA polymerase were performed according to the manufacturer's instructions (Invitrogen). Southern blot analysis was performed by using standard protocols (29).
Sequencing the B-33 Genome. A B-33 genomic library was constructed according to standard methods. Sequencing was performed with pSMART primers (Lucigen, Middleton, WI) in 10-µl reactions in 384-well plates using Applied Biosystems BIGDYE V 3.1. Products were purified by using the CLEANSEQ system (Agencourt, Beverly, MA) on BIOMEK FX (Beckman Coulter, Fullerton, CA) and applied to an ABI 3730 sequencer. Chromatograms were transferred to the UNIX platform, and quality controlquality assessment scripts were run. Data were analyzed and assembled by using PHRED, PHRAP, and CONSED (3032). Sequence data were submitted to the Annotation Engine at The Institute for Genomic Research (Rockville, MD). The GLIMMER system (33, 34) was used to identify putative ORFs.
In addition, the CTX prophage in B33 was also sequenced separately from the excised pCTX DNA. Initially, overlapping subclones of the excised pCTX derived from the CTX prophage of a representative Mozambique strain B-33 was constructed in pUC18 and were sequenced by using universal sequencing primers (Invitrogen). Nucleotide sequencing was performed with an Applied Biosystems automated DNA sequencing system (ABI Prism 310 PE; Applied Biosystems, Foster City, CA) using BigDye terminator cycle sequencing ready reaction kit (PE; Applied Biosystems). Nucleotide sequence of both strands of pCTX were further determined by primer walking with primers derived from the preliminary sequencing of pCTX subclones in pUC18. Sequences were processed by using the Sequencher alignment program, Version 4.0 (Gene Codes, Ann Arbor, MI). Nucleotide sequence of pCTX of strain B-33 was compared with sequences in the GenBank databases, and protein homology search was done by using the National Center for Biotechnology Information BLAST server program.
Sequence Analysis. Sequence analysis was performed by using Artemis (Sanger, Cambridge, U.K.), and Lasergene (DNAStar, Madison, WI). Sequence alignments were created by using the CLUSTALW alignment feature of MACVECTOR 7.2.3.
| Acknowledgements |
|---|
|
|
|---|
| Footnotes |
|---|
Abbreviations: ER, end repeat.
To whom correspondence may be addressed. E-mail: faruque{at}icddrb.org or gbnair{at}icddrb.org
Author contributions: S.M.F., J.D.C., J.J.M., and G.B.N. designed research; S.M.F., V.C.T., N.C., P.D., M.D., and J.F.H. performed research; S.M.F., V.C.T., N.C., P.D., M.D., J.F.H., and J.J.M. analyzed data; and S.M.F. and J.J.M. wrote the paper.
The authors declare no conflict of interest.
This article contains supporting information online at www.pnas.org/cgi/content/full/0700365104/DC1.
© 2007 by The National Academy of Sciences of the USA
| References |
|---|
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||