EVOLUTION
Horizontal gene transfer from flowering plants to Gnetum


Department of Biology, University of Missouri, 8001 Natural Bridge Road, St. Louis, MO 63121; and
Missouri Botanical Garden, P.O. Box 299, St. Louis, MO 63166
Edited by Barbara A. Schaal, Washington University, St. Louis, MO, and approved July 18, 2003 (received for review June 12, 2003)
| Abstract |
|---|
During work on the phylogeny of Gnetum, we developed specific primers to amplify the second intron in the nad1 gene, plus flanking exons, from numerous previously unstudied species of Gnetum. We discovered that Gnetum harbors two copies of nad1 intron 2 and reasoned that a larger-scale survey of complete nad1 intron 2 sequences should allow identification of the source of the different copies because major seed plant lineages have specific intron signatures. If the origin of the two copies in Gnetum could be traced, this might provide evidence for horizontal gene transfer, adding a new dimension to our understanding of group II intron evolution. An earlier survey of the distribution of nad1 intron 2 in 276 angiosperms and 17 gymnosperms representing all major lineages of seed plants had shown that the intron is widespread, but absent in seven unrelated angiosperms, non-Pinaceae conifers, and the Gnetales genus Welwitschia (19). The sister group of Welwitschia, Gnetum, was found to have the intron, whereas the third genus of Gnetales, Ephedra, gave a weak hybridization signal and multiple PCR products. Given its apparent multiple loss in angiosperms, the intron's distribution in gymnosperms was most parsimoniously explained by parallel independent losses in Welwitschia and the ancestor of non-Pinaceae conifers (19). This interpretation of a few repeated losses, rather than multiple regains, is independent of the specific relationships among seed plants and the mono- or paraphyly of Coniferales, which are currently unresolved (ref. 21 and references therein). However, these previous studies of the distribution of nad1 intron 2 in seed plants have mainly relied on Southern hybridization.
| Materials and Methods |
|---|
Gene Sequencing. We extracted DNA from silica gel-dried leaves by using Qiagen (Valencia, CA) Plant DNeasy mini kits. Concentration and quality of extracted DNAs were checked by 1.5% agarose gel electrophoresis with a Lambda/HindIII/EcoRI size marker. We amplified the entire nad1 intron 2 region by using the Expand High Fidelity PCR system (Roche Applied Science), using primers "exonB" and "exonC" designed by Demesure et al. (ref. 22; Figs. 4 and 5, which are published as supporting information on the PNAS web site). The 5' and 3' region sequences of each type were compared with partial Gnetum gnemon sequences (19), and their homologues were identified by BLASTN searches. Exon sequences homologous to partial G. gnemon sequences were designated as "gymnosperm-type," whereas exon sequences homologous to angiosperm ones were designated "angiosperm-type." Type-specific internal primers were then designed to prevent amplification of the other sequence type (Figs. 4 and 5), and appropriate amounts (0.54 ng) of template DNA were added to the PCR master mixture following the manufacturer's protocol. Portions of the PCR products were run on a 1.5% agarose gel with a Lambda/HindIII/EcoRI size marker to verify the size and amount of the PCR products (Fig. 5). Single-band PCR products were purified with Qiagen PCR purification kits. Double bands were separated via 1.2% agarose gel electrophoresis and then purified using Qiagen QIAEX II gel purification kits. Separated and purified PCR products were PCR-sequenced, using ABI Prism Big Dye Terminator Cycle Sequencing Ready Reaction kits (PerkinElmer) with the amplification primers. Sequences were aligned and cleaned up in SEQMAN II (DNASTAR).
Sequence Analysis and Secondary Structure Comparison. Gnetum mt nad1 intron 2 and partial exon b and c sequences were compared with homologous regions in Citrullus (watermelon, Cucurbitaceae) whose secondary structure has been predicted (11). This helped to find and align the stem regions of the six domains. A data matrix of seed plant nad1 exon b and c and intron 2 sequences was constructed that included information on the domain regions, by using SE-AL software (version 2.0a11; Fig. 2 and Fig. 6, which is published as supporting information on the PNAS web site). We verified the secondary structure by the thermodynamic method using MFOLD software (version 3.1, www.bioinfo.rpi.edu/applications/mfold/; refs. 23 and 24; Fig. 1B). Domains between ID and I3 are excluded from Fig. 6 because of mismatches in the thermodynamically predicted secondary structure with the Citrullus model (Fig. 1B). Length and GC content of each domain were calculated from the aligned sequences (Tables 3 and 4, which are published as supporting information on the PNAS web site). Sequence divergences were calculated by using the General Time Reversible model with among-site rate variation and proportion of invariant sites (GTR + G + I; ref. 25) estimated by Bayesian analysis (Table 1 and Table 5, which is published as supporting information on the PNAS web site).
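The per-domain length and GC content calculation described above can be illustrated with a minimal sketch. It assumes that each domain is given as a half-open column range in the alignment; the function names, the toy sequence, and the domain boundaries are hypothetical and are not taken from the paper's data matrix.

```python
def length_and_gc(aligned_seq):
    """Return (ungapped length, GC fraction) for one aligned sequence slice."""
    # Gap characters are alignment padding, not residues, so drop them first.
    seq = aligned_seq.replace("-", "").replace(".", "").upper()
    gc = sum(1 for base in seq if base in "GC")
    acgt = sum(1 for base in seq if base in "ACGT")
    # GC fraction is taken over unambiguous bases only.
    return len(seq), (gc / acgt if acgt else 0.0)


def domain_stats(aligned_seq, boundaries):
    """Map each domain name to (length, GC fraction).

    `boundaries` maps a domain name to a (start, end) column range
    (0-based, end exclusive) in the alignment; these ranges are assumed inputs.
    """
    return {
        name: length_and_gc(aligned_seq[start:end])
        for name, (start, end) in boundaries.items()
    }


# Toy example with made-up columns, for illustration only.
toy = "AC-GT--GGCC"
print(domain_stats(toy, {"I": (0, 5), "II": (5, 11)}))
```

Computing the values from ungapped slices keeps the lengths comparable across accessions even when the alignment itself contains long gaps.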
Phylogenetic Analysis. Phylogenetic analyses used PAUP* version 4.0b10 (26) and MrBayes (27). For Fig. 3A, partial sequences of mt nad1 exons b and c (Fig. 2) and the conserved sections of intron 2 (Fig. 6) from 85 accessions (Table 2) were analyzed by neighbor-joining, with sequence divergences calculated under the GTR + G + I model (25) after removing all gaps and ambiguous characters. Nonparametric bootstrap support was obtained by resampling the data 1,000 times with the same search options and model. Parsimony analyses of full intron sequences using heuristic searching were run separately for each intron type. Heuristic searching used 100 random taxon addition replicates, holding 100 trees at each step, tree bisection–reconnection branch swapping, MulTrees, Collapse, and Steepest Descent options, and no upper limit for trees held in memory. The partial exons/intron sequence tree and two full-length intron sequence trees had basically identical topologies, with the intron trees showing more phylogenetic structure.
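Two steps mentioned above, removal of gapped or ambiguous columns and nonparametric bootstrap resampling, can be sketched as follows. This is an illustrative stand-in, not the PAUP* implementation; the function names and the toy alignment are hypothetical, and tree building on each replicate is left out.

```python
import random


def strip_gapped_columns(alignment):
    """Keep only columns in which every sequence has an unambiguous A, C, G, or T."""
    names = list(alignment)
    columns = zip(*(alignment[name].upper() for name in names))
    kept = [col for col in columns if all(base in "ACGT" for base in col)]
    return {name: "".join(col[i] for col in kept) for i, name in enumerate(names)}


def bootstrap_replicate(alignment, rng):
    """Resample alignment columns with replacement to the original length."""
    names = list(alignment)
    n_sites = len(alignment[names[0]])
    picks = [rng.randrange(n_sites) for _ in range(n_sites)]
    return {name: "".join(alignment[name][i] for i in picks) for name in names}


# Toy alignment (made up); a real analysis would build a tree per replicate.
toy = {"a": "ACGT-N", "b": "ACGTAC", "c": "AC-TAC"}
clean = strip_gapped_columns(toy)
rng = random.Random(0)  # fixed seed so the sketch is reproducible
replicates = [bootstrap_replicate(clean, rng) for _ in range(1000)]
print(clean, len(replicates))
```

Bootstrap support for a clade is then the fraction of replicate trees in which that clade appears.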
To construct a phylogenetic framework in which to analyze the distribution of the nad1 intron 2 within Gnetum, we sequenced five additional loci (together 4,540 aligned base pairs): the nuclear Leafy gene second intron, the nuclear internal transcribed spacers/5.8S, the plastid rbcL gene, the plastid matK gene, and the plastid tRNALeu (UAA) intron and adjacent spacers. Combined data were analyzed, after removing all gaps and ambiguous sites, under parsimony, maximum likelihood, and Bayesian optimization. The final data set included 40 accessions of Gnetum, of which 27 nonredundant ones are included in Fig. 3B to represent the species recognized in a new monograph (H.W., unpublished data). Parsimony analysis used the same options as above. Maximum likelihood used the GTR + Gamma + Pinv model. Bayesian probabilities were obtained under the GTR + Gamma model, with four Markov chain Monte Carlo chains run for 556,700 generations, using random trees as starting points, sampling every 10th generation, and discarding the first 16,300 trees as burn-in.
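The sampling scheme stated above implies a simple count of retained trees, which the following sketch works through. It assumes one sampled tree per 10 generations and does not count the starting tree; some programs also record the initial state, which would shift the totals by one. The function name is hypothetical.

```python
def mcmc_sample_counts(generations, sample_every, burn_in_trees):
    """Return (trees sampled, trees retained, burn-in fraction)."""
    sampled = generations // sample_every
    retained = sampled - burn_in_trees
    return sampled, retained, burn_in_trees / sampled


sampled, retained, fraction = mcmc_sample_counts(556_700, 10, 16_300)
print(sampled, retained, round(fraction, 3))
```

Under these assumptions about 29% of the sampled trees are discarded as burn-in and the posterior probabilities rest on the remainder.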
Age Estimation of Crown Gnetum Lineages. For the age estimates, combined chloroplast matK and rbcL sequences of seed plants were analyzed under maximum likelihood, using the GTR + Gamma + Pinv model. Psilotum was included to root seed plants. Likelihood ratio tests were used to assess clock-like behavior of the data. Branch lengths in Gnetales were calibrated with Gurvanella (28), a fossil consisting of branches and attached seeds, and Cratonia cotyledon, a seedling macrofossil from the Brazilian Crato formation (29, 30). Gurvanella constrains the minimum age of Gnetales to 125 million years (my) ago (Barremian–Aptian) because it combines the characteristic mode of branching of Ephedra with the seed morphology of Welwitschia; the seedling from the Crato formation shows an embryo feeder, epidermis and venation characters unique to Gnetum and Welwitschia, and constrains the minimum age of their most recent common ancestor to 115 my ago (Aptian–Albian).
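A likelihood ratio test of a clock against a non-clock model is commonly set up as in the sketch below. It assumes the standard formulation in which twice the log-likelihood difference is compared with a chi-square distribution with n - 2 degrees of freedom for n taxa; the paper does not spell out its exact setup, and the log-likelihoods and taxon count in the example are hypothetical.

```python
import math


def chi2_sf(x, df):
    """Upper-tail probability of a chi-square variable with integer df."""
    if x <= 0:
        return 1.0
    half = x / 2.0
    if df % 2 == 0:
        # Closed form for even df: exp(-x/2) * sum_{i<df/2} (x/2)^i / i!
        return math.exp(-half) * sum(
            half ** i / math.factorial(i) for i in range(df // 2)
        )
    # Closed form for odd df, built on the complementary error function.
    k = (df - 1) // 2
    tail = sum(half ** (j - 0.5) / math.gamma(j + 0.5) for j in range(1, k + 1))
    return math.erfc(math.sqrt(half)) + math.exp(-half) * tail


def clock_lrt(lnl_clock, lnl_nonclock, n_taxa):
    """Return (test statistic, degrees of freedom, P value)."""
    stat = 2.0 * (lnl_nonclock - lnl_clock)
    df = n_taxa - 2  # the clock model has n - 2 fewer free branch lengths
    return stat, df, chi2_sf(stat, df)


# Hypothetical log-likelihoods for a six-taxon data set.
print(clock_lrt(-1005.0, -1000.0, 6))
```

A small P value indicates that the unconstrained model fits significantly better, so the clock assumption is rejected at the chosen significance level.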
| Results |
|---|
The length of the gymnosperm-type introns in Gnetum ranged from 1,124 to 1,779 bp, whereas that of the angiosperm-type introns ranged from 1,345 to 1,352 bp (Tables 3 and 4). Length variation in Gnetum gymnosperm-type introns was due mainly to domain II, which comprised between 383 and 994 bp. Domain IV varied between 237 and 316 bp, and domains I (292–295 bp), III (102 bp), V (34 bp), and VI (10 bp) were invariant in length across accessions (Fig. 1B). As expected from this length variation, gymnosperm-type introns across seed plants were difficult to align. Thus, in domain IV, Pinaceae introns were up to 10 times longer than Gnetum introns (1,347–2,367 vs. 237–316 bp), whereas in domain II, Gnetum introns were up to six times longer than Pinaceae introns (383–994 vs. 159 bp). Angiosperm-type introns showed little variation in domain lengths except for domain IV (Tables 3 and 4) and were thus relatively easily alignable. As expected from these differences, alignment of the Gnetum gymnosperm-type intron sequences with Gnetum angiosperm-type intron sequences was not feasible except in the stem regions (Figs. 1B and 6).
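The "up to 10 times" and "up to six times" figures follow directly from the domain length ranges quoted above. A short check, dividing the longest sequence in one group by the shortest in the other (the helper name is hypothetical):

```python
def max_fold(longer_range, shorter_range):
    """Largest possible length ratio between two (min, max) bp ranges."""
    return max(longer_range) / min(shorter_range)


# Domain IV: Pinaceae (1,347-2,367 bp) vs. Gnetum (237-316 bp).
print(round(max_fold((1347, 2367), (237, 316)), 2))
# Domain II: Gnetum (383-994 bp) vs. Pinaceae (159 bp).
print(round(max_fold((383, 994), (159, 159)), 2))
```

The ratios come out at just under 10 for domain IV and slightly above 6 for domain II.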
Sequence divergences between the nad1 intron of angiosperms and the angiosperm-type intron of Gnetum (Tables 1 and 5) lie between 0.03 and 0.18, with the closest similarity (0.03–0.06) to Pagamea (Rubiaceae) and Petunia (Solanaceae), members of the euasterid I clade (31–33).
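To give a sense of what a pairwise divergence of this size means, the sketch below computes an uncorrected p-distance and a Jukes-Cantor correction for a pair of aligned sequences. This is a deliberately simpler substitute for the GTR + G + I model used in the paper, so its values would not reproduce the published tables; the function names and sequences are hypothetical.

```python
import math


def p_distance(seq_a, seq_b):
    """Proportion of differing sites, skipping gaps and ambiguous bases."""
    pairs = [
        (x, y)
        for x, y in zip(seq_a.upper(), seq_b.upper())
        if x in "ACGT" and y in "ACGT"
    ]
    if not pairs:
        raise ValueError("no comparable sites")
    return sum(x != y for x, y in pairs) / len(pairs)


def jukes_cantor(p):
    """Jukes-Cantor corrected distance (substitutions per site)."""
    if p >= 0.75:
        raise ValueError("p-distance too large for the Jukes-Cantor correction")
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


# Made-up sequences differing at 1 of 10 sites.
p = p_distance("ACGTACGTAC", "ACGTACGTAA")
print(p, round(jukes_cantor(p), 4))
```

Corrected distances exceed the raw proportion of differences because they allow for multiple substitutions at the same site; richer models such as GTR with rate variation extend the same idea.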
Phylogenetic Position of Gymnosperm- and Angiosperm-Type Exons/Intron. Fig. 3A shows the tree resulting from the analysis of partial exons and conserved sections of the nad1 intron 2 sequences obtained from Gnetum and other seed plants. Gnetum gymnosperm-type nad1 exons b and c and intron sequences cluster with Pinaceae, whereas Gnetum angiosperm-type sequences cluster inside the flowering plants. A Gnetum/Pinaceae clade is relatively well supported (73% bootstrap support) as found in other studies (refs. 19 and 21 and references therein), and intra-Gnetum relationships are congruent with those resulting from combined nuclear and chloroplast markers in that the South American species are sister to the Southeast Asian and African species (Fig. 3B). Angiosperm-type exon and intron sequences, including those from Gnetum, group together with 99% bootstrap support (lower part of Fig. 3A), and the Gnetum sequences are closest to the euasterids Pagamea (Rubiaceae) and Petunia (Solanaceae). The relationships among the remaining angiosperm nad1 introns are generally congruent with current three- and five-gene phylogenies for the same taxa (31–33).
Plotting of the two types of exons/introns on a phylogeny of Gnetum (Fig. 3B) reveals that the angiosperm-type sequences are restricted to one of two Southeast Asian clades (labeled "clade II"). Gnetum species from South America and the remainder of Southeast Asia contain only gymnosperm-type exons/introns, and the single African species, G. africanum, lacks either intron type. Strikingly, the angiosperm-type exons/introns are lacking, and apparently were lost secondarily, in four species of clade II (Fig. 3B).
Age Estimation of Crown Gnetum Lineages. Because a likelihood ratio test of models with and without a globally optimal substitution rate did not reject the clock assumption (0.05 > P > 0.01), branch lengths in chloroplast matK+rbcL data were calibrated with gnetalean macrofossils (Materials and Methods). Calibration with the Gurvanella fossil yielded a divergence time of 7 to 6 my for all extant Gnetum species and of 5 to 2 my for species in the Asian clade II of Gnetum (Table 6, which is published as supporting information on the PNAS web site, and Fig. 3C). Calibration with the Cratonia cotyledon fossil yielded an age of Gnetales of 205 to 184 my, of crown Gnetum of 11 to 10 my, and of the Asian clade II of 4 my.
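Under a strict clock, a single fossil calibration converts branch lengths into ages by simple proportion, as the sketch below shows. The node heights (substitutions per site from node to tips) are hypothetical and are not the values behind Table 6 or Fig. 3C; only the 125-my minimum age for Gnetales is taken from the text, and the function name is an assumption.

```python
def scale_ages(node_heights, calibration_node, calibration_age_my):
    """Convert clock-like node heights to ages using one calibrated node.

    node_heights: substitutions per site from each node to the tips.
    The calibrated node fixes the rate; every other age follows from it.
    """
    rate = node_heights[calibration_node] / calibration_age_my  # subs/site/my
    return {node: height / rate for node, height in node_heights.items()}


# Hypothetical heights, calibrated on Gnetales at 125 my (Gurvanella).
heights = {"Gnetales": 0.25, "crown_Gnetum": 0.0125}
print(scale_ages(heights, "Gnetales", 125.0))
```

Because a fossil supplies only a minimum age for its node, ages derived this way are best read as minimum estimates.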
| Discussion |
|---|
The phylogenetic distribution of the angiosperm-type intron in Gnetum suggests that the horizontal gene transfer happened in the stem lineage of the Asian clade II (Fig. 3B). Unlike the gymnosperm-type introns, which show large divergences in accordance with their long evolutionary history, the angiosperm-type introns show little divergence and size variation (Tables 1 and 3–5), indicative of short divergence times. Based on a local molecular clock, species in Gnetum clade II diverged from each other only 5 to 2 my ago (Fig. 3C). Relatively recent horizontal gene transfer from a Southeast Asian asterid angiosperm to the common ancestor of Gnetum clade II thus is the most parsimonious explanation for the distribution of the two copies of nad1 intron 2 found in Gnetum.
An alternative hypothesis explaining the observed phylogenetic incongruence between the two (xenologous) types of Gnetum nad1 introns would involve persistence of an ancient seed plant nad1 intron 2 polymorphism, followed by selective extinction (36, 37). Such an explanation requires that the nad1 second intron and adjacent exon sections duplicated in the most recent common ancestor of angiosperms and Gnetales and that the angiosperm-type intron and exons b and c then survived only in Gnetum and angiosperms. The age of the most recent common ancestor of angiosperms and Gnetales is currently unknown and depends on the resolution of seed plant phylogenetic relationships. However, the low sequence divergence between nad1 intron 2 in Pagamea and Petunia vs. Gnetum (0.03–0.06) argues against an ancient divergence of these introns. Also, all Gnetum angiosperm-like introns cluster with asterids (Pagamea and Petunia), that is, derived angiosperms, rather than as a sister lineage to angiosperms, but the age of the angiosperm nad1 intron 2 is thought to predate the split of monocots and dicots some 140 my ago (10). The monophyly of angiosperm sequences, including the Gnetum angiosperm-type exons/introns, leaves no other possible explanation but horizontal gene transfer.
Possible Mechanisms of Horizontal Gene Transfer. The mechanisms by which horizontal gene transfer takes place are largely a matter of speculation; agents that have been implicated are viruses, bacteria, fungi, and plant cell-piercing insects (see ref. 3). The discovery of mt heteroplasmy in Silene acaulis (38) suggests the possibility of transfer of an entire angiosperm mitochondrion to Gnetum during insect (moth or fly) pollination (39, 40). Mechanisms involved in yeast group II intron retrohoming/homing and coconversion (34, 35) are unlikely to apply to the present case of horizontal transfer because seed plant nad1 second introns have lost the ORFs crucial for retrohoming. Instead, the presence of angiosperm-specific characteristics in the upstream and downstream exons of Gnetum angiosperm-type nad1 intron 2 (Fig. 2), and especially the fact that these sites are posttranscriptionally edited (RNA editing) in angiosperms (refs. 5–7 and Fig. 2), suggests that the exons and intron were transferred simultaneously as DNA. The insertion of the GATA motif in the Gnetum angiosperm-type exon b may have occurred during the horizontal transfer or immediately after the transfer, because all Gnetum angiosperm-type sequences share this insert. Extra indels in angiosperm-type exon b occur in Gnetum aff. latifolium SAN151116 and Gnetum neglectum (data not shown), further supporting the pseudogenization of this exon. We found no evidence for pseudogenization in exon c, nor did we find evidence in the intron for secondary structure disruption.
Future work is needed to clarify the subcellular location of the angiosperm-type exons/introns as well as the scope of the horizontal transfer (see below). Resolving the location will require mapping of the Gnetum mt genome, which would also clarify the arrangement of the five exons of the nad1 gene (Fig. 1A). So far, intact group II introns have not been found in nuclear genomes (12), and incorporation of mt introns or genes into chloroplasts appears extremely rare (41, 42). That the horizontally transferred group II intron and its flanking exons may still be located in mitochondria is also suggested by the absence of acceleration in the substitution rates of the Gnetum angiosperm-type intron. Genes transferred from mitochondria to nuclei often acquire accelerated substitution rates (18, 41, 43, 44).
The scope of the horizontal transfer clearly involved the intron and adjacent exons. An involvement of all five exons (a–e) of the nad1 gene is unlikely because of their wide separation (Fig. 1A). The presence of two identical copies of mt nad1 exon a in maize (8) and of a group II intron in mt nad5 in Huperzia selago, which in addition has a group II intron in its mt nad1 (45), demonstrates that several copies of nad exons or introns can coexist in single mt genomes.
The extent and mechanisms of horizontal gene transfer between eukaryotes are not well understood, with concomitant unease about the release of genetically modified organisms. As shown here, horizontal transfer of mt DNA segments has occurred naturally between gymnosperms and angiosperms in their recent evolutionary past, and Bergthorsson et al. (46) reported instances of such transfers within flowering plants. These results indicate that natural mechanisms exist for the horizontal transfer of mt genes, suggesting that horizontal gene transfer may play an underestimated role in the evolution of seed plants.
| Acknowledgements |
|---|
| Footnotes |
|---|
Abbreviations: mt, mitochondrial; my, million years.
Data deposition: The DNA sequences reported in this paper have been deposited in the GenBank database (accession nos. AY230269–AY230316, AY231296–AY231300, AY243113, AY243121, AY243125, AY243129–AY243131, AY256880–AY256885, and AY283607–AY283610). For a full listing of accession numbers, see Table 2, which is published as supporting information on the PNAS web site, www.pnas.org.
To whom correspondence should be addressed. E-mail: renner{at}umsl.edu.
| References |
|---|