SIRE-1, a copia/Ty1-like retroelement from soybean, encodes a retroviral envelope-like protein

  1. Howard M. Laten*,
  2. Arpita Majumdar, and
  3. Eric A. Gaucher
  1. Biology Department, Loyola University Chicago, 6525 North Sheridan Road, Chicago, IL 60626

Abstract

The soybean genome hosts a family of several hundred, relatively homogeneous copies of a large, copia/Ty1-like retroelement designated SIRE-1. A copy of this element has been recovered from a Glycine max genomic library. DNA sequence analysis of two SIRE-1 subclones revealed that SIRE-1 contains a long, uninterrupted, ORF between the 3′ end of the pol ORF and the 3′ long terminal repeat (LTR), a region that harbors the env gene in retroviral genomes. Conceptual translation of this second ORF produces a 70-kDa protein. Computer analyses of the amino acid sequence predicted patterns of transmembrane domains, α-helices, and coiled coils strikingly similar to those found in mammalian retroviral envelope proteins. In addition, a 65-residue, proline-rich domain is characterized by a strong amino acid compositional bias virtually identical to that of the 60-amino acid, proline-rich neutralization domain of the feline leukemia virus surface protein. The assignment of SIRE-1 to the copia/Ty1 family was confirmed by comparison of the conceptual translation of its reverse transcriptase-like domain with those of other retroelements. This finding suggests the presence of a proretrovirus in a plant genome and is the strongest evidence to date for the existence of a retrovirus-like genome closely related to copia/Ty1 retrotransposons.

Footnotes

  • * To whom reprint requests should be addressed. e-mail: hlaten{at}orion.it.luc.edu.

  • Edted by Margaret G. Kidwell, University of Arizona, Tucson, AZ, and approved March 23, 1998

  • This paper was submitted directly (Track II) to the Proceedings Office.

  • Abbreviations: LTR, long terminal repeat; rt, reverse transcriptase; int, integrase; rh, ribonuclease H; env, envelope; SU, surface protein; TM, transmembrane protein.

  • Data deposition: The sequences reported in this paper have been deposited in the GenBank database (accession nos. U96295 and AF053008).

« Previous | Next Article »Table of Contents