Identification of five new genes on the Y chromosome of Drosophila melanogaster

  1. Antonio Bernardo Carvalho*,,,
  2. Bridget A. Dobo,
  3. Maria D. Vibranovski*, and
  4. Andrew G. Clark
  1. *Departamento de Genética, Universidade Federal do Rio de Janeiro, Caixa Postal 68011 CEP 21944-970, Rio de Janeiro, Brazil; and Institute of Molecular Evolutionary Genetics, Department of Biology, Pennsylvania State University, University Park, PA 16802
  1. Communicated by Dan L. Lindsley, University of California at San Diego, La Jolla, CA (received for review February 20, 2001)

Abstract

The heterochromatic state of the Drosophila Y chromosome has made the cloning and identification of Y-linked genes a challenging process. Here, we report application of a procedure to identify Y-linked gene fragments from the unmapped residue of the whole genome sequencing effort. Previously identified Y-linked genes appear in sequenced scaffolds as individual exons, apparently because many introns have become heterochromatic, growing to enormous size and becoming virtually unclonable. A TBLASTN search using all known proteins as query sequences, tested against a blastable database of the unmapped fragments, produced a number of matches consistent with this scenario. Reverse transcription–PCR and genetic methods were used to confirm those that are expressed, Y-linked genes. The five genes reported here include three protein phosphatases (Pp1-Y1, Pp1-Y2, and PPr-Y), an occludin-related gene (ORY), and a coiled-coils gene (CCY). This brings the total to nine protein-coding genes identified on the Drosophila Y chromosome. ORY and CCY may correspond, respectively, to the fertility factors ks-1 and ks-2, whereas the three protein phosphatases represent novel genes. There remains a strong functional coherence to male function among the genes on the Drosophila Y chromosome.

Footnotes

  • To whom reprint requests should be addressed. E-mail: bernardo{at}biologia.ufrj.br.

  • Data deposition: The sequences reported in this paper have been deposited in the GenBank database [accession nos. AF427493 (Pp1-Y1), AF427494 (Pp1-Y2), AF427495 and AF747998 (PPr-Y), AF427496 (ORY), and AF427497 (CCY)].

  • Abbreviations:
    WGS,
    whole genome shotgun;
    armU,
    unmapped scaffolds of the Drosophila Genome Project;
    RT,
    reverse transcription;
    3′-RACE,
    3′-rapid amplification of cDNA ends;
    EST,
    expressed sequence tag
« Previous | Next Article »Table of Contents