A high-resolution map of transcription in the yeast genome

  1. Lior David*,,
  2. Wolfgang Huber,,
  3. Marina Granovskaia§,
  4. Joern Toedling,
  5. Curtis J. Palm*,
  6. Lee Bofkin,
  7. Ted Jones*,
  8. Ronald W. Davis*,, and
  9. Lars M. Steinmetz*,§,
  1. *Stanford Genome Technology Center and Department of Biochemistry, Stanford University, Palo Alto, CA 94304;
  2. European Bioinformatics Institute, European Molecular Biology Laboratory, Cambridge CB10 1SD, United Kingdom; and
  3. §European Molecular Biology Laboratory, 69117 Heidelberg, Germany
  1. Contributed by Ronald W. Davis, February 10, 2006

  2. L.D. and W.H. contributed equally to this work.

  1. Fig. 1.

    Visualization of yeast tiling array intensities along 100 kb of chromosome 1, corresponding to ≈1% of the genome. The plot shows the normalized hybridization intensities (y axis) along genomic coordinates (x axis in bp). Each dot corresponds to a probe, Watson strand in green and Crick strand in blue. Probes with more than one perfect match in the genome are colored gray. Annotated ORFs are shown as blue boxes, dubious ORFs are shown as light blue boxes, and transcription factor binding sites are shown as gray bars. Vertical lines are segment boundaries. The background threshold (y = 0) is shown as a horizontal line.


  2. Fig. 2.

    Examples of transcriptional architecture. (a) Detection of spliced transcripts. (b) Long 5′ UTR of GCN4 including its cotranscribed upstream ORFs. (c) Complex transcript architecture of MET7. (d) Overlapping transcripts of two ORFs. (e) Adjacent transcripts of SER3 and the noncoding SRG1. (f) Nonannotated isolated transcript. (g) Transcript antisense to SPO22. CDS refers to coding sequence; uORF, upstream ORF; ncRNA, noncoding RNA; TF, transcription factor. Plot layout as in Fig. 1.


  3. Fig. 3.

    Length of UTRs and functional categories with exceptional UTR length. Analyses were based on 2,044 genes from poly(A) samples. (a) Scatterplot and histogram of 3′ vs. 5′ UTR lengths. (b) Association between UTR length, cellular localization, and biological process. Length distributions between genes inside and outside of GO categories were compared, and selected significant categories are shown (orange, cellular component; green, biological process; blue, molecular function). For each category, a horizontal line shows the 5′ and 3′ median UTR lengths measured in nucleotides (x axis). The median over all genes is shown by a vertical dashed line. Significant medians are indicated by asterisks, red longer, blue shorter (two-sided Wilcoxon test, P ≤ 0.002).


  4. Fig. 4.

    Categories of expressed segments, their length, and their expression levels. (a) Number and percentage of the expressed segments detected from the poly(A) RNA and total RNA hybridizations. Categories “>= 50%” and “<50%” consist of segments that overlap more, or less, than half of an annotated feature, respectively. The “nonannotated isolated” category consists of segments that have no overlap with annotated features on either strand, whereas the “nonannotated antisense” category consists of those that overlap with features on the opposite strand. The “filtered” categories consist of the high confidence segments that passed our filter, and the “unassigned” categories consist of the remaining segments. Length (b) and transcript level (c) distributions for segments from the above categories are given.


Footnotes

  • To whom correspondence may be addressed. E-mail: dbowe{at}stanford.edu or larsms{at}embl.de
« Previous | Next Article »Table of Contents
OPEN ACCESS ARTICLE