On the sequencing of the human genome

  1. Robert H. Waterston*,,
  2. Eric S. Lander, and
  3. John E. Sulston§
  1. *Genome Sequencing Center, Washington University, Saint Louis, MO 63108; Whitehead Institute/Massachusetts Institute of Technology Center for Genome Research, Cambridge, MA 02142; and §Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom
  1. Communicated by Aaron Klug, Medical Research Council, Cambridge, United Kingdom (received for review November 7, 2001)

Abstract

Two recent papers using different approaches reported draft sequences of the human genome. The international Human Genome Project (HGP) used the hierarchical shotgun approach, whereas Celera Genomics adopted the whole-genome shotgun (WGS) approach. Here, we analyze whether the latter paper provides a meaningful test of the WGS approach on a mammalian genome. In the Celera paper, the authors did not analyze their own WGS data. Instead, they decomposed the HGP's assembled sequence into a “perfect tiling path”, combined it with their WGS data, and assembled the merged data set. To study the implications of this approach, we perform computational analysis and find that a perfect tiling path with 2-fold coverage is sufficient to recover virtually the entirety of a genome assembly. We also examine the manner in which the assembly was anchored to the human genome and conclude that the process primarily depended on the HGP's sequence-tagged site maps, BAC maps, and clone-based sequences. Our analysis indicates that the Celera paper provides neither a meaningful test of the WGS approach nor an independent sequence of the human genome. Our analysis does not imply that a WGS approach could not be successfully applied to assemble a draft sequence of a large mammalian genome, but merely that the Celera paper does not provide such evidence.

Footnotes

  • To whom reprint requests should be addressed at: Genome Sequencing Center, Box 8501, 4444 Forest Park Boulevard, Saint Louis, MO 63108. E-mail: bwaterst{at}watson.wustl.edu.

  • See ref. 2. The nature of “shredding” is mentioned in passing on p. 1309, column 3, paragraph 4; the implications for assembly are not discussed in the paper.

  • See ref. 2, p. 1313, column 2, paragraph 3 and p. 1314, column 2, paragraph 2.

  • Abbreviations:
    HGP,
    Human Genome Project;
    HS,
    hierarchical shotgun;
    BACs,
    bacterial artificial chromosomes;
    WGS,
    whole-genome shotgun;
    STS,
    sequence-tagged sites;
    CSA,
    compartmentalized sequence assembly
« Previous | Next Article »Table of Contents