Simple and accurate estimation of ancestral protein sequences

  1. Barry G. Hall*
  1. Bellingham Research Institute, 218 Chuckanut Point Road, Bellingham, WA 98229
  1. Edited by W. Ford Doolittle, Dalhousie University, Halifax, NS, Canada, and approved February 14, 2006 (received for review October 14, 2005)

Abstract

There are a variety of reasons to reconstruct the sequences of ancient proteins, but whatever the reason, the value of the reconstructed protein depends on the accuracy with which the ancient sequence is inferred. This study uses sequences simulated by a sequence-evolution simulation program that compares parsimony, maximum likelihood, and the Bayesian methods of inferring ancestral sequences and concludes that the Bayesian method, as implemented by mrbayes 3.11, is preferred. Estimated ancestral sequences are of necessity the same length as the alignment on which the underlying phylogeny is based. A highly accurate method for correcting the estimated sequences is introduced, and it is shown that the correction permits inferring the sequences of ancient protein sequences with a very high degree of accuracy.

Footnotes

  • *E-mail: drbh{at}mail.rochester.edu
  • Author contributions: B.G.H. designed research, performed research, analyzed data, and wrote the paper.

  • Conflict of interest statement: No conflicts declared.

  • This paper was submitted directly (Track II) to the PNAS office.

« Previous | Next Article »Table of Contents