Evolutionary population genetics of promoters: Predicting binding sites and functional phylogenies

  1. Ville Mustonen and
  2. Michael Lässig*
  1. Institut für Theoretische Physik, Universität zu Köln, Zülpicherstrasse 77, 50937 Cologne, Germany
  1. Edited by Tomoko Ohta, National Institute of Genetics, Mishima, Japan (received for review June 30, 2005)

Abstract

We study the evolution of transcription factor-binding sites in prokaryotes, using an empirically grounded model with point mutations and genetic drift. Selection acts on the site sequence via its binding affinity to the corresponding transcription factor. Calibrating the model with populations of functional binding sites, we verify this form of selection and show that typical sites are under substantial selection pressure for functionality: for cAMP response protein sites in Escherichia coli, the product of fitness difference and effective population size takes values 2NΔF of order 10. We apply this model to cross-species comparisons of binding sites in bacteria and obtain a prediction method for binding sites that uses evolutionary information in a quantitative way. At the same time, this method predicts the functional histories of orthologous sites in a phylogeny, evaluating the likelihood for conservation or loss or gain of function during evolution. We have performed, as an example, a cross-species analysis of E. coli, Salmonella typhimurium, and Yersinia pseudotuberculosis. Detailed lists of predicted sites and their functional phylogenies are available.

Footnotes

  • * To whom correspondence should be addressed. E-mail: lassig{at}thp.uni-koeln.de.

  • Author contributions: V.M. and M.L. designed research, performed research, analyzed data, and wrote the paper.

  • This paper was submitted directly (Track II) to the PNAS office.

  • Abbreviation: CRP, cAMP response protein.

« Previous | Next Article »Table of Contents