Previous Article |
Table of Contents
| Next Article
PHYSICAL SCIENCES / BIOLOGICAL SCIENCES / APPLIED MATHEMATICS / EVOLUTION
Defining functional distance using manifold embeddings of gene ontology annotations

,
*Department of Mathematics, University of Minnesota, Minneapolis, MN 55455; and
Program in Bioinformatics, Boston University, Boston, MA 02215
Communicated by Ronald R. Coifman, Yale University, New Haven, CT, April 9, 2007 (received for review June 7, 2006)
Although rigorous measures of similarity for sequence and structure are now well established, the problem of defining functional relationships has been particularly daunting. Here, we present several manifold embedding techniques to compute distances between Gene Ontology (GO) functional annotations and consequently estimate functional distances between protein domains. To evaluate accuracy, we correlate the functional distance to the well established measures of sequence, structural, and phylogenetic similarities. Finally, we show that manual classification of structures into folds and superfamilies is mirrored by proximity in the newly defined function space. We show how functional distances place structure–function relationships in biological context resulting in insight into divergent and convergent evolution. The methods and results in this paper can be readily generalized and applied to a wide array of biologically relevant investigations, such as accuracy of annotation transference, the relationship between sequence, structure, and function, or coherence of expression modules.
kernel methods | diffusion geometry | domain evolution | functional annotation | homology modeling
The authors declare no conflict of interest.
This article contains supporting information online at www.pnas.org/cgi/content/full/0702965104/DC1.
To whom correspondence may be addressed. E-mail: lerman{at}umn.edu or borya{at}bu.edu
© 2007 by The National Academy of Sciences of the USA
![]()
CiteULike
Complore
Connotea
Del.icio.us
Digg What's this?