Skip to main content
  • Submit
  • About
    • Editorial Board
    • PNAS Staff
    • FAQ
    • Accessibility Statement
    • Rights and Permissions
    • Site Map
  • Contact
  • Journal Club
  • Subscribe
    • Subscription Rates
    • Subscriptions FAQ
    • Open Access
    • Recommend PNAS to Your Librarian
  • Log in
  • My Cart

Main menu

  • Home
  • Articles
    • Current
    • Special Feature Articles - Most Recent
    • Special Features
    • Colloquia
    • Collected Articles
    • PNAS Classics
    • List of Issues
  • Front Matter
  • News
    • For the Press
    • This Week In PNAS
    • PNAS in the News
  • Podcasts
  • Authors
    • Information for Authors
    • Editorial and Journal Policies
    • Submission Procedures
    • Fees and Licenses
  • Submit
  • About
    • Editorial Board
    • PNAS Staff
    • FAQ
    • Accessibility Statement
    • Rights and Permissions
    • Site Map
  • Contact
  • Journal Club
  • Subscribe
    • Subscription Rates
    • Subscriptions FAQ
    • Open Access
    • Recommend PNAS to Your Librarian

User menu

  • Log in
  • My Cart

Search

  • Advanced search
Home
Home

Advanced Search

  • Home
  • Articles
    • Current
    • Special Feature Articles - Most Recent
    • Special Features
    • Colloquia
    • Collected Articles
    • PNAS Classics
    • List of Issues
  • Front Matter
  • News
    • For the Press
    • This Week In PNAS
    • PNAS in the News
  • Podcasts
  • Authors
    • Information for Authors
    • Editorial and Journal Policies
    • Submission Procedures
    • Fees and Licenses

New Research In

Physical Sciences

Featured Portals

  • Physics
  • Chemistry
  • Sustainability Science

Articles by Topic

  • Applied Mathematics
  • Applied Physical Sciences
  • Astronomy
  • Computer Sciences
  • Earth, Atmospheric, and Planetary Sciences
  • Engineering
  • Environmental Sciences
  • Mathematics
  • Statistics

Social Sciences

Featured Portals

  • Anthropology
  • Sustainability Science

Articles by Topic

  • Economic Sciences
  • Environmental Sciences
  • Political Sciences
  • Psychological and Cognitive Sciences
  • Social Sciences

Biological Sciences

Featured Portals

  • Sustainability Science

Articles by Topic

  • Agricultural Sciences
  • Anthropology
  • Applied Biological Sciences
  • Biochemistry
  • Biophysics and Computational Biology
  • Cell Biology
  • Developmental Biology
  • Ecology
  • Environmental Sciences
  • Evolution
  • Genetics
  • Immunology and Inflammation
  • Medical Sciences
  • Microbiology
  • Neuroscience
  • Pharmacology
  • Physiology
  • Plant Biology
  • Population Biology
  • Psychological and Cognitive Sciences
  • Sustainability Science
  • Systems Biology
Commentary

Lasting impressions: Motifs in protein–protein maps may provide footprints of evolutionary events

J. Jeremy Rice, Aaron Kershenbaum, and Gustavo Stolovitzky
PNAS March 1, 2005 102 (9) 3173-3174; https://doi.org/10.1073/pnas.0500130102
J. Jeremy Rice
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Aaron Kershenbaum
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Gustavo Stolovitzky
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site

Related Article

  • Inferring network mechanisms: The Drosophila melanogaster protein interaction network
    - Feb 22, 2005
  • Article
  • Figures & SI
  • Info & Metrics
  • PDF
Loading

Imagine a paleontologist confronted with of a fossil of a single footprint. She could probably risk some conjectures about the imprinting creature, such as whether it possessed paws, claws, or feet, and estimate its weight, height, and other attributes. But the information contained in a single static print remains limited. The situation improves if a multitude of footprints from a population of creatures becomes available. Now, stride lengths can be calculated, variability between individuals can be assessed, and speeds can be estimated. Furthermore, correlating footprints from different geographies could lead to clues about the population's migration patterns. With appropriate assumptions, our paleontologist could begin to map how the population evolved over time.

As this example illustrates, even though a footprint is a static entity, the sum of many footprints could in principle provide clues about the dynamics of a population. In this vein, the work of Middendorf, Ziv, and Wiggins in this issue of PNAS (1) seeks to understand the broad strokes of the evolutionary dynamics that shaped a species from a static network of protein–protein interactions. By way of “footprints,” Middendorf et al. (1) use substructures in the protein–protein network called network motifs whose count provides a window into the dynamics that left the record. These counts are the markers that the authors use to “learn” which motifs differentiate between competing hypothetical evolutionary schemes. The outcome of the study suggests a dominant evolutionary mechanism that shaped Drosophila melanogaster.

Evolution Revealed in Manmade Networks

Networks of all sorts evolve, and it is often possible to form hypotheses about such evolution by direct examination of the structure of the network (2–4). Consider the U.S. telephone network, which evolved over the past 100 years in response to changes in traffic load and advances in technology. The telephone network was originally engineered to carry voice traffic. The addition of data traffic, which eventually dominated voice, was an evolutionary pressure to this network that forced radical changes in its topology. In the 1960s, as the volume of data traffic carried on the network occupied a significant amount of bandwidth, the facilities of many large cities became saturated, and it was necessary to relieve the congestion by rerouting interstate traffic. These changes led to a network topology that resembled that of the interstate highway network, where “beltways” surround the major cities to keep interstate congestion off local roads. These beltways can be observed directly as motifs in today's telephone network topology. Today's topology is the result of evolutionary forces that could be deduced by modeling changes in topology necessary to overcome the rise in data traffic. On a larger scale, there was a shift in topology that resulted from the need to add major capacity in the southern half of the U.S. accompanying the population shift to warmer climates. Notice that the topology changes necessary to respond to local and global pressures are different.

Characterizing Biological Networks

In contrast with the telephone network example, where design and control are explicitly engineered, our understanding of biological network design principles and of mechanisms that control the traffic of biological information is very poor. Part of today's challenge, therefore, is to elucidate the principles on which biological networks evolve.

In recent years, biological network architectures have been characterized by properties such as sparseness (5), small-world (6, 7), and scale-free (8). These network characterizations are global in that a single number, such as the average connectivity or the radius (average number of hops between any two nodes), describes a property of the whole network. An alternative approach to characterize a network is via topological motifs. If the number of occurrences of a motif is large compared with what is expected by chance, then a case can be made that such a motif represents a reusable functional module or is the consequence of evolutionary mechanisms. Thus, motifs such as those shown in Fig. 1 have been discovered from a number of complex networks (9, 10). The feed-forward triangle and the bi-fan square (see Fig. 1), for example, occurred with a Z score of >10, both in the Escherichia coli and Saccharomyces cerevisiae gene regulatory networks (10).

Fig. 1.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. 1.

Some of the most common motifs found in biological networks. Nodes indicate cellular components, such as genes or proteins. Edges represent associations between nodes, such as binding (undirected edges) or influence (directed edges).

A high Z score does not necessarily imply biological relevance for a motif. Efforts are underway to find biological interpretations to what statistical significance can only suggest. Alon and coworkers (11) have associated the feed-forward motif with a rapid-response filter of noisy inputs. Cycles, both positive and negative, can be associated with feedback loops leading to responses over an extended time frame. The relatively large number of squares and triangles observed in the protein interaction network of Drosophila (1) directly points to possible mechanisms at work in its evolutionary history (see Fig. 2).

Fig. 2.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. 2.

Two examples of network evolution through the DMC mechanism. Before the iteration, a node is chosen at random (blue node) and replicated (orange node) along with its connections (orange edges). The edges linking the original or replicated node and its neighbors can be mutated and rendered unfunctional (shown with an X in scenarios 1 and 2). The original and duplicated nodes can be conjoined (only scenario 2) with some finite probability. Scenario 1 leads to the creation of square subgraphs, whereas scenario 2 leads to the creation of square and triangle subgraphs.

Protein–Protein Interaction Networks

The two-hybrid studies that produced the protein interaction map for D. melanogaster (12) provide a valuable genome-wide view of protein interactions but have a number of shortcomings (13). Even if the protein–protein interactions were determined with high accuracy, the resulting network would still require careful interpretation to extract its underlying biological meaning. Specifically, the map is a representation of all possible interactions, but one would only expect some fraction to be operating at any given time. Hence, the map is a static imprint of all possible interactions that clearly lacks dynamic information. Combine this feature with the considerable sources of noise and artifacts known to protein–protein networks and one can see the difficulties in using interaction data. However, as in our footprint analogy, interpreting a multitude of static records may give clues to dynamic interactions, even if some records are faulty (i.e., not every footprint needs to be intact).

Machine-Learning Approaches

Rather than discovering all motifs in a network (a task that would result in a combinatorial explosion of subgraphs), previous studies (10) have used statistical criteria to select significant subgraphs from a family of prescribed subnetworks, such as all n-node subgraphs (n = 3 and 4). Similar statistical criteria were used to cluster disparate networks into superfamilies that share the same (normalized) vector of Z scores over all three- or four-node subgraphs (14). However, problems may arise with these approaches if the underlying null hypothesis is not posed carefully (15). Middendorf et al. (1) avoided the use of statistical considerations in the selection or normalization of subgraph counts. They explored all subgraphs that verified two criteria: (i) subgraphs produced by walks of up to eight hops and (ii) subgraphs with up to seven edges. Although such criteria keep the search computationally tractable, other important motifs could be missed. The raw counts of these motifs in the Drosophila network were used as inputs into a learning machine previously trained to recognize networks grown by using seven different putative evolutionary trends: duplication–mutation–complementation (DMC) (Fig. 2), duplication–mutation followed by random attachment, linear preferential attachment, small-world networks, random static networks, random growing networks, and aging vertex networks. As a machine-learning algorithm, the authors chose the alternating decision tree (ADT) (16), which allows for a reasonably straightforward identification of the main subgraphs used in the decisions. With the Drosophila network, the ADT algorithm chose the DMC mechanism as the best choice to explain the subgraph counts in the input network. Indeed, two of the most discriminative subgraphs chosen by the ADT to make its decision were triangles and squares (table 6 in ref. 1), which can be easily interpreted in the context of the DMC mechanism (Fig. 2).

Future Directions

The study by Middendorf et al. (1) is a laudable step toward extracting evolutionary information from static biological networks, but certainly much remains unanswered. Most likely, the evolution of protein–protein maps involves complexities that we are yet to fully grasp. For example, whereas the small-scale features contained in the network motifs are well captured by the DMC mechanism, some large-scale features of the Drosophila interaction map, such as the size of the giant component (table 1, SM, in ref. 1), do not seem to be recapitulated by DMC. This inconsistency should not be too surprising because different aspects of the interaction map may respond to different evolutionary pressures. As with the U.S. phone system, different pressures shaped the network at the city and country levels. Such a notion is compatible with theoretical work that characterized small-world graphs as a superposition of multiple graphs rather than a homogenous entity (3). One might expect analogous superpositions in biological networks. For example, a hierarchical arrangement of connected clusters is described for E. coli metabolic networks (17). Refinements in techniques as well as larger data sets from more species will help us to better trace the evolutionary past and the logical relations that make the networks functional in the present (18).

Footnotes

  • ↵ * To whom correspondence should be addressed at: The T. J. Watson Research Center, IBM, P.O. Box 218, Yorktown Heights, NY 10598. E-mail: gustavo{at}us.ibm.com.

  • See companion article on page 3192.

  • Copyright © 2005, The National Academy of Sciences

References

  1. ↵
    Middendorf, M., Ziv, E. & Wiggins, C. H. (2005) Proc. Natl. Acad. Sci. USA 102 , 3192-3197. pmid:15728374
    OpenUrlAbstract/FREE Full Text
  2. ↵
    Dorogovtsev, S. N. & Mendes, J. F. F. (2003) Evolution of Networks: From Biological Nets to the Internet and WWW (Oxford Univ. Press, Oxford).
  3. ↵
    Watts, D. J. & Strogatz, S. H. (1998) Nature 393 , 440-442. pmid:9623998
    OpenUrlCrossRefPubMed
  4. ↵
    Albert, R. & Barabasi, A. L. (2000) Phys. Rev. Lett. 85 , 5234-5237. pmid:11102229
    OpenUrlCrossRefPubMed
  5. ↵
    Wagner, A. (2002) Genome Res. 12 , 309-315. pmid:11827950
    OpenUrlAbstract/FREE Full Text
  6. ↵
    Wagner, A. & Fell, D. A. (2001) Proc. R. Soc. London B 268 , 1803-1810.
    OpenUrlPubMed
  7. ↵
    Arita, M. (2004) Proc. Natl. Acad. Sci. USA 101 , 1543-1547. pmid:14757824
    OpenUrlAbstract/FREE Full Text
  8. ↵
    Jeong, H., Tombor, B., Albert, R., Oltvai, Z. N. & Barabasi, A. L. (2000) Nature 407 , 651-654. pmid:11034217
    OpenUrlCrossRefPubMed
  9. ↵
    Shen-Orr, S. S., Milo, R., Mangan, S. & Alon, U. (2002) Nat. Genet. 31 , 64-68. pmid:11967538
    OpenUrlCrossRefPubMed
  10. ↵
    Milo, R., Shen-Orr, S., Itzkovitz, S., Kashtan, N., Chklovskii, D. & Alon, U. (2002) Science 298 , 824-827. pmid:12399590
    OpenUrlAbstract/FREE Full Text
  11. ↵
    Mangan, S., Zaslaver, A. & Alon, U. (2003) J. Mol. Biol. 334 , 197-204. pmid:14607112
    OpenUrlCrossRefPubMed
  12. ↵
    Giot, L., Bader, J. S., Brouwer, C., Chaudhuri, A., Kuang, B., Li, Y., Hao, Y. L., Ooi, C. E., Godwin, B., Vitols, E., et al. (2003) Science 302 , 1727-1736. pmid:14605208
    OpenUrlAbstract/FREE Full Text
  13. ↵
    Jansen, R. & Gerstein, M. (2004) Curr. Opin. Microbiol. 7 , 535-545. pmid:15451510
    OpenUrlCrossRefPubMed
  14. ↵
    Milo, R., Itzkovitz, S., Kashtan, N., Levitt, R., Shen-Orr, S., Ayzenshtat, I., Sheffer, M. & Alon, U. (2004) Science 303 , 1538-1542. pmid:15001784
    OpenUrlAbstract/FREE Full Text
  15. ↵
    Artzy-Randrup, Y., Fleishman, S. J., Ben-Tal, N. & Stone, L. (2004) Science 305 , 1107.
    OpenUrlFREE Full Text
  16. ↵
    Middendorf, M., Kundaje, A., Wiggins, C., Freund, Y. & Leslie, C. (2004) Bioinformatics 20 , Suppl. 1, I232-I240. pmid:15262804
  17. ↵
    Ravasz, E., Somera, A. L., Mongru, D. A., Oltvai, Z. N. & Barabasi, A. L. (2002) Science 297 , 1551-1555. pmid:12202830
    OpenUrlAbstract/FREE Full Text
  18. ↵
    Bowers, P. M., Cokus, S. J., Eisenberg, D. & Yeates, T. O. (2004) Science 306 , 2246-2249. pmid:15618515
    OpenUrlAbstract/FREE Full Text
PreviousNext
Back to top
Article Alerts
Email Article

Thank you for your interest in spreading the word on PNAS.

NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

Enter multiple addresses on separate lines or separate them with commas.
Lasting impressions: Motifs in protein–protein maps may provide footprints of evolutionary events
(Your Name) has sent you a message from PNAS
(Your Name) thought you would like to see the PNAS web site.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Citation Tools
Lasting impressions: Motifs in protein–protein maps may provide footprints of evolutionary events
J. Jeremy Rice, Aaron Kershenbaum, Gustavo Stolovitzky
Proceedings of the National Academy of Sciences Mar 2005, 102 (9) 3173-3174; DOI: 10.1073/pnas.0500130102

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Request Permissions
Share
Lasting impressions: Motifs in protein–protein maps may provide footprints of evolutionary events
J. Jeremy Rice, Aaron Kershenbaum, Gustavo Stolovitzky
Proceedings of the National Academy of Sciences Mar 2005, 102 (9) 3173-3174; DOI: 10.1073/pnas.0500130102
Digg logo Reddit logo Twitter logo Facebook logo Google logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Mendeley logo Mendeley
Proceedings of the National Academy of Sciences of the United States of America: 102 (9)
Table of Contents

Submit

Sign up for Article Alerts

Jump to section

  • Article
    • Evolution Revealed in Manmade Networks
    • Characterizing Biological Networks
    • Protein–Protein Interaction Networks
    • Machine-Learning Approaches
    • Future Directions
    • Footnotes
    • References
  • Figures & SI
  • Info & Metrics
  • PDF

You May Also be Interested in

Surgeons hands during surgery
Inner Workings: Advances in infectious disease treatment promise to expand the pool of donor organs
Despite myriad challenges, clinicians see room for progress.
Image credit: Shutterstock/David Tadevosian.
Setting sun over a sun-baked dirt landscape
Core Concept: Popular integrated assessment climate policy models have key caveats
Better explicating the strengths and shortcomings of these models will help refine projections and improve transparency in the years ahead.
Image credit: Witsawat.S.
Double helix
Journal Club: Noncoding DNA shown to underlie function, cause limb malformations
Using CRISPR, researchers showed that a region some used to label “junk DNA” has a major role in a rare genetic disorder.
Image credit: Nathan Devery.
Steamboat Geyser eruption.
Eruption of Steamboat Geyser
Mara Reed and Michael Manga explore why Yellowstone's Steamboat Geyser resumed erupting in 2018.
Listen
Past PodcastsSubscribe
Multi-color molecular model
Enzymatic breakdown of PET plastic
A study demonstrates how two enzymes—MHETase and PETase—work synergistically to depolymerize the plastic pollutant PET.
Image credit: Aaron McGeehan (artist).

Similar Articles

Site Logo
Powered by HighWire
  • Submit Manuscript
  • Twitter
  • Facebook
  • RSS Feeds
  • Email Alerts

Articles

  • Current Issue
  • Special Feature Articles – Most Recent
  • List of Issues

PNAS Portals

  • Anthropology
  • Chemistry
  • Classics
  • Front Matter
  • Physics
  • Sustainability Science
  • Teaching Resources

Information

  • Authors
  • Editorial Board
  • Reviewers
  • Librarians
  • Press
  • Site Map
  • PNAS Updates

Feedback    Privacy/Legal

Copyright © 2021 National Academy of Sciences. Online ISSN 1091-6490