COLLOQUIUM PAPERS
Evolution of document networks

School of Informatics, Indiana University, Bloomington, IN 47408
How does a network of documents grow without centralized control? This question is becoming crucial as we try to explain the emergent scale-free topology of the World Wide Web and use link analysis to identify important information resources. Existing models of growing information networks have focused on the structure of links but neglected the content of nodes. Here I show that the current models fail to reproduce a critical characteristic of information networks, namely the distribution of textual similarity among linked documents. I propose a more realistic model that generates links by using both popularity and content. This model yields remarkably accurate predictions of both degree and similarity distributions in networks of web pages and scientific literature.
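The idea of generating links from both popularity and content can be illustrated with a toy growth process. The sketch below is only an assumption-laden illustration, not the model proposed in this paper: each new document picks targets with a probability that mixes its share of existing links (popularity) with its share of cosine similarity (content). The function name `grow_network`, the mixing weight `alpha`, the random content vectors, and the linear mixture itself are all hypothetical choices made for the example.

```python
import math
import random


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


def grow_network(n, m=2, alpha=0.5, dims=5, seed=0):
    """Grow an n-node network; each new node adds m links.

    Hypothetical mixture rule (an assumption, not the paper's model):
    P(link to j) = alpha * degree share + (1 - alpha) * similarity share.
    """
    assert n > m >= 1
    rng = random.Random(seed)
    # Stand-in for document content: random non-negative term vectors.
    content = [[rng.random() for _ in range(dims)] for _ in range(n)]
    degree = [0] * n
    edges = []

    # Start from a small fully connected core of m + 1 documents.
    for i in range(m + 1):
        for j in range(i):
            edges.append((i, j))
            degree[i] += 1
            degree[j] += 1

    for i in range(m + 1, n):
        total_deg = sum(degree[:i])
        sims = [cosine(content[i], content[j]) for j in range(i)]
        total_sim = sum(sims)
        weights = []
        for j in range(i):
            pop = degree[j] / total_deg
            # Fall back to a uniform share if all similarities are zero.
            sim = sims[j] / total_sim if total_sim else 1.0 / i
            weights.append(alpha * pop + (1 - alpha) * sim)
        # Draw m distinct targets by repeated weighted sampling.
        targets = set()
        while len(targets) < m:
            targets.add(rng.choices(range(i), weights=weights)[0])
        for j in sorted(targets):
            edges.append((i, j))
            degree[i] += 1
            degree[j] += 1

    return edges, degree, content


edges, degree, content = grow_network(50, m=2, alpha=0.5)
```

Setting `alpha=1.0` reduces this toy process to pure popularity-driven attachment, while `alpha=0.0` links on content alone; comparing the degree and linked-pair similarity distributions across values of `alpha` is one way to explore the trade-off the abstract describes.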