Proceedings of the National Academy of Sciences

Published online on March 12, 2004, 10.1073/pnas.0307760101
PNAS | April 6, 2004 | vol. 101 | Suppl. 1 | 5220-5227


COLLOQUIUM PAPERS
Mixed-membership models of scientific publications

Elena Erosheva*†, Stephen Fienberg‡§, and John Lafferty§¶

*Department of Statistics, School of Social Work, and Center for Statistics and the Social Sciences, University of Washington, Seattle, WA 98195; and ‡Department of Statistics, Computer Science Department, and §Center for Automated Learning and Discovery, Carnegie Mellon University, Pittsburgh, PA 15213

PNAS is one of the world's most cited multidisciplinary scientific journals. The official PNAS subject classification is reflected in the topic labels that authors submit with their articles, and these labels largely follow traditionally established disciplines. They include broad field classifications into physical, biological, and social sciences, with further subtopic classifications within each field. Focusing on the biological sciences, we explore an internal soft-classification structure of articles based only on semantic decompositions of abstracts and bibliographies, and we compare it with the formal discipline classifications. Our model assumes a fixed number of internal categories, each characterized by multinomial distributions over words (in abstracts) and references (in bibliographies). The soft classification of each article is based on the proportions of the article's content that come from each category. We discuss the appropriateness of the model for the PNAS database, as well as other features of the data relevant to soft classification.
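The generative idea in the abstract can be illustrated with a small simulation. The following is a minimal sketch under stated assumptions, not the authors' implementation: the Dirichlet prior on membership proportions, the toy vocabulary and reference sets, and all function and parameter names (`generate_article`, `alpha`, `word_dists`, `ref_dists`) are hypothetical choices made for illustration. Each article receives a vector of membership proportions, and each word or reference is drawn from the multinomial distribution of a category chosen according to those proportions.

```python
import random


def sample_dirichlet(alpha, rng):
    """Draw proportions from a Dirichlet by normalizing independent gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]


def sample_categorical(probs, rng):
    """Return an index drawn with the given probabilities."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


def generate_article(alpha, word_dists, ref_dists, n_words, n_refs, rng):
    """Generate one synthetic article (illustrative assumptions throughout).

    alpha      : Dirichlet parameters, one per internal category (assumed prior)
    word_dists : per-category multinomial over a toy vocabulary
    ref_dists  : per-category multinomial over a toy set of references
    """
    # Soft classification: the share of the article drawn from each category.
    membership = sample_dirichlet(alpha, rng)

    words = []
    for _ in range(n_words):
        k = sample_categorical(membership, rng)        # category for this word
        words.append(sample_categorical(word_dists[k], rng))

    refs = []
    for _ in range(n_refs):
        k = sample_categorical(membership, rng)        # category for this reference
        refs.append(sample_categorical(ref_dists[k], rng))

    return membership, words, refs


# Toy setup: 3 categories, 4 vocabulary words, 3 possible references.
rng = random.Random(0)
alpha = [0.5, 0.5, 0.5]
word_dists = [
    [0.7, 0.1, 0.1, 0.1],
    [0.1, 0.7, 0.1, 0.1],
    [0.1, 0.1, 0.4, 0.4],
]
ref_dists = [
    [0.8, 0.1, 0.1],
    [0.1, 0.8, 0.1],
    [0.1, 0.1, 0.8],
]
membership, words, refs = generate_article(
    alpha, word_dists, ref_dists, n_words=20, n_refs=5, rng=rng
)
print("membership proportions:", [round(p, 3) for p in membership])
print("word indices:", words)
print("reference indices:", refs)
```

The sketch only simulates data. Estimating membership proportions from real abstracts and bibliographies requires an inference procedure that the abstract does not describe.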


This paper results from the Arthur M. Sackler Colloquium of the National Academy of Sciences, "Mapping Knowledge Domains," held May 9-11, 2003, at the Arnold and Mabel Beckman Center of the National Academies of Sciences and Engineering in Irvine, CA.

† To whom correspondence should be addressed. E-mail: elena@stat.washington.edu.




Related Web Pages:

NAS Sackler Colloquium on Mapping Knowledge Domains

This article has been cited by other articles in HighWire Press-hosted journals:


K.-A. Sohn and E. P. Xing
Spectrum: joint bayesian inference of population structure and recombination events
Bioinformatics, July 1, 2007; 23(13): i479 - i489.

K. G. Manton, V. L. Lamb, and XiLiang Gu
Medicare Cost Effects of Recent U.S. Disability Trends in the Elderly: Future Implications
J Aging Health, June 1, 2007; 19(3): 359 - 381.

N. A. Rosenberg and M. Nordborg
A General Population-Genetic Model for the Production by Population Structure of Spurious Genotype-Phenotype Associations in Discrete, Admixed or Spatially Distributed Populations
Genetics, July 1, 2006; 173(3): 1665 - 1678.

K. W. Boyack
Mapping knowledge domains: Characterizing PNAS
PNAS, April 6, 2004; 101(suppl_1): 5192 - 5199.

T. K. Landauer, D. Laham, and M. Derr
From paragraph to graph: Latent semantic analysis for information visualization
PNAS, April 6, 2004; 101(suppl_1): 5214 - 5219.