PNAS Peer Review  Sign up for PNAS Online eTocs
Link: Info for AuthorsLink: Editorial BoardLink: AboutLink: SubscribeLink: AdvertiseLink: ContactLink: Sitemap Link: PNAS Home
Proceedings of the National Academy of Sciences
Link: Current Issue "" Link: Archives "" Link: Online Submission ""  Link: Advanced Search

Published online on October 27, 2003, 10.1073/pnas.1733249100
PNAS | November 11, 2003 | vol. 100 | no. 23 | 13167-13172


This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Supporting Information
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a colleague
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My File Cabinet
Right arrow Download to citation manager
Right arrow Request Copyright Permission
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via CrossRef
Right arrow Citing Articles via ISI Web of Science (20)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Liu, L.
Right arrow Articles by Young, S. S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Liu, L.
Right arrow Articles by Young, S. S.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg  
What's this?

 Previous Article  | Table of Contents |  Next Article 

Statistics
Robust singular value decomposition analysis of microarray data

Li Liu * {dagger}, Douglas M. Hawkins {ddagger}, Sujoy Ghosh §, and S. Stanley Young *

*National Institute of Statistical Sciences, P.O. Box 14006, Research Triangle Park, NC 27709-4006; {ddagger}School of Statistics, University of Minnesota, 313 Ford Hall, 224 Church Street NE, Minneapolis, MN 55455; and §GlaxoSmithKline, Research Triangle Park, NC 27709

Edited by Peter J. Bickel, University of California, Berkeley, CA, and approved July 2, 2003 (received for review May 22, 2003)

In microarray data there are a number of biological samples, each assessed for the level of gene expression for a typically large number of genes. There is a need to examine these data with statistical techniques to help discern possible patterns in the data. Our technique applies a combination of mathematical and statistical methods to progressively take the data set apart so that different aspects can be examined for both general patterns and very specific effects. Unfortunately, these data tables are often corrupted with extreme values (outliers), missing values, and non-normal distributions that preclude standard analysis. We develop a robust analysis method to address these problems. The benefits of this robust analysis will be both the understanding of large-scale shifts in gene effects and the isolation of particular sample-by-gene effects that might be either unusual interactions or the result of experimental flaws. Our method requires a single pass and does not resort to complex "cleaning" or imputation of the data table before analysis. We illustrate the method with a commercial data set.


This paper was submitted directly (Track II) to the PNAS office.

Abbreviations: SVD, singular value decomposition; rSVD, robust SVD; ARF, alternating robust fitting.

{dagger} To whom correspondence should be sent at present address: Aventis Pharmaceuticals, Bridgewater, NJ 08807. E-mail: li.liu{at}aventis.com.


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg    What's this?


This article has been cited by other articles in HighWire Press-hosted journals:


Home page
Physiol. GenomicsHome page
L. Li, L. Ying, M. Naesens, W. Xiao, T. Sigdel, S. Hsieh, J. Martin, R. Chen, K. Liu, M. Mindrinos, et al.
Interference of globin genes with biomarker discovery for allograft rejection in peripheral blood samples
Physiol Genomics, January 17, 2008; 32(2): 190 - 197.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
P. Fogel, S. S. Young, D. M. Hawkins, and N. Ledirac
Inferential, robust non-negative matrix factorization analysis of microarray data
Bioinformatics, January 1, 2007; 23(1): 44 - 49.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
G. W. Carter, S. Rupp, G. R. Fink, and T. Galitski
Disentangling information flow in the Ras-cAMP signaling network
Genome Res., April 1, 2006; 16(4): 520 - 526.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
R. A. Irizarry, Z. Wu, and H. A. Jaffee
Comparison of Affymetrix GeneChip expression measures
Bioinformatics, April 1, 2006; 22(7): 789 - 794.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Tuikkala, L. Elo, O. S. Nevalainen, and T. Aittokallio
Improving missing value estimation in microarray data with gene ontology
Bioinformatics, March 1, 2006; 22(5): 566 - 572.
[Abstract] [Full Text] [PDF]