Statistics
Robust singular value decomposition analysis of microarray data



*National Institute of Statistical Sciences, P.O. Box 14006, Research Triangle Park, NC 27709-4006;
School of Statistics, University of Minnesota, 313 Ford Hall, 224 Church Street NE, Minneapolis, MN 55455; and
GlaxoSmithKline, Research Triangle Park, NC 27709
Edited by Peter J. Bickel, University of California, Berkeley, CA, and approved July 2, 2003 (received for review May 22, 2003)
In microarray data there are a number of biological samples, each assessed for the level of gene expression for a typically large number of genes. There is a need to examine these data with statistical techniques to help discern possible patterns in the data. Our technique applies a combination of mathematical and statistical methods to progressively take the data set apart so that different aspects can be examined for both general patterns and very specific effects. Unfortunately, these data tables are often corrupted with extreme values (outliers), missing values, and non-normal distributions that preclude standard analysis. We develop a robust analysis method to address these problems. The benefits of this robust analysis will be both the understanding of large-scale shifts in gene effects and the isolation of particular sample-by-gene effects that might be either unusual interactions or the result of experimental flaws. Our method requires a single pass and does not resort to complex "cleaning" or imputation of the data table before analysis. We illustrate the method with a commercial data set.
Abbreviations: SVD, singular value decomposition; rSVD, robust SVD; ARF, alternating robust fitting.
To whom correspondence should be sent at present address: Aventis Pharmaceuticals, Bridgewater, NJ 08807. E-mail: li.liu@aventis.com.
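
The abstract names alternating robust fitting (ARF) as the basis of the robust SVD but does not spell out the algorithm. The sketch below is one plausible reading, not the authors' implementation: it assumes that a single rank-1 term is fitted by alternating least-absolute-deviation (L1) regressions, each of which reduces to a weighted median, and that missing cells are encoded as NaN and simply skipped. The function names (`weighted_median`, `fit_rows`, `robust_rank1`) and the parameters `n_iter` and `tol` are hypothetical, and plain Python lists are used to keep the example dependency-free.

```python
import math


def weighted_median(values, weights):
    """Return a minimizer m of sum_k weights[k] * |values[k] - m| (weights > 0)."""
    pairs = sorted(zip(values, weights))
    half = 0.5 * sum(w for _, w in pairs)
    running = 0.0
    for value, weight in pairs:
        running += weight
        if running >= half:
            return value
    return pairs[-1][0]  # guards against rounding in the running sum


def fit_rows(X, v):
    """For fixed v, choose each u[i] to minimize sum_j |X[i][j] - u[i] * v[j]|.

    Missing cells (NaN) and zero entries of v are skipped, so the table
    does not have to be imputed beforehand.
    """
    u = []
    for row in X:
        cells = [(x / vj, abs(vj)) for x, vj in zip(row, v)
                 if not math.isnan(x) and vj != 0.0]
        if cells:
            ratios, weights = zip(*cells)
            u.append(weighted_median(ratios, weights))
        else:
            u.append(0.0)  # assumption: a row with no usable cell gets score 0
    return u


def robust_rank1(X, n_iter=50, tol=1e-9):
    """Alternate the two L1 fits; return unit vectors u, v and a scale s
    such that X[i][j] is approximated by s * u[i] * v[j]."""
    Xt = [list(col) for col in zip(*X)]  # transpose, so columns can be fitted as rows
    v = [1.0] * len(Xt)                  # simple starting value (an assumption)
    for _ in range(n_iter):
        u = fit_rows(X, v)               # update row (gene) scores
        v_new = fit_rows(Xt, u)          # update column (sample) scores
        change = max(abs(p - q) for p, q in zip(v_new, v))
        v = v_new
        if change < tol:
            break
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(x * x for x in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return u, 0.0, v
    return [x / norm_u for x in u], norm_u * norm_v, [x / norm_v for x in v]
```

Under these assumptions, a table that follows a rank-1 pattern except for one extreme value and one missing cell is fitted to the clean pattern, and the extreme cell shows up as a large residual `X[i][j] - s * u[i] * v[j]` rather than distorting the fit. Further components could be obtained by repeating the fit on the residual table, which is again an assumption about how the decomposition proceeds.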