A general framework for multiple testing dependence
- aDepartment of Oncology, Johns Hopkins University School of Medicine, Baltimore, MD 21287; and
- bLewis-Sigler Institute and Department of Molecular Biology, Princeton University, Princeton, NJ 08544
-
Communicated by Burton H. Singer, Princeton University, Princeton, NJ, September 4, 2008 (received for review May 8, 2008)
Abstract
We develop a general framework for performing large-scale significance testing in the presence of arbitrarily strong dependence. We derive a low-dimensional set of random vectors, called a dependence kernel, that fully captures the dependence structure in an observed high-dimensional dataset. This result shows a surprising reversal of the “curse of dimensionality” in the high-dimensional hypothesis testing setting. We show theoretically that conditioning on a dependence kernel is sufficient to render statistical tests independent regardless of the level of dependence in the observed data. This framework for multiple testing dependence has implications in a variety of common multiple testing problems, such as in gene expression studies, brain imaging, and spatial epidemiology.
Footnotes
- 1To whom correspondence should be addressed. E-mail: jstorey{at}princeton.edu
-
Author contributions: J.T.L. and J.D.S. designed research, performed research, contributed new reagents/analytic tools, analyzed data, and wrote the paper.
-
The authors declare no conflict of interest.
-
This article contains supporting information online at www.pnas.org/cgi/content/full/0808709105/DCSupplemental.
-
Freely available online through the PNAS open access option.
- © 2008 by the National Academy of Sciences of the USA










