A general framework for multiple testing dependence

  1. Jeffrey T. Leeka and
  2. John D. Storeyb,1
  1. aDepartment of Oncology, Johns Hopkins University School of Medicine, Baltimore, MD 21287; and
  2. bLewis-Sigler Institute and Department of Molecular Biology, Princeton University, Princeton, NJ 08544
  1. Communicated by Burton H. Singer, Princeton University, Princeton, NJ, September 4, 2008 (received for review May 8, 2008)

Abstract

We develop a general framework for performing large-scale significance testing in the presence of arbitrarily strong dependence. We derive a low-dimensional set of random vectors, called a dependence kernel, that fully captures the dependence structure in an observed high-dimensional dataset. This result shows a surprising reversal of the “curse of dimensionality” in the high-dimensional hypothesis testing setting. We show theoretically that conditioning on a dependence kernel is sufficient to render statistical tests independent regardless of the level of dependence in the observed data. This framework for multiple testing dependence has implications in a variety of common multiple testing problems, such as in gene expression studies, brain imaging, and spatial epidemiology.

Footnotes

  • 1To whom correspondence should be addressed. E-mail: jstorey{at}princeton.edu
  • Author contributions: J.T.L. and J.D.S. designed research, performed research, contributed new reagents/analytic tools, analyzed data, and wrote the paper.

  • The authors declare no conflict of interest.

  • This article contains supporting information online at www.pnas.org/cgi/content/full/0808709105/DCSupplemental.

  • Freely available online through the PNAS open access option.

« Previous | Next Article »Table of Contents
OPEN ACCESS ARTICLE