Approaches to multiplicity issues in complex research in microarray analysis
False discovery rate; Hierarchical testing; High throughput analysis
The multiplicity problem is evident in the simplest form of statistical analysis of gene expression data - the identification of differentially expressed genes. In more complex analysis, the problem is compounded by the multiplicity of hypotheses per gene. Thus, in some cases, it may be necessary to consider testing millions of hypotheses. We present three general approaches for addressing multiplicity in large research problems, (a) Use the scalability of false discovery rate (FDR) controlling procedures; (b) apply FDR-controlling procedures to a selected subset of hypotheses; (c) apply hierarchical FDR-controlling procedures. We also offer a general framework for ensuring reproducible results in complex research, where a researcher faces more than just one large research problem. We demonstrate these approaches by analyzing the results of a complex experiment involving the study of gene expression levels in different brain regions across multiple mouse strains. © 2006 VVS.
Yekutieli, D., Reiner-Benaim, A., Benjamini, Y., Elmer, G., Kafkafi, N., Letwin, N., & Lee, N. (2006). Approaches to multiplicity issues in complex research in microarray analysis. Statistica Neerlandica, 60 (4). http://dx.doi.org/10.1111/j.1467-9574.2006.00343.x