Pharmacology and Physiology Faculty Publications

Concordant integrative gene set enrichment analysis of multiple large-scale two-sample expression data sets

Yinglei Lai, George Washington UniversityFollow
Fanni Zhang, George Washington University
Tapan K. Nayak, George Washington University
Reza Modarres, George Washington University
Norman H. Lee, George Washington UniversityFollow
Timothy A. McCaffrey, George Washington UniversityFollow

Document Type

Journal Article

Publication Date

2014

Journal

BMC Genomics

Volume

Volume 15, Supplement 1

Inclusive Pages

Article number S6

Abstract

Background

Gene set enrichment analysis (GSEA) is an important approach to the analysis of coordinate expression changes at a pathway level. Although many statistical and computational methods have been proposed for GSEA, the issue of a concordant integrative GSEA of multiple expression data sets has not been well addressed. Among different related data sets collected for the same or similar study purposes, it is important to identify pathways or gene sets with concordant enrichment.

Methods

We categorize the underlying true states of differential expression into three representative categories: no change, positive change and negative change. Due to data noise, what we observe from experiments may not indicate the underlying truth. Although these categories are not observed in practice, they can be considered in a mixture model framework. Then, we define the mathematical concept of concordant gene set enrichment and calculate its related probability based on a three-component multivariate normal mixture model. The related false discovery rate can be calculated and used to rank different gene sets.

Results

We used three published lung cancer microarray gene expression data sets to illustrate our proposed method. One analysis based on the first two data sets was conducted to compare our result with a previous published result based on a GSEA conducted separately for each individual data set. This comparison illustrates the advantage of our proposed concordant integrative gene set enrichment analysis. Then, with a relatively new and larger pathway collection, we used our method to conduct an integrative analysis of the first two data sets and also all three data sets. Both results showed that many gene sets could be identified with low false discovery rates. A consistency between both results was also observed. A further exploration based on the KEGG cancer pathway collection showed that a majority of these pathways could be identified by our proposed method.

Conclusions

This study illustrates that we can improve detection power and discovery consistency through a concordant integrative analysis of multiple large-scale two-sample gene expression data sets.

Comments

Reproduced with permission of BMC Genomics.

Creative Commons License

This work is licensed under a Creative Commons Attribution 3.0 License.

APA Citation

Lai, Y., Zhang, F., Nayak, T.K., Modarres, R., Lee, N.H., McCaffrey, T.A (2014). Concordant integrative gene set enrichment analysis of multiple large-scale two-sample expression data sets. BMC Genomics, 15(suppl.1):S6.

Peer Reviewed

Open Access

Download

Included in

Medical Pharmacology Commons, Medical Physiology Commons

COinS

Pharmacology and Physiology Faculty Publications

Concordant integrative gene set enrichment analysis of multiple large-scale two-sample expression data sets

Document Type

Publication Date

Journal

Volume

Inclusive Pages

Abstract

Background

Methods

Results

Conclusions

Comments

Creative Commons License

APA Citation

Peer Reviewed

Open Access

Included in

Search

Browse

Author Corner

Links

Pharmacology and Physiology Faculty Publications

Concordant integrative gene set enrichment analysis of multiple large-scale two-sample expression data sets

Authors

Document Type

Publication Date

Journal

Volume

Inclusive Pages

Abstract

Background

Methods

Results

Conclusions

Comments

Creative Commons License

APA Citation

Peer Reviewed

Open Access

Included in

Share

Search

Browse

Author Corner

Links