MOTIVATION The identification of pathways and biological processes from differential gene expression is central to the interpretation of data collected by transcriptomics assays. Gene-Set Enrichment Analysis (GSEA) is the most commonly used algorithm for calculating the significance of the relevance of an annotated gene set to a differential expression signature. To compute significance, GSEA implements permutation tests, which are slow and inaccurate when comparing many differential expression signatures to thousands of annotated gene sets.

RESULTS Here we present blitzGSEA, an algorithm that is based on the same running sum statistic as GSEA, but instead of performing permutations, blitzGSEA approximates the enrichment score probabilities based on Gamma distributions. blitzGSEA achieves a dramatic improvement in performance compared with prior GSEA implementations, while approximating small p-values more accurately.

AVAILABILITY A Python package, together with all source code and a detailed user guide, is available from GitHub at: https://github.com/MaayanLab/blitzgsea.

SUPPLEMENTARY INFORMATION Supplementary text and figures are available at Bioinformatics online.
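To make the idea in RESULTS concrete, the following is a minimal sketch, not the blitzGSEA implementation. It computes a weighted running sum enrichment score in the classic GSEA form, then summarizes a small null sample of positive scores with a Gamma distribution. The function names, the toy data, and the use of a method-of-moments fit are all assumptions made for illustration; the abstract does not specify how blitzGSEA estimates its Gamma parameters.

```python
import random


def enrichment_score(ranked_genes, scores, gene_set):
    """Signed maximum deviation of the weighted running sum (classic GSEA form).

    ranked_genes and scores are parallel lists, already sorted by score.
    """
    hits = [g in gene_set for g in ranked_genes]
    n_hit = sum(hits)
    n_miss = len(ranked_genes) - n_hit
    hit_total = sum(abs(s) for s, h in zip(scores, hits) if h)
    if n_hit == 0 or n_miss == 0 or hit_total == 0:
        raise ValueError("gene set must partially overlap the ranked list")

    running, best = 0.0, 0.0
    for s, h in zip(scores, hits):
        # Step up by the weighted score on a hit, down by a constant on a miss.
        running += abs(s) / hit_total if h else -1.0 / n_miss
        if abs(running) > abs(best):
            best = running
    return best


def gamma_moments(values):
    """Method-of-moments Gamma fit (an assumed, illustrative estimator).

    Returns (shape, scale) such that shape * scale equals the sample mean.
    """
    n = len(values)
    mean = sum(values) / n
    var = sum((v - mean) ** 2 for v in values) / (n - 1)
    return mean * mean / var, var / mean


# Toy signature: 20 hypothetical genes with scores from 10 down to -9.
genes = [f"g{i}" for i in range(20)]
scores = [10.0 - i for i in range(20)]

# Small null sample from random gene sets of size 3, fixed seed for repeatability.
rng = random.Random(0)
null_scores = [
    enrichment_score(genes, scores, set(rng.sample(genes, 3)))
    for _ in range(500)
]
positive_null = [v for v in null_scores if v > 0]
shape, scale = gamma_moments(positive_null)
```

Given fitted parameters, a p-value for an observed positive score would be the upper tail of the fitted Gamma distribution, for example via `scipy.stats.gamma.sf(es, shape, scale=scale)`. The point of such an approximation is that a modest null sample can yield tail probabilities smaller than the permutation count alone could resolve.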
               