Abstract Motivation RNA-sequencing (RNA-seq) of tumor tissue is typically only used to measure gene expression. Here, we present a statistical approach that leverages existing RNA-seq data to also detect somatic… Click to show full abstract
Abstract Motivation RNA-sequencing (RNA-seq) of tumor tissue is typically only used to measure gene expression. Here, we present a statistical approach that leverages existing RNA-seq data to also detect somatic copy number alterations (SCNAs), a pervasive phenomenon in human cancers, without a need to sequence the corresponding DNA. Results We present an analysis of 4942 participant samples from 28 cancers in The Cancer Genome Atlas (TCGA), demonstrating robust detection of SCNAs from RNA-seq. Using genotype imputation and haplotype information, our RNA-based method had a median sensitivity of 85% to detect SCNAs defined by DNA analysis, at high specificity (∼95%). As an example of translational potential, we successfully replicated SCNA features associated with breast cancer subtypes. Our results credential haplotype-based inference based on RNA-seq to detect SCNAs in clinical and population-based settings. Availability and implementation The analyses presented use the data publicly available from TCGA Research Network (http://cancergenome.nih.gov/). See Methods for details regarding data downloads. hapLOHseq software is freely available under The MIT license and can be downloaded from http://scheet.org/software.html. Supplementary information Supplementary data are available at Bioinformatics online.
               
Click one of the above tabs to view related content.