LAUSR.org creates dashboard-style pages of related content for over 1.5 million academic articles. Sign Up to like articles & get recommendations!

Crumble: reference free lossy compression of sequence quality values

Photo by bermixstudio from unsplash

Motivation: The bulk of space taken up by NGS sequencing CRAM files consists of per‐base quality values. Most of these are unnecessary for variant calling, offering an opportunity for space… Click to show full abstract

Motivation: The bulk of space taken up by NGS sequencing CRAM files consists of per‐base quality values. Most of these are unnecessary for variant calling, offering an opportunity for space saving. Results: On the Syndip test set, a 17 fold reduction in the quality storage portion of a CRAM file can be achieved while maintaining variant calling accuracy. The size reduction of an entire CRAM file varied from 2.2 to 7.4 fold, depending on the non‐quality content of the original file (see Supplementary Material S6 for details). Availability and implementation: Crumble is OpenSource and can be obtained from https://github.com/jkbonfield/crumble. Supplementary information: Supplementary data are available at Bioinformatics online.

Keywords: quality; quality values; free lossy; lossy compression; reference free; crumble reference

Journal Title: Bioinformatics
Year Published: 2019

Link to full text (if available)


Share on Social Media:                               Sign Up to like & get
recommendations!

Related content

More Information              News              Social Media              Video              Recommended



                Click one of the above tabs to view related content.