LAUSR.org creates dashboard-style pages of related content for over 1.5 million academic articles. Sign Up to like articles & get recommendations!

CURC: a CUDA-based reference-free read compressor

Photo by madalyncox from unsplash

MOTIVATION The data deluge of high-throughput sequencing has posed great challenges to data storage and transfer. Many specific compression tools have been developed to solve this problem. However, most of… Click to show full abstract

MOTIVATION The data deluge of high-throughput sequencing has posed great challenges to data storage and transfer. Many specific compression tools have been developed to solve this problem. However, most of the existing compressors are based on CPU platform, which might be inefficient and expensive to handle large-scale HTS data. With the popularization of GPUs, GPU-compatible sequencing data compressors become desirable to exploit the computing power of GPUs. RESULTS We present a GPU-accelerated reference-free read compressor, namely CURC, for FASTQ files. Under a GPU-CPU heterogeneous parallel scheme, CURC implements highly efficient lossless compression of DNA stream based on the pseudogenome approach and CUDA library. CURC achieves 2∼6-fold speedup of the compression with competitive compression rate, compared with other state-of-the-art reference-free read compressors. AVAILABILITY CURC can be downloaded from https://github.com/BioinfoSZU/CURC. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

Keywords: read compressor; free read; reference free; curc

Journal Title: Bioinformatics
Year Published: 2022

Link to full text (if available)


Share on Social Media:                               Sign Up to like & get
recommendations!

Related content

More Information              News              Social Media              Video              Recommended



                Click one of the above tabs to view related content.