MOTIVATION Metagenome-assembled genomes (MAGs) have substantially extended our understanding of microbial functionality. However, 16S rRNA genes, which are commonly used in phylogenetic analysis and environmental surveys, are often missing from… Click to show full abstract
MOTIVATION Metagenome-assembled genomes (MAGs) have substantially extended our understanding of microbial functionality. However, 16S rRNA genes, which are commonly used in phylogenetic analysis and environmental surveys, are often missing from MAGs. Here, we developed MarkerMAG, a pipeline that links 16S rRNA genes to MAGs using paired-end sequencing reads. RESULTS Assessment of MarkerMAG on three benchmarking metagenomic datasets with various degrees of complexity shows substantial increases in the number of MAGs with 16S rRNA genes and a 100% assignment accuracy. MarkerMAG also estimates the copy number of 16S rRNA genes in MAGs with high accuracy. Assessments on three real metagenomic datasets demonstrates 1.1- to 14.2-fold increases in the number of MAGs with 16S rRNA genes. We also show that MarkerMAG-improved MAGs increase the accuracy of functional prediction from 16S rRNA gene amplicon data. MarkerMAG is helpful in connecting information in MAG database with those in 16S rRNA databases and surveys and hence contributes to our increasing understanding of microbial diversity, function, and phylogeny. AVAILABILITY MarkerMAG is implemented in Python3 and freely available at https://github.com/songweizhi/MarkerMAG. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
               
Click one of the above tabs to view related content.