1210. K-mer Profiling Powered by Reference-assisted Assembly of NGS Data: A Highly Sensitive Protocol to Infer the Plasma Microbiome Using Cell-free DNA Sequence Data

Cell-free DNA (cfDNA) has emerged as an important clinical specimen for detecting pathogenic microbes, especially in organ transplant patients, where the same data can be used to predict allograft rejection. Recent reports have described the viral, bacterial, or complete microbial diversity of plasma following cfDNA sequencing. The prevalence of certain viral families (e.g., Anelloviridae) is associated with immunosuppressant dosage and the risk of antibody-mediated rejection. Although informative, cfDNA reads are inherently short (~160 bp, or 2x75 bp) and dominated by host DNA (~97-99%), which complicates their taxonomic annotation and lowers specificity. Here we present a computational protocol that minimizes these challenges by combining "reference-assisted assembly" with K-mer profiles of NGS data for highly sensitive and specific microbial detection. In our pipeline, non-host NGS data (reads not mapped to the human genome) undergo reference-assisted assembly followed by taxonomic annotation with KrakenUniq (a K-mer based classifier). We trained KrakenUniq on an in-house, curated database of ~12,000 viral genomes using three different K-mer values (16, 21, 31), and final predictions are made by applying a majority-wins rule. For bacterial and fungal metagenome analysis, the default KrakenUniq database is currently used. We tested our method on 30 simulated and 124 clinical samples obtained from a biorepository. The protocol currently screens for a targeted list of pathogens (15 viral species, 16 bacterial genera, and 10 fungal genera). On a simulated set of viral sample mixes, the protocol had 100% accuracy. For the 124 clinical samples, predictions were evaluated for specificity and sensitivity using qPCR assays for the following viral species: EBV, BKV, JCV, HSV1/2, HHV7, and CMV. In total, 33 of 38 computational predictions (87%) were confirmed by qPCR.
The prediction sensitivity ranged from 6 - 106 copies/mL. Performing reference-assisted assembly followed by K-mer based taxonomic annotation of cfDNA data led to the development of a novel and accurate pathogen detection protocol.

Disclosures: Rohita Sinha, PhD, Viracor-Eurofins (Employee); Steve Kleiboeker, DVM, PhD, Viracor-Eurofins (Employee); Michelle Altrich, PhD, Viracor-Eurofins (Employee); Ellis Bixler, MS, Viracor-Eurofins (Employee)
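
The majority-wins rule over the three K-mer sizes mentioned in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how ties or unclassified calls are handled, so this sketch assumes a strict majority (at least two of three runs must agree) and treats a missing call as an "unclassified" vote. The function name `majority_call` and the dictionary input format are hypothetical and are not taken from the authors' pipeline.

```python
from collections import Counter
from typing import Dict, Optional

# K-mer sizes named in the abstract.
K_VALUES = (16, 21, 31)


def majority_call(calls_by_k: Dict[int, Optional[str]]) -> Optional[str]:
    """Return the taxon reported by a strict majority of the K-mer runs.

    calls_by_k maps a K-mer size to the taxon label assigned by the
    classifier built with that K, or None if that run made no call.
    Assumption: without a strict majority, no prediction is made.
    """
    # A K value missing from the dict counts as an unclassified (None) vote.
    votes = Counter(calls_by_k.get(k) for k in K_VALUES)
    taxon, count = votes.most_common(1)[0]
    if count > len(K_VALUES) / 2:
        return taxon  # may be None when most runs were unclassified
    return None


if __name__ == "__main__":
    # Two of the three runs agree, so the shared label is reported.
    print(majority_call({16: "CMV", 21: "CMV", 31: "EBV"}))
    # All three runs disagree, so no prediction is made.
    print(majority_call({16: "CMV", 21: "EBV", 31: "BKV"}))
```

Requiring a strict majority means a label reported by only one K-mer size is never reported, which is one plausible way to trade some sensitivity for specificity; the authors' actual tie-breaking behavior may differ.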

Keywords: reference assisted; dna; assisted assembly; protocol; ngs data

Journal Title: Open Forum Infectious Diseases
Year Published: 2020
