Data quality in global metabolomics is of great importance for biomarker discovery and systems biology studies. However, comprehensive metrics and methods to evaluate and compare the data quality of global… Click to show full abstract
Data quality in global metabolomics is of great importance for biomarker discovery and systems biology studies. However, comprehensive metrics and methods to evaluate and compare the data quality of global metabolomics data sets are lacking. In this work, we combine newly developed metrics, along with well-known measures, to comprehensively and quantitatively characterize the data quality across two similar LC-MS platforms, with the goal of providing an efficient and improved ability to evaluate the data quality in global metabolite profiling experiments. A pooled human serum sample was run 50 times on two high-resolution LC-QTOF-MS platforms to provide profile and centroid MS data. These data were processed using Progenesis Qi software and then analyzed using five important data quality measures, including retention time drift, number of compounds detected, missing values and MS reproducibility (2 measures). The detected compounds were fit to a gamma distribution versus compound abundance, which was normalized to allow comparison of different platforms. To evaluate missing values, characteristic curves were obtained by plotting the compound detection percentage versus extraction frequency. To characterize reproducibility, the accumulative coefficient of variation (CV) versus percentage of total compounds detected and intra-class correlation coefficient (ICC) versus compound abundance were investigated. Key findings include significantly better performance using profile mode data compared to centroid mode as well quantitatively better performance from the newer, higher resolution instrument. A summary table of results gives a snapshot of the experimental results and provides a template to evaluate the global metabolite profiling workflow. In total, these measures give a good overall view of data quality in global profiling and allow comparisons of data acquisition strategies and platforms as well as optimization of parameters.
               
Click one of the above tabs to view related content.