Accurate discrimination and classification of unknown species are the basis to predict its characteristics or applications to make correct decisions. However, for biogenic solutions that are ubiquitous in nature and… Click to show full abstract
Accurate discrimination and classification of unknown species are the basis to predict its characteristics or applications to make correct decisions. However, for biogenic solutions that are ubiquitous in nature and our daily lives, direct determination of their similarities and disparities by their molecular compositions remains a scientific challenge. Here, we explore a standard and visualizable ontology, termed "biogenic solution map", that organizes multifarious classes of biogenic solutions into a map of hierarchical structures. To build the map, a novel 4-dimensional (4D) fingerprinting method based on data-independent acquisition data sets of untargeted metabolomics is developed, enabling accurate characterization of complex biogenic solutions. A generic parameter of metabolic correlation distance, calculated based on averaged similarities between 4D fingerprints of sample groups, is able to define "species", "genus", and "family" of each solution in the map. With the help of the "biogenic solution map", species of unknown biogenic solutions can be explicitly defined. Simultaneously, intrinsic correlations and subtle variations among biogenic solutions in the map are accurately illustrated. Moreover, it is worth mentioning that samples of the same analyte but prepared by alternative protocols may have significantly different metabolic compositions and could be classified into different "genera".
               
Click one of the above tabs to view related content.