SUMMARY Learning underlying correlation patterns in data is a central problem across scientific fields. Maximum entropy models are an important class of statistical approaches for addressing this problem. However, inferring model parameters accurately and efficiently is a major challenge, particularly for modern high-dimensional applications such as those in biology, for which the number of parameters is enormous. Previously, we developed a statistical method, Minimum Probability Flow-Boltzmann Machine Learning (MPF-BML), for fast and accurate inference of maximum entropy model parameters, and applied it to genetic sequence data to estimate the fitness landscape of the surface proteins of HIV and hepatitis C virus. To facilitate seamless use of MPF-BML and encourage more widespread application to data in diverse fields, we present a standalone cross-platform package of MPF-BML that features an easy-to-use GUI. The package requires only the input data (protein sequence data, or data on multiple configurations of a complex system with a large number of variables) and returns the maximum entropy model parameters.

AVAILABILITY AND IMPLEMENTATION The MPF-BML software is publicly available under the MIT License at https://github.com/ahmedaq/MPF-BML-GUI.

SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
               