Anomaly detection is one of the most important problems of modern data science due to the threat to the security of information systems as well as their users. This applies… Click to show full abstract
Anomaly detection is one of the most important problems of modern data science due to the threat to the security of information systems as well as their users. This applies in particular to logistic data, which is used to predict costs, times, and organization of travel routes. Data anomalies may endanger the welfare and safety of transport users, goods, handling companies, and consumers. Moreover, they contribute to the overexploitation of the natural environment. Therefore, it is extremely important to find methods that are responsible for their effective detection. The desired approach may be the Choquet integral and its extensions, which in various applications have proven that with their help it is possible to efficiently increase the quality of the classification measured, for example, with the help of the accuracy. Due to the fact that the Choquet integral is resistant to data fluctuations and takes into account the quality (significance) of the information source, it appears to be an effective proposition for the final determination of what data, or more precisely, which records can be considered anomalous. The innovative approach to analyze transport data has not been used before. This article considers four publicly available databases covering different fields of application of transport systems. In a series of comprehensive numerical experiments, the Choquet integral-based approach has proven high efficiency for each of them. Moreover, we made a comparative analysis of the solutions before applying the Choquet integral and the results after its application.
               
Click one of the above tabs to view related content.