Articles with "data leakage" as a keyword



Being Aware of Data Leakage and Cross‐Validation Scaling in Chemometric Model Validation

Sign Up to like & get
recommendations!
Published in 2025 at "Journal of Chemometrics"

DOI: 10.1002/cem.70026

Abstract: The main goal of our investigation is to raise awareness among chemometricians about how easy it is to introduce data or parameter leakage by inappropriate methods and to demonstrate that high precision is necessary in… read more here.

Keywords: validation; data leakage; model; cross validation ... See more keywords

Some combinatorics of data leakage induced by clusters

Sign Up to like & get
recommendations!
Published in 2024 at "Stochastic Environmental Research and Risk Assessment"

DOI: 10.1007/s00477-024-02715-1

Abstract: Data leakage is a common issue that can lead to misleading generalisation error estimation and incorrect hyperparameter tuning. However, its mechanisms are not always well understood. In this work, we consider the case of clustered… read more here.

Keywords: induced clusters; leakage; combinatorics data; leakage induced ... See more keywords

Data leakage inflates prediction performance in connectome-based machine learning models

Sign Up to like & get
recommendations!
Published in 2024 at "Nature Communications"

DOI: 10.1038/s41467-024-46150-w

Abstract: Predictive modeling is a central technique in neuroimaging to identify brain-behavior relationships and test their generalizability to unseen data. However, data leakage undermines the validity of predictive models by breaching the separation between training and… read more here.

Keywords: leakage; prediction performance; inflates prediction; machine learning ... See more keywords

A Novel Mechanism for Fast Detection of Transformed Data Leakage

Sign Up to like & get
recommendations!
Published in 2018 at "IEEE Access"

DOI: 10.1109/access.2018.2851228

Abstract: Data leakage is a growing insider threat in information security among organizations and individuals. A series of methods has been developed to address the problem of data leakage prevention (DLP). However, large amounts of unstructured… read more here.

Keywords: data leakage; mechanism fast; transformed data; novel mechanism ... See more keywords

A Computational Harmonic Detection Algorithm to Detect Data Leakage Through EM Emanation

Sign Up to like & get
recommendations!
Published in 2024 at "IEEE Internet of Things Journal"

DOI: 10.1109/jiot.2025.3578511

Abstract: Unintended electromagnetic emissions, called EM emanations, can be exploited to recover sensitive information, posing security risks. Metal shielding, used by defense organizations to preventdata leakage, is costly and impractical for widespread use. This issue is… read more here.

Keywords: detection; emanation; method; computational harmonic ... See more keywords

Combating Data Leakage Trojans in Commercial and ASIC Applications With Time-Division Multiplexing and Random Encoding

Sign Up to like & get
recommendations!
Published in 2018 at "IEEE Transactions on Very Large Scale Integration (VLSI) Systems"

DOI: 10.1109/tvlsi.2018.2844180

Abstract: Globalization of microchip fabrication opens the possibility for an attacker to insert hardware Trojans into a chip during the manufacturing process. While most defensive methods focus on detection or prevention, a recent method, called Randomized… read more here.

Keywords: division multiplexing; data leakage; time; time division ... See more keywords

Leakage Prediction in Machine Learning Models When Using Data from Sports Wearable Sensors

Sign Up to like & get
recommendations!
Published in 2022 at "Computational Intelligence and Neuroscience"

DOI: 10.1155/2022/5314671

Abstract: One of the major problems in machine learning is data leakage, which can be directly related to adversarial type attacks, raising serious concerns about the validity and reliability of artificial intelligence. Data leakage occurs when… read more here.

Keywords: machine; machine learning; data leakage; prediction machine ... See more keywords

The Effect of Data Leakage and Feature Selection on Machine Learning Performance for Early Parkinson’s Disease Detection

Sign Up to like & get
recommendations!
Published in 2025 at "Bioengineering"

DOI: 10.3390/bioengineering12080845

Abstract: If we do not urgently educate current and future medical professionals to critically evaluate and distinguish credible AI-assisted diagnostic tools from those whose performance is artificially inflated by data leakage or improper validation, we risk… read more here.

Keywords: data leakage; machine; parkinson disease; machine learning ... See more keywords