LAUSR.org creates dashboard-style pages of related content for over 1.5 million academic articles. Sign Up to like articles & get recommendations!

Determination of optimum number of components in partial least squares regression from distributions of the root‐mean‐squared error obtained by Monte Carlo resampling

Photo by pask_07 from unsplash

Monte Carlo resampling is utilized to determine the number of components in partial least squares (PLS) regression. The data are randomly and repeatedly divided into calibration and validation samples. For… Click to show full abstract

Monte Carlo resampling is utilized to determine the number of components in partial least squares (PLS) regression. The data are randomly and repeatedly divided into calibration and validation samples. For each repetition, the root‐mean‐squared error (RMSE) is determined for the validation samples for a = 1, 2, … , A PLS components to provide a distribution of RMSE values for each number of PLS components. These distributions are used to determine the median RMSE for each number of PLS components. The component (Amin) having the lowest median RMSE is located. The fraction p of the RMSE values of Amin exceeding the median RMSE for the preceding component is determined. This fraction p represents a probability measure that can be used to decide if the RMSE for the Amin PLS component is significantly lower than the RMSE for the preceding component for a preselected threshold (pupper). If so, it defines the optimum number of PLS components. If not, the process is repeated for the previous components until significance is achieved. The pupper = 0.5 implies that the median is used for selecting the optimum number of components. The RMSE is approximately normally distributed on the smallest components. This can be utilized to relate p to a fraction of a standard deviation. For instance, p = 0.308 corresponds to half a standard deviation if RMSE is normally distributed.

Keywords: rmse; monte carlo; carlo resampling; optimum number; number; number components

Journal Title: Journal of Chemometrics
Year Published: 2018

Link to full text (if available)


Share on Social Media:                               Sign Up to like & get
recommendations!

Related content

More Information              News              Social Media              Video              Recommended



                Click one of the above tabs to view related content.