Research Communication | Open Access
Volume 2020 | Communication ID 28 |

Cross validation vs generalized cross validation in PLS regression: a comparison by simulation

Abdelmounaim Kerkri
Academic Editor: Youssef EL FOUTAYENI
Received
Accepted
Published
14 November 2019
29 November 2019
10 March 2020

Abstract: Partial least squares regression (PLS regression) is often used to overcome the problem of multicollinearity in chemometrics. This method shrinks the ordinary least squares estimator (OLS estimator) in order to exchange bias with lower variance. PLS requires a proper model selection tool to choose the optimal model; cross validation is the most common tool in the PLS literature. In this work we will compare leave one out cross validation with two methods of generalized cross validation (GCV), often used in regularization methods in the numerical analysis literature. The difference between the two (GCV) methods is the use of different estimations for the trace of the influence matrix. Our comparison is conducted using simulated datasets.