Analytical Sciences

Abstract − Analytical Sciences, 17(2), 319 (2001).

Elimination of the Uninformative Calibration Sample Subset in the Modified UVE(Uninformative Variable Elimination)-PLS (Partial Least Squares) Method
Jun KOSHOUBU,* Tetsuo IWATA,** and Shigeo MINAMI***
*JASCO Technical Research Laboratory Corporation, 2963-3, Ishikawa, Hachioji, Tokyo 192-0032, Japan
**Department of Mechanical Engineering, Faculty of Engineering, University of Tokushima, Minami-Jyosanjima-cho-2, Tokushima 770-8506, Japan
***Osaka Electro-Communication University, 18-8, Hatsu-cho, Neyagawa, Osaka 572-8530, Japan
In order to increase the predictive ability of the PLS (Partial Least Squares) model, we have developed a new algorithm, by which uninformative samples which cannot contribute to the model very much are eliminated from a calibration data set. In the proposed algorithm, uninformative wavelength (or independent) variables are eliminated at the first stage by using the modified UVE (Uninformative Variable Elimination)-PLS method that we reported previously. Then, if the prediction error of the ith (1 =< i =< n) sample is larger than 3sigma, the corresponding sample is eliminated as uninformative, where n is the total number of calibration samples and sigma is the standard deviation calculated from the other n-1 samples. Calculation of sigma by the leave-one-out manner enhances the ability to identify the uninformative samples. The final PLS model is constructed precisely because both uninformative wavelength variables and uninformative samples are eliminated. In order to demonstrate the usefulness of the algorithm, we have applied it to two kinds of mid-infrared spectral data sets.