Near-Infrared Spectroscopy Analytical Model Using Ensemble Partial Least Squares Regression |
| |
Authors: | Na Luo Ping Han Shifang Wang Dong Wang |
| |
Affiliation: | 1. College of Information and Electrical Engineering, Shenyang Agricultural University, Liaoning, China;2. Beijing Research Center for Agricultural Standards and Testing, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China;3. Beijing Research Center for Agricultural Standards and Testing, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China |
| |
Abstract: | A novel ensemble-based feature selection method was developed which is designated as ensemble partial least squares regression coeffientents (EPRC). It was composed of two steps: generating a series of different single feature selectors and aggregating them to reach a consensus. Specifically, the bootstrap resampling approach was used to generate a diversity of single feature selectors, and the absolute values of the regression coefficients of the partial least squares (PLS) model were used to rank the features. Next, these feature rankings out of single feature selectors were aggregated by the weighted-sum approach. Finally, coupled with the regression model, the features selected by EPRC were evaluated through cross validation and an independent test set. By experiments of constructing the spectroscopy analysis model on three near infrared spectroscopy (NIRS) datasets, it was shown that the EPRC located key wavelengths, gave a promotion to regression performance, and was more stable and interpretable to the domain experts. |
| |
Keywords: | Ensemble learning ensemble partial least squares regression coefficient (EPRC) feature selection near-infrared spectroscopy (NIRS) partial least squares (PLS) |
|
|