Tournament screening cum EBIC for feature selection with high-dimensional feature spaces |
| |
Authors: | ZeHua Chen and JiaHua Chen |
| |
Affiliation: | (1) Department of Statistics & Applied Probability, National University of Singapore, 3 Science Drive 2, 117543 Singapore, Singapore; (2) Department of Statistics, University of British Columbia, Vancouver, BC, V6T 1Z2, Canada |
| |
Abstract: | Feature selection with a relatively small sample size and an extremely high-dimensional feature space is common in many areas of contemporary statistics. The high dimensionality of the feature space causes serious difficulties: (i) the sample correlations between features become high even if the features are stochastically independent; (ii) the computation becomes intractable. These difficulties make conventional approaches either inapplicable or inefficient. Reducing the dimensionality of the feature space and then applying low-dimensional approaches appears to be the only feasible way to tackle the problem. Along this line, we develop in this article a tournament screening cum EBIC approach for feature selection with a high-dimensional feature space. The tournament screening procedure mimics that of a tournament. It is shown theoretically that tournament screening has the sure screening property, a necessary property that any valid screening procedure should satisfy. Numerical studies demonstrate that the tournament screening cum EBIC approach enjoys desirable properties, such as a higher positive selection rate and a lower false discovery rate than other approaches. Zehua Chen was supported by the Singapore Ministry of Education's ACRF Tier 1 (Grant No. R-155-000-065-112). Jiahua Chen was supported by the Natural Sciences and Engineering Research Council of Canada and MITACS, Canada. |
| |
Keywords: | extended Bayes information criterion; feature selection; penalized likelihood; reduction of dimensionality; small-n-large-P; sure screening |
This article is indexed in CNKI, SpringerLink, and other databases. |
|