Variable selection in model-based discriminant analysis
Authors: C. Maugis, G. Celeux, M.-L. Martin-Magniette
Affiliation: (a) Institut de Mathématiques de Toulouse, INSA de Toulouse, Université de Toulouse, France; (b) Inria Saclay Île-de-France, France; (c) UMR AgroParisTech/INRA MIA 518, Paris, France; (d) URGV UMR INRA 1165, UEVE, ERL CNRS 8196, Evry, France
Abstract: A general methodology for selecting predictors in Gaussian generative classification models is presented. The problem is treated as a model selection problem. Each candidate predictor can play one of three roles: it can be relevant for classification; it can be irrelevant for classification but linearly dependent on a subset of the relevant predictors; or it can be irrelevant and independent of the relevant predictors. This variable selection model was inspired by previous work on variable selection in model-based clustering. A BIC-like model selection criterion is proposed. It is optimized through two embedded forward stepwise variable selection algorithms, one for classification and one for linear regression. The identifiability of the model and the consistency of the variable selection criterion are proved. Numerical experiments on simulated and real data sets illustrate the value of this variable selection methodology. In particular, it is shown that this well-grounded variable selection model can substantially improve the classification performance of quadratic discriminant analysis in high-dimensional settings.
Keywords: AMS subject classification: 62H30
This article is indexed in ScienceDirect and other databases.