首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Canonical Forest
Authors:Yu-Chuan Chen  Hyejung Ha  Hyunjoong Kim  Hongshik Ahn
Institution:1. Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY?, 11794-3600, USA
3. Department of Applied Statistics, Yonsei University, Seoul?, 120-749, South Korea
2. SUNY Korea, Incheon?, 406-840, South Korea
Abstract:We propose a new classification ensemble method named Canonical Forest. The new method uses canonical linear discriminant analysis (CLDA) and bootstrapping to obtain accurate and diverse classifiers that constitute an ensemble. We note CLDA serves as a linear transformation tool rather than a dimension reduction tool. Since CLDA will find the transformed space that separates the classes farther in distribution, classifiers built on this space will be more accurate than those on the original space. To further facilitate the diversity of the classifiers in an ensemble, CLDA is applied only on a partial feature space for each bootstrapped data. To compare the performance of Canonical Forest and other widely used ensemble methods, we tested them on 29 real or artificial data sets. Canonical Forest performed significantly better in accuracy than other ensemble methods in most data sets. According to the investigation on the bias and variance decomposition, the success of Canonical Forest can be attributed to the variance reduction.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号