首页 | 本学科首页   官方微博 | 高级检索  
     

面向非平衡数据处理的样例惩罚支持向量机
引用本文:金鑫,李玉鑑. 面向非平衡数据处理的样例惩罚支持向量机[J]. 武汉大学学报(理学版), 2012, 58(2): 139-143
作者姓名:金鑫  李玉鑑
作者单位:北京工业大学计算机学院,北京,100124
基金项目:国家自然科学基金,北京市自然科学基金,北京市教委科技发展项目,北京工业大学高层次人才培养项目
摘    要:支持向量机在处理非平衡数据集时常常不能取得良好的效果,因为其分类性能只考虑了总体分类精度,而忽略了不同类别样例之间的精度权衡.本文提出了一种基于样例分布的样例惩罚支持向量机,可以针对每一个样例根据其相应的分布特性选取惩罚以获得高敏感度的分类面.实验表明,该模型比标准支持向量机在非平衡数据上具有更好的性能.

关 键 词:支持向量机  非平衡数据  样例惩罚  样例分布

Support Vector Machines with Example Dependent Costs for Dealing with Imbalanced Data
JIN Xin,LI Yujian. Support Vector Machines with Example Dependent Costs for Dealing with Imbalanced Data[J]. JOurnal of Wuhan University:Natural Science Edition, 2012, 58(2): 139-143
Authors:JIN Xin  LI Yujian
Affiliation:(College of Computer Science,Beijing University of Technology,Beijing 100124,China)
Abstract:Standard SVM often performs poorly on imbalanced datasets for the reason that SVM ignores the tradeoff of the precision between different classes while just takes the overall classification accuracy into account.A new example dependent costs SVM method was proposed,from which we can get more sensitive hyperplane by selecting penalty for every sample according to its corresponding distribution.Experimental results show that this method can efficiently and effectively improve the performance on imbalanced datasets,better than the standard SVM method for comparison.
Keywords:support vector machine  imbalanced data  example dependent costs  data distribution
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号