首页 | 本学科首页   官方微博 | 高级检索  
     

基于GBMTS算法的不平衡数据分类研究
引用本文:顾玉萍,程龙生,陈湘来. 基于GBMTS算法的不平衡数据分类研究[J]. 数理统计与管理, 2016, 0(6): 1016-1027. DOI: 10.13860/j.cnki.sltj.20160418-006
作者姓名:顾玉萍  程龙生  陈湘来
作者单位:1. 南京理工大学经济管理学院,江苏南京,210094;2. 南京康尼集团综合管理部,江苏南京,210038
基金项目:国家自然科学基金资助项目(71271114)
摘    要:解决不平衡数据分类问题,在现实中有着深远的意义。马田系统利用单一的正常类别构建基准空间和测量基准尺度,并由此建立数据分类模型,十分适合不平衡数据分类问题的处理。本文以传统马田系统方法为基础,结合信噪比及F-value、G-mean等分类精度,建立了基于遗传算法的基准空间优化模型,同时运用Bagging集成化算法,构造了改进马田系统模型算法GBMTS。通过对不同分类方法及相关数据集的实验分析,表明:GBMTS算法较其他分类算法,更能够有效的处理不平衡数据的分类问题。

关 键 词:马田系统  不平衡数据  分类  遗传算法  Bagging算法

Research on the Classification of Imbalanced Data Based on GBMTS Algorithm
Abstract:It is of great significance in reality to solve the problem of classification with imbalanced data.Mahalanobis-Taguchi system (MTS) uses a single normal group to construct the refcrcncc spacc and measurement reference scale,and thus establishes the data classification model which is suitable for the classification problem of imbalanced data.In this paper,the reference space optimization model is constructed based on the traditional MTS method combined with the signal-to-noise ratio and classification accuracy indicators such as F-value and G-mean,and then an improved MTS model algorithm GBMTS is proposed by using the bagging algorithm.Through the experimental analysis of diffcrcnt classification methods and related data sets,it is shown that the GBMTS algorithm is more effective to deal with the classification problem of imbalanced data compared to the other methods.
Keywords:Mahalanobis-Taguchi system  imbalanced data  classification  genetic algorithm  bagging algorithm
本文献已被 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号