首页 | 本学科首页   官方微博 | 高级检索  
     检索      

对聚类算法普遍存在问题的解决办法
引用本文:姜园,张朝阳,仇佩亮,戚玉鹏.对聚类算法普遍存在问题的解决办法[J].电路与系统学报,2004,9(3):92-99.
作者姓名:姜园  张朝阳  仇佩亮  戚玉鹏
作者单位:浙江大学,信息与通信工程研究所,浙江,杭州,310027
基金项目:国家自然科学基金资助项目(60002003)
摘    要:聚类广泛应用于统计、机器学习、模式识别、数据分析等领域并越来越受重视。本文研究了各种聚类算法共同面临的五个问题:聚类效果评估、类数目估计、数据预处理、样本间相似性测量、抗干扰性能,分析了对这些问题的有代表性的解决方法,总结并预测了未来聚类算法在这五个方面的研究方向。

关 键 词:聚类  效果评估  类数目估计  预处理  相似性测量  抗干扰性能
文章编号:1007-0249(2004)03-0092-08
修稿时间:2003年8月14日

Solutions to General Clustering Algorithmic Issues
JIANG Yuan,ZHANG Zhao-Yang,QIU Pei-Liang,QI Yu-Peng.Solutions to General Clustering Algorithmic Issues[J].Journal of Circuits and Systems,2004,9(3):92-99.
Authors:JIANG Yuan  ZHANG Zhao-Yang  QIU Pei-Liang  QI Yu-Peng
Abstract:Clustering is widely used in several fields such as statistics, machine learning, pattern recognition and numerical analysis. Recently, more and more attention has been paid to it. In this paper, five issues commonly concerned are discussed, they are: assessment of clustering results, estimation of total number of clusters, data preparation, measures of data proximity and outlier handling. Representative solutions to these issues are surveyed, conclusions are summed up, development trend of algorithms to deal with these five issues is forecasted.
Keywords:clustering  assessment of results  estimation of total number of clusters  data preparation  proximity measure  outlier handling
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号