首页 | 本学科首页   官方微博 | 高级检索  
     检索      


An Improved K-Means Algorithm Based on Evidence Distance
Authors:Ailin Zhu  Zexi Hua  Yu Shi  Yongchuan Tang  Lingwei Miao
Institution:1.School of Information Science and Technology, Southwest Jiaotong University, Chengdu 611756, China;2.School of Electrical Engineering, Southwest Jiaotong University, Chengdu 611756, China; (Y.S.); (L.M.);3.School of Big Data and Software Engineering, Chongqing University, Chongqing 401331, China;4.Qianghua Times (Chengdu) Technology Co., Ltd., Chengdu 610095, China
Abstract:The main influencing factors of the clustering effect of the k-means algorithm are the selection of the initial clustering center and the distance measurement between the sample points. The traditional k-mean algorithm uses Euclidean distance to measure the distance between sample points, thus it suffers from low differentiation of attributes between sample points and is prone to local optimal solutions. For this feature, this paper proposes an improved k-means algorithm based on evidence distance. Firstly, the attribute values of sample points are modelled as the basic probability assignment (BPA) of sample points. Then, the traditional Euclidean distance is replaced by the evidence distance for measuring the distance between sample points, and finally k-means clustering is carried out using UCI data. Experimental comparisons are made with the traditional k-means algorithm, the k-means algorithm based on the aggregation distance parameter, and the Gaussian mixture model. The experimental results show that the improved k-means algorithm based on evidence distance proposed in this paper has a better clustering effect and the convergence of the algorithm is also better.
Keywords:k-means clustering  evidence distance  cluster analysis  evidence theory
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号