首页 | 本学科首页   官方微博 | 高级检索  
     

k-平均问题及其变形的算法综述
引用本文:徐大川,许宜诚,张冬梅. k-平均问题及其变形的算法综述[J]. 运筹学学报, 2017, 21(2): 101-109. DOI: 10.15960/j.cnki.issn.1007-6093.2017.02.011
作者姓名:徐大川  许宜诚  张冬梅
作者单位:1. 北京工业大学应用数理学院, 北京 100124; 2. 山东建筑大学计算机科学与技术学院, 济南 250101
基金项目:国家自然科学基金(No. 11531014), 山东省高等学校科研计划项目(No. J15LN23)
摘    要:k-平均问题是计算机科学和组合优化领域的经典问题之一.k-平均聚类作为最受重视而且最简单易懂的一种聚类分析方法流行于数据挖掘领域.k-平均问题可描述为:给定n个元素的观测集,其中每个观测点都是d维实向量,目标是把这n个观测点划分到k(≤n)个集合中,使得所有集合中的点到对应的聚类中心的距离的平方和最小,其中一个集合的聚类中心指的是该集合中所有观测点的均值.k-平均问题在理论上是NP-难的,但有高效的启发式算法,广泛应用在市场划分、机器视觉、地质统计学、天文学和农业等实际背景中.随着实际问题中遇到的k-平均问题更加复杂,数据量更加庞大,还需学者进行更深一步的研究.罗列出k-平均问题及其诸多变形及推广问题的经典算法,并总结k-平均中尚待研究的若干问题.

关 键 词:聚类问题  k-平均  NP-难  
收稿时间:2017-04-05

A survey on algorithms for k-means problem and its variants
XU Dachuan,XU Yicheng,ZHANG Dongmei. A survey on algorithms for k-means problem and its variants[J]. OR Transactions, 2017, 21(2): 101-109. DOI: 10.15960/j.cnki.issn.1007-6093.2017.02.011
Authors:XU Dachuan  XU Yicheng  ZHANG Dongmei
Affiliation:1. College of Applied Sciences, Beijing University of Technology, Beijing 100124, China; 2. School of Computer Science and Technology, Shandong Jianzhu University, Jinan 250101, China
Abstract:The k-means is one of the most classical problems in both computer science and combinatorial optimization. The k-means clustering is popular as one of the most significant and easiest methods of cluster analysis in data mining. The k-means problem can be described as: Given a set of observations with n elements, where each element is a d-dimensional real vector. The objective is to partition the n observations into k sets, so as to minimize within-cluster sum of squares, where we call the center of the cluster the mean of the observations in it. The k-means is NP-hard in theory however, there are efficient heuristic algorithms employed widely in market segmentation, machine vision, geostatistics, astronomy, agriculture and so on. With more and more complicated the problem we meet in practical, and larger and larger the data grows, further research is needed. This article aims at listing the classical algorithms for solving k-means as well as the variety of the variants and generalization of it, and summarize some problems in k-means which have not been solved for readers.
Keywords:clustering problems  k-means  NP-hard  
本文献已被 CNKI 等数据库收录!
点击此处可从《运筹学学报》浏览原始摘要信息
点击此处可从《运筹学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号