首页 | 本学科首页   官方微博 | 高级检索  
     


A complex networks approach for data clustering
Authors:Guilherme F. de Arruda  Luciano da Fontoura Costa  Francisco A. Rodrigues
Affiliation:1. Departamento de Matemática Aplicada e Estatística, Instituto de Ciências Matemáticas e de Computação, Universidade de São Paulo - Campus de São Carlos, Caixa Postal 668, 13560-970 São Carlos, SP, Brazil;2. Instituto de Física de São Carlos, Universidade de São Paulo, Caixa Postal 369, CEP 13560-970, São Carlos, São Paulo, Brazil
Abstract:This work proposes a method for data clustering based on complex networks theory. A data set is represented as a network by considering different metrics to establish the connection between each pair of objects. The clusters are obtained by taking into account five community detection algorithms. The network-based clustering approach is applied in two real-world databases and two sets of artificially generated data. The obtained results suggest that the exponential of the Minkowski distance is the most suitable metric to quantify the similarities between pairs of objects. In addition, the community identification method based on the greedy optimization provides the best cluster solution. We compare the network-based clustering approach with some traditional clustering algorithms and verify that it provides the lowest classification error rate.
Keywords:Clustering   Complex networks   Pattern recognition   Community
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号