Factor probabilistic distance clustering (FPDC): a new clustering method
Authors: Cristina Tortora, Mireille Gettler Summa, Marina Marino, Francesco Palumbo
Affiliation: 1. Department of Mathematics and Statistics, McMaster University, Hamilton, Canada; 2. CEREMADE, Université Paris Dauphine, Paris, France; 3. Dipartimento di Scienze Sociali, University of Naples Federico II, Naples, Italy; 4. Dipartimento di Scienze Politiche, University of Naples Federico II, Naples, Italy
Abstract: Factor clustering methods have been developed in recent years thanks to improvements in computational power. These methods perform a linear transformation of the data and a clustering of the transformed data, optimizing a common criterion. Probabilistic distance (PD)-clustering is an iterative, distribution-free, probabilistic clustering method. Factor PD-clustering (FPDC) is based on PD-clustering and involves a linear transformation of the original variables into a reduced number of orthogonal ones, using a criterion shared with PD-clustering. This paper demonstrates that Tucker3 decomposition can be used to accomplish this transformation. Factor PD-clustering alternates between Tucker3 decomposition and PD-clustering on the transformed data until convergence is achieved. This method can significantly improve the performance of the PD-clustering algorithm; large data sets can thus be partitioned into clusters with increased stability and robustness of the results. Real and simulated data sets are used to compare FPDC with its main competitors: it performs equally well when clusters are elliptically shaped, and outperforms its competitors when clusters are non-Gaussian in shape or the data are noisy.
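The abstract does not spell out the PD-clustering iteration, so the following is only a minimal sketch of the clustering step, not the authors' implementation. It assumes the commonly cited PD-clustering principle that, for each point, membership probability times distance to the center is the same for every cluster, together with a center update weighted by p^2 / d. The function names `pd_probabilities` and `pd_clustering`, the `EPS` guard, and the stopping rule are illustrative choices. The Tucker3 projection that FPDC alternates with this step is not shown.

```python
import math

EPS = 1e-12  # hypothetical guard against division by zero when a point sits on a center


def pd_probabilities(dists):
    """Return membership probabilities for one point.

    Assumes p_k * d_k is constant over clusters, so p_k is proportional to 1 / d_k.
    """
    inv = [1.0 / max(d, EPS) for d in dists]
    total = sum(inv)
    return [v / total for v in inv]


def point_to_center_distances(points, centers):
    """Euclidean distance from every point to every center, floored at EPS."""
    return [[max(math.dist(x, c), EPS) for c in centers] for x in points]


def pd_clustering(points, init_centers, max_iter=200, tol=1e-9):
    """Alternate probability and center updates until the centers stop moving."""
    centers = [list(c) for c in init_centers]
    dim = len(centers[0])
    for _ in range(max_iter):
        dists = point_to_center_distances(points, centers)
        probs = [pd_probabilities(row) for row in dists]
        new_centers = []
        for k in range(len(centers)):
            # Assumed update rule: point i weighs p_ik^2 / d_ik for center k.
            w = [p[k] ** 2 / d[k] for p, d in zip(probs, dists)]
            total = sum(w)
            new_centers.append(
                [sum(wi * x[j] for wi, x in zip(w, points)) / total for j in range(dim)]
            )
        shift = max(math.dist(a, b) for a, b in zip(centers, new_centers))
        centers = new_centers
        if shift < tol:
            break
    # Recompute probabilities so they match the returned centers.
    probs = [pd_probabilities(row) for row in point_to_center_distances(points, centers)]
    return centers, probs
```

A point can then be assigned to the cluster with the largest probability in its row of `probs`. In FPDC, as the abstract describes it, this step would run on the transformed data rather than on the original variables.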
This article is indexed in SpringerLink and other databases.