A divisive clustering method for functional data with special consideration of outliers
Authors:Ana Justel  Marcela Svarc
Institution: 1. Department of Mathematics, Universidad Autónoma de Madrid, Madrid, Spain; 2. UC3M-BS Institute of Financial Big Data, Universidad Carlos III de Madrid, Madrid, Spain; 3. Department of Mathematics and Sciences, Universidad de San Andrés, Victoria, Argentina; 4. CONICET, Buenos Aires, Argentina
Abstract: This paper presents DivClusFD, a new divisive hierarchical method for the unsupervised classification of functional data. Data of this type have the peculiarity that differences among clusters may be caused by changes in level as well as in shape. Different clusters can be separated in different subregions, and there may be no single subregion in which all clusters are separated. In each division step, the DivClusFD method explores the functions and their derivatives at several fixed points, seeking the subregion in which the highest number of clusters can be separated. The number of clusters is estimated via the gap statistic. The functions are assigned to the new clusters by combining the k-means algorithm with functional boxplots, which identify functions that have been incorrectly classified because of their atypical local behavior. The DivClusFD method provides the number of clusters, the classification of the observed functions into the clusters, and guidelines that may be used for interpreting the clusters. A simulation study using synthetic data and tests of the performance of the DivClusFD method on real data sets indicate that this method is able to classify functions accurately.
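The gap-statistic step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the functions have already been evaluated at one fixed point, so each function contributes a single scalar, and it uses a plain one-dimensional k-means with a uniform reference distribution over the observed range. The names `kmeans_1d` and `gap_statistic`, the parameters `k_max` and `n_ref`, and the toy data are hypothetical; the derivative evaluations and the functional-boxplot correction are omitted.

```python
import math
import random


def kmeans_1d(values, k, n_iter=50):
    """Lloyd's algorithm on scalars; returns (labels, within-cluster sum of squares)."""
    xs = sorted(values)
    # Deterministic initialisation (an assumption of this sketch): evenly spaced quantiles.
    centres = [xs[int((i + 0.5) * len(xs) / k)] for i in range(k)]
    labels = [0] * len(values)
    for _ in range(n_iter):
        # Assign each value to its nearest centre.
        labels = [min(range(k), key=lambda j: (v - centres[j]) ** 2) for v in values]
        # Move each centre to the mean of its members (empty clusters keep their centre).
        for j in range(k):
            members = [v for v, lab in zip(values, labels) if lab == j]
            if members:
                centres[j] = sum(members) / len(members)
    wss = sum((v - centres[lab]) ** 2 for v, lab in zip(values, labels))
    return labels, wss


def gap_statistic(values, k_max=4, n_ref=20, seed=0):
    """Estimate the number of clusters as the smallest k with Gap(k) >= Gap(k+1) - s(k+1)."""
    rng = random.Random(seed)
    lo, hi = min(values), max(values)
    gaps, errs = [], []
    for k in range(1, k_max + 1):
        log_w = math.log(max(kmeans_1d(values, k)[1], 1e-12))
        # Reference dispersion: uniform samples over the observed range.
        ref = []
        for _ in range(n_ref):
            sample = [rng.uniform(lo, hi) for _ in values]
            ref.append(math.log(max(kmeans_1d(sample, k)[1], 1e-12)))
        mean_ref = sum(ref) / n_ref
        sd = math.sqrt(sum((r - mean_ref) ** 2 for r in ref) / n_ref)
        gaps.append(mean_ref - log_w)
        errs.append(sd * math.sqrt(1 + 1 / n_ref))
    for k in range(1, k_max):
        if gaps[k - 1] >= gaps[k] - errs[k]:
            return k
    return k_max


# Toy data: values of 60 hypothetical functions at one fixed point, in two level groups.
values = [0.02 * i for i in range(30)] + [10 + 0.02 * i for i in range(30)]
estimated_k = gap_statistic(values)
labels, _ = kmeans_1d(values, estimated_k)
```

In the method described above, a search of this kind would be repeated over several fixed points and over the derivatives, keeping the subregion that separates the most clusters; the sketch covers only one point.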
This article is indexed in SpringerLink and other databases.