Model-based clustering of high-dimensional data streams with online mixture of probabilistic PCA
Authors: Anastasios Bellas, Charles Bouveyron, Marie Cottrell, Jérôme Lacaille
Institutions: 1. SAMM (EA 4543), Université Paris 1, 90 rue de Tolbiac, 75634 Paris Cedex 13, France
2. Snecma, Groupe Safran, 77550 Moissy Cramayel, France
Abstract: Model-based clustering is a popular tool, renowned for its probabilistic foundations and its flexibility. However, model-based clustering techniques usually perform poorly on high-dimensional data streams, which are now a common data type. To overcome this limitation, we propose an online inference algorithm for the mixture of probabilistic PCA model. The proposed algorithm relies on an EM-based procedure and on a probabilistic, incremental version of PCA. Model selection is also handled in the online setting through parallel computing. Numerical experiments on simulated and real data demonstrate the effectiveness of our approach and compare it with state-of-the-art online EM-based algorithms.
This article is indexed in SpringerLink and other databases.