首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Latent class model with conditional dependency per modes to cluster categorical data
Authors:Matthieu Marbac  Christophe Biernacki  Vincent Vandewalle
Institution:1.Department of Mathematics and Statistics,McMaster University,Hamilton,Canada;2.University Lille 1 and CNRS and Inria Lille,Villeneuve-d’Ascq,France;3.EA 2694 University Lille 2 and Inria Lille,Lille,France
Abstract:We propose a parsimonious extension of the classical latent class model to cluster categorical data by relaxing the conditional independence assumption. Under this new mixture model, named conditional modes model (CMM), variables are grouped into conditionally independent blocks. Each block follows a parsimonious multinomial distribution where the few free parameters model the probabilities of the most likely levels, while the remaining probability mass is uniformly spread over the other levels of the block. Thus, when the conditional independence assumption holds, this model defines parsimonious versions of the standard latent class model. Moreover, when this assumption is violated, the proposed model brings out the main intra-class dependencies between variables, summarizing thus each class with relatively few characteristic levels. The model selection is carried out by an hybrid MCMC algorithm that does not require preliminary parameter estimation. Then, the maximum likelihood estimation is performed via an EM algorithm only for the best model. The model properties are illustrated on simulated data and on three real data sets by using the associated R package CoModes. The results show that this model allows to reduce biases involved by the conditional independence assumption while providing meaningful parameters.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号