Estimation of parameters in latent class models using fuzzy clustering algorithms
Institution: 1. Big Data Institute, College of Computer Science & Software Engineering, Shenzhen University, Shenzhen 518060, China; 2. National Engineering Laboratory for Big Data System Computing Technology, Shenzhen University, Shenzhen 518060, China; 3. Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ), Shenzhen University, Shenzhen 518060, China; 4. College of Forensic Science and Technology, Criminal Investigation Police University of China, Shenyang 110854, China; 1. National Institute of Water and Atmosphere, NIWA, 301 Evans Bay Parade, Hataitai, Wellington 6021, New Zealand; 2. Zaita Design, 29 Burbank Crescent, Churton Park, Wellington, New Zealand; 3. Ministry for Primary Industries, 25 The Terrace, Wellington 6011, New Zealand; 1. Posgrado en Ciencias Biológicas, Universidad Nacional Autónoma de México. Ciudad Universitaria, 04510, Ciudad de México, Mexico; 2. Departamento de Ecología y Recursos Naturales, Facultad de Ciencias, Universidad Nacional Autónoma de México. Ciudad Universitaria, 04510, Ciudad de México, Mexico
Abstract: The mixture approach is an important technique in cluster analysis. A mixture of multivariate multinomial distributions is commonly used to analyze categorical data under a latent class model, and parameter estimation is a key step in fitting such a mixture. This paper describes four approaches to estimating the parameters of a mixture of multivariate multinomial distributions. The first is an extended maximum likelihood (ML) method. The second is based on the well-known expectation maximization (EM) algorithm. The third is the classification maximum likelihood (CML) algorithm. The fourth, proposed in this paper, uses the so-called fuzzy class model to build a fuzzy classification maximum likelihood (FCML) approach for categorical data. The accuracy, robustness and effectiveness of the four algorithms in estimating the parameters of multivariate binomial mixtures are compared using real empirical data and samples drawn from two-class multivariate binomial mixtures. The results show that the proposed FCML algorithm achieves better accuracy, robustness and effectiveness than the ML, EM and CML algorithms. We therefore recommend FCML as another good tool for estimating the parameters of multivariate multinomial mixture models.
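To make the estimation problem concrete, the following is a minimal sketch of the standard EM algorithm for a latent class model with binary items (the Bernoulli special case of a multivariate binomial mixture), with an optional hard-assignment step in the spirit of classification maximum likelihood. It is not the FCML algorithm of the paper, whose details the abstract does not give. The function names em_latent_class and simulate, their parameters, and the simulated class probabilities are all illustrative assumptions.

```python
import math
import random


def simulate(n, weights, theta, seed=1):
    """Draw n binary response vectors from a latent class (Bernoulli mixture) model."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        # Pick a latent class, then sample each item independently given the class.
        k = rng.choices(range(len(weights)), weights=weights)[0]
        data.append([1 if rng.random() < p else 0 for p in theta[k]])
    return data


def em_latent_class(data, n_classes=2, n_iter=100, hard=False, seed=0):
    """Estimate class weights and item probabilities (hypothetical helper).

    hard=False gives ordinary EM; hard=True replaces the posterior
    probabilities by a 0/1 assignment to the most probable class.
    """
    rng = random.Random(seed)
    n, m = len(data), len(data[0])
    eps = 1e-9  # keeps probabilities away from 0 and 1 so log() is defined
    weights = [1.0 / n_classes] * n_classes
    # theta[k][j] approximates P(item j = 1 | class k); random start breaks symmetry.
    theta = [[rng.uniform(0.25, 0.75) for _ in range(m)] for _ in range(n_classes)]
    resp = []
    for _ in range(n_iter):
        # E-step: posterior class membership for every observation.
        resp = []
        for x in data:
            logp = []
            for k in range(n_classes):
                lp = math.log(weights[k])
                for j in range(m):
                    p = theta[k][j]
                    lp += math.log(p) if x[j] else math.log(1.0 - p)
                logp.append(lp)
            top = max(logp)  # subtract the maximum for numerical stability
            unnorm = [math.exp(lp - top) for lp in logp]
            total = sum(unnorm)
            r = [u / total for u in unnorm]
            if hard:
                best = r.index(max(r))
                r = [1.0 if k == best else 0.0 for k in range(n_classes)]
            resp.append(r)
        # M-step: weighted relative frequencies.
        for k in range(n_classes):
            nk = sum(r[k] for r in resp)
            weights[k] = min(max(nk / n, eps), 1.0)
            for j in range(m):
                s = sum(r[k] * x[j] for r, x in zip(resp, data))
                theta[k][j] = min(max(s / max(nk, eps), eps), 1.0 - eps)
    return weights, theta, resp


# Usage: two well-separated classes, five binary items (assumed values).
data = simulate(500, [0.4, 0.6], [[0.9] * 5, [0.1] * 5])
weights, theta, _ = em_latent_class(data)
print(weights)
print(theta)
```

Class labels are arbitrary in a mixture model, so the estimated classes may come out in a different order from the simulated ones. A fuzzy variant would replace the posterior step with fuzzy memberships, but its exact form should be taken from the paper rather than from this sketch.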
This article is indexed in ScienceDirect and other databases.