首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Markov chain order estimation with conditional mutual information
Authors:M Papapetrou  D Kugiumtzis
Institution:Department of Mathematical, Physical and Computational Science, Faculty of Engineering, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece
Abstract:We introduce the Conditional Mutual Information (CMI) for the estimation of the Markov chain order. For a Markov chain of KK symbols, we define CMI of order mm, Ic(m)Ic(m), as the mutual information of two variables in the chain being mm time steps apart, conditioning on the intermediate variables of the chain. We find approximate analytic significance limits based on the estimation bias of CMI and develop a randomization significance test of Ic(m)Ic(m), where the randomized symbol sequences are formed by random permutation of the components of the original symbol sequence. The significance test is applied for increasing mm and the Markov chain order is estimated by the last order for which the null hypothesis is rejected. We present the appropriateness of CMI-testing on Monte Carlo simulations and compare it to the Akaike and Bayesian information criteria, the maximal fluctuation method (Peres–Shields estimator) and a likelihood ratio test for increasing orders using ??-divergence. The order criterion of CMI-testing turns out to be superior for orders larger than one, but its effectiveness for large orders depends on data availability. In view of the results from the simulations, we interpret the estimated orders by the CMI-testing and the other criteria on genes and intergenic regions of DNA chains.
Keywords:Order estimation  Markov chains  Conditional mutual information (CMI)  Randomization test  DNA
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号