Technical Note: On Ordinal Comparison of Policies in Markov Reward Processes期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

Technical Note: On Ordinal Comparison of Policies in Markov Reward Processes

Authors:	Chang H S

Institution:	(1) Department of Computer Science and Engineering, Sogang University, Seoul, Korea

Abstract:	An asymptotic exponential convergence rate of ordinal comparison from large deviations theory is well known for selecting the true best solution from the candidate solutions sample means. This note supplements the theories developed by Dai within the framework of ergodic Markov reward processes for -ordinal comparison of policies, establishing an asymptotic exponential convergence rate for the infinite-horizon average criterion.

Keywords:	Ordinal comparisons large deviations stochastic simulations Markov reward processes
本文献已被 SpringerLink 等数据库收录！