Markov ratio decision processes期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

Markov ratio decision processes

Authors:	V Aggarwal R Chandrasekaran K P K Nair

Institution:	(1) School of Administration, University of New Brunswick, Fredericton, New Brunswick, Canada;(2) School of Management, University of Texas at Dallas, Dallas, Texas

Abstract:	A finite-state Markov decision process, in which, associated with each action in each state, there are two rewards, is considered. The objective is to optimize the ratio of the two rewards over an infinite horizon. In the discounted version of this decision problem, it is shown that the optimal value is unique and the optimal strategy is pure and stationary; however, they are dependent on the starting state. Also, a finite algorithm for computing the solution is given.

Keywords:	Markov decision processes ratio rewards discounting optimal solutions algorithms
本文献已被 SpringerLink 等数据库收录！