Structured policies in the sequential design of experiments

Authors:	Ulrich Rieder, Hartmut Wagner

Institution:	Department of Mathematics, University of Ulm, D-7900 Ulm, Germany

Abstract:	A general control model under uncertainty is considered. Using a Bayesian approach and dynamic programming, we investigate structural properties of optimal decision rules. In particular, we show the monotonicity of the total expected reward and of the so-called Gittins index. We extend the stopping rule and the stay-on-a-winner rule, which are well known in bandit problems. Our approach is based on the multivariate likelihood ratio order and TP₂ functions.
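For readers unfamiliar with the TP₂ property named in the abstract, the following is a minimal sketch that is not taken from the paper. It uses the standard definition: a non-negative function f(x, y) is TP₂ (totally positive of order 2) if f(x1, y1) f(x2, y2) >= f(x1, y2) f(x2, y1) whenever x1 < x2 and y1 < y2. The helper names `is_tp2`, `binomial_pmf`, and `binomial_kernel` are hypothetical, and the binomial family is chosen here only as a familiar example from Bernoulli bandit settings, not as the model the authors study.

```python
from itertools import combinations
from math import comb


def is_tp2(matrix, tol=1e-12):
    """Return True if every 2x2 minor of `matrix` is non-negative.

    Rows and columns are assumed to be listed in increasing order of
    the underlying variables (an assumption of this sketch).
    """
    rows = range(len(matrix))
    cols = range(len(matrix[0]))
    for i1, i2 in combinations(rows, 2):
        for j1, j2 in combinations(cols, 2):
            minor = (matrix[i1][j1] * matrix[i2][j2]
                     - matrix[i1][j2] * matrix[i2][j1])
            if minor < -tol:
                return False
    return True


def binomial_pmf(n, k, p):
    """Probability of k successes in n Bernoulli(p) trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def binomial_kernel(n, probs):
    """Matrix with one row per success probability, one column per k."""
    return [[binomial_pmf(n, k, p) for k in range(n + 1)] for p in probs]


if __name__ == "__main__":
    increasing = binomial_kernel(5, [0.2, 0.5, 0.8])
    decreasing = binomial_kernel(5, [0.8, 0.5, 0.2])
    print(is_tp2(increasing))
    print(is_tp2(decreasing))
```

With the success probabilities listed in increasing order the binomial kernel passes the check, which is the monotone likelihood ratio property in the one-dimensional case; listing them in decreasing order reverses the inequality and the check fails.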

This article is indexed in SpringerLink and other databases.