共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
3.
We introduce and study a class of non-stationary semi-Markov decision processes on a finite horizon. By constructing an equivalent Markov decision process, we establish the existence of a piecewise open loop relaxed control which is optimal for the finite horizon problem. 相似文献
4.
This paper deals with semi-Markov decision processes under the average expected criterion. The state and action spaces are Borel spaces, and the cost/reward function is allowed to be unbounded from above and from below. We give another set of conditions, under which the existence of an optimal (deterministic) stationary policy is proven by a new technique of two average optimality inequalities. Our conditions are slightly weaker than those in the existing literature, and some new sufficient conditions for the verifications of our assumptions are imposed on the primitive data of the model. Finally, we illustrate our results with three examples. 相似文献
5.
6.
Nikolaos Limnios 《Methodology and Computing in Applied Probability》2012,14(4):895-917
The aim of this paper is to present a systematic modeling of reliability and related measures: availability, maintainability, failure rate, rate of occurrence of failures, mean times, etc., known in the literature under the term dependability. This model includes the continuous and discrete time semi-Markov processes with general state space. This is one of the most general models in reliability theory since it includes as particular cases the Markov and renewal processes. 相似文献
7.
8.
Rolando Cavazos-Cadena Francisco Salem-Silva 《Applied Mathematics and Optimization》2010,61(2):167-190
This note concerns discrete-time controlled Markov chains with Borel state and action spaces. Given a nonnegative cost function, the performance of a control policy is measured by the superior limit risk-sensitive average criterion associated with a constant and positive risk sensitivity coefficient. Within such a framework, the discounted approach is used (a) to establish the existence of solutions for the corresponding optimality inequality, and (b) to show that, under mild conditions on the cost function, the optimal value functions corresponding to the superior and inferior limit average criteria coincide on a certain subset of the state space. The approach of the paper relies on standard dynamic programming ideas and on a simple analytical derivation of a Tauberian relation. 相似文献
9.
In this article, we study risk-sensitive control problem with controlled continuous time Markov chain state dynamics. Using multiplicative dynamic programming principle along with the atomic structure of the state dynamics, we prove the existence and a characterization of optimal risk-sensitive control under geometric ergodicity of the state dynamics along with a smallness condition on the running cost. 相似文献
10.
基于状态空间模型与改进HSMM相结合的装备故障诊断方法研究 总被引:1,自引:0,他引:1
提出了一种将状态空间模型与隐半马尔科夫模型(HSMM)相结合的装备故障诊断方法,将能有效利用直至前一时刻状态监测数据的HSMM预测概率模型与能有效利用直至当前状态监测数据的状态空间模型进行了有机结合.针对已有的HSMM研究中状态持续时间均服从指数分布,小样本下求解状态持续时间分布难的缺陷,以及将每一监测时刻的状态剩余持续时间与状态的持续时间完全等同起来造成诊断误差的不足,对HSMM进行了两点改进:一是将状态持续时间的概率分布连续化、参数化,并将其先验分布设定为威布尔分布;二是基于状态开始时间识别提出了状态剩余持续时间的概念.对基于改进HSMM的预测概率进行了推导,给出了基于直至当前状态监测数据下故障诊断判据的计算公式和应用这一新方法进行故障诊断的步骤.案例研究表明方法是合理有效的. 相似文献
11.
Rolando Cavazos-Cadena Raúl Montes-de-Oca Karel Sladký 《Journal of Optimization Theory and Applications》2014,163(2):674-684
This note deals with Markov decision chains evolving on a denumerable state space. Under standard continuity-compactness requirements, an explicit example is provided to show that, with respect to a strong sample-path average reward criterion, the Lyapunov function condition does not ensure the existence of an optimal stationary policy. 相似文献
12.
13.
研究了一般状态空间马氏链随机泛函的矩,利用最小非负解理论,得到了随机泛函的矩是相应方程的最小非负解,作为应用,证明了随机泛函的矩与漂移条件等价. 相似文献
14.
V. S. Koroliuk V. V. Koroliuk N. Limnios 《Methodology and Computing in Applied Probability》2009,11(2):201-209
We study asymptotic average and diffusion approximation schemes for semi-Markov queuing systems by a random evolution approach
and using compensating operator of the corresponding extended Markov renewal process. These results generalize Markov and
renewal flow queuing systems.
相似文献
15.
16.
Chen Fang Guo Xianping Liao Zhong-Wei 《Journal of Optimization Theory and Applications》2022,194(2):408-439
Journal of Optimization Theory and Applications - In this paper, we consider the optimal stopping problems on semi-Markov processes (sMPs) with finite horizon and aim to establish the existence and... 相似文献
17.
S. V. Nagaev 《Siberian Advances in Mathematics》2018,28(4):265-302
We consider a Markov chain with general state space and an embedded Markov chain sampled at the times of successive returns to a subsetA0 of the state space.We assume that the latter chain is uniformly ergodic but the originalMarkov chain need not possess this property.We develop amodification of the spectralmethod and utilize it in proving the central limit theorem for theMarkov chain under consideration. 相似文献
18.
Rolando Cavazos-Cadena Daniel Hernandez-Hernandez 《Applied Mathematics and Optimization》2006,53(1):101-119
Given a discrete-time Markov chain with finite state space and a stationary transition matrix, a system of "local" Poisson
equations characterizing the (exponential) Varadhan's functional J(·) is given. The main results, which
are derived for an arbitrary transition structure so that J(·) may be nonconstant, are as follows: (i) Any solution to the
local Poisson equations immediately renders Varadhan's functional, and (ii) a solution of the system always exist. The proof
of this latter result is constructive and suggests a method
to solve the local Poisson equations. 相似文献
19.
本文考虑具有 Borel状态空间和行动空间非平稳 MDP的平均方差准则.首先,在遍历条件下,利用最优方程,证明了关于平均期望目标最优马氏策略的存在性.然后,通过构造新的模型,利用马氏过程的理论,进一步证明了在关于平均期望目标是最优的一类马氏策略中,存在一个马氏策略使得平均方差达到最小.作为本文的特例还得到了 Dynkin E. B.和 Yushkevich A. A.及 Kurano M.等中的主要结果. 相似文献