共查询到20条相似文献,搜索用时 15 毫秒
1.
Gianfranco Ciardo 《PAMM》2007,7(1):1080705-1080706
Decision diagrams of various types can be used to encode the exact state space and transition rate matrix of large Markov models. However, the exact solution of such models still requires to store at least one real vector with one entry per reachable state, a formidable limitation to the practical use of these encodings. Thus, we discuss automatic techniques for the approximate computation of performance measures when the Markov model can be compactly encoded but not exactly solved. (© 2008 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim) 相似文献
2.
3.
4.
Hung-Chu Hsu 《Monatshefte für Mathematik》2015,176(1):143-152
5.
6.
类似[1-4],折扣向量值马氏决策规划(DVMDP)描述为: {S,(A(i),i∈S),q,r,Vβ},(1)其中S为可数状态集,A(i)是有限决策集,q(j|i,a)是转移概率,r=r(i,j,a)=(r1(i,j,a),r2(i,j,a),…,rp(i,j,a))是状态i处使用决策a于下一步转移到状态j时所获的p维报酬向量,r(i,j,a)对i,j,a一致有界,Vβ是β折扣目标. 以Π表一般策略集,若 π ∈Π有 Vβ(π)≤ Vβ(π*),则称π* 为DVMDP的“强有效策略”,Vβ(π*)为“… 相似文献
7.
L. C. Thomas 《Journal of Mathematical Analysis and Applications》1981,80(2):294-297
The paper describes a bound on the optimal value of a Markov Decision Process using the iterates of the value iteration algorithm. Previous bounds have depended upon the values of the last two iterations, but here we consider a bound which depends on the last three iterates. We show that in a maximisation problem, this leads to a better lower bound on the optimal value. 相似文献
8.
Simona Popp 《Zeitschrift für Angewandte Mathematik und Physik (ZAMP)》1969,20(6):990-993
Résumé On considère l'écoulement plan, permanent et irrotationnel d'un jet gazeux aux grandes vitesses subsoniques. En appliquant le procédé deFalkovich, concernant la méthode hodographique deChaplygin, on obtient la solution exacte pour le modèle de Roshko situé dans un canal aux parois parallèles (Figure). On détermine l'expression exacte du coefficient de résistance et on donne quelques relations entre les différents paramètres de la configuration. 相似文献
9.
T. W. Archibald K. I. M. McKinnon L. C. Thomas 《The Journal of the Operational Research Society》1995,46(3):354-361
Comparisons of the performance of solution algorithms for Markov decision processes rely heavily on problem generators to provide sizeable sets of test problems. Existing generation techniques allow little control over the properties of the test problems and often result in problems which are not typical of real-world examples. This paper identifies the properties of Markov decision processes which affect the performance of solution algorithms, and also describes a new problem generation technique which allows all of these properties to be controlled. 相似文献
10.
In this paper, we show that a discounted continuous-time Markov decision process in Borel spaces with randomized history-dependent policies, arbitrarily unbounded transition rates and a non-negative reward rate is equivalent to a discrete-time Markov decision process. Based on a completely new proof, which does not involve Kolmogorov??s forward equation, it is shown that the value function for both models is given by the minimal non-negative solution to the same Bellman equation. A verifiable necessary and sufficient condition for the finiteness of this value function is given, which induces a new condition for the non-explosion of the underlying controlled process. 相似文献
11.
12.
离散事件系统静态稳定性的马氏决策过程方法 总被引:3,自引:0,他引:3
本文用马氏决策过程方法来讨论离散事件系统(DES)的静态稳定性问题,包括强吸引域和弱吸引域的计算,同时讨论了弱吸引域的稳定控制器的计算问题,所用方法不要求给定谓词的∑-(或∑u-)不变性和对环路的判别。 相似文献
13.
We consider q-component models on both the integer lattice ℤ2 and a second-order Cayley tree and study phase diagrams of these models.
__________
Translated from Teoreticheskaya i Matematicheskaya Fizika, Vol. 149, No. 2, pp. 244–251, November, 2006. 相似文献
14.
15.
《Mathematical and Computer Modelling》2000,31(10-12):289-298
A characterization is given of the complete time-dependent evolution of a general stochastic rumour which includes the two Daley-Kendall models and the Maki-Thompson model as special cases. 相似文献
16.
17.
The method introduced by Leroux [Maximum likelihood estimation for hidden Markov models, Stochastic Process Appl. 40 (1992) 127–143] to study the exact likelihood of hidden Markov models is extended to the case where the state variable evolves in an open interval of the real line. Under rather minimal assumptions, we obtain the convergence of the normalized log-likelihood function to a limit that we identify at the true value of the parameter. The method is illustrated in full details on the Kalman filter model. 相似文献
18.
We prove that any ergodic non-singular transformation is orbit equivalent to a Markov odometer which is uniquely ergodic. 相似文献
19.
《Stochastic Processes and their Applications》2020,130(2):733-759
We establish the posterior consistency for parametric, partially observed, fully dominated Markov models. The prior is assumed to assign positive probability to all neighborhoods of the true parameter, for a distance induced by the expected Kullback–Leibler divergence between the parametric family members’ Markov transition densities. This assumption is easily checked in general. In addition, we show that the posterior consistency is implied by the consistency of the maximum likelihood estimator. The result is extended to possibly improper priors and non-stationary observations. Finally, we check our assumptions on a linear Gaussian model and a well-known stochastic volatility model. 相似文献
20.
Adirectional Markov models for 2D systems 总被引:1,自引:0,他引:1
Several deterministic Markovian properties are introduced inan acausal context. We analyse the relationship between theseproperties and investigate to which kind of models they correspondin the scalar case. We find that, in this case, the nearest-neighbourmodel is the natural model describing such systems. In the generalmultivariable case, we need higher-order models even in theID case. 相似文献