首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 375 毫秒
1.
In this paper, we discuss Markovian decision programming with recursive vector-reward andgive an algorithm to find optimal policies. We prove that: (1) There is a Markovian optimal policy for the nonstationary case; (2) Thereis a stationary optimal policy for the stationary case.  相似文献   

2.
This paper considers a first passage model for discounted semi-Markov decision processes with denumerable states and nonnegative costs.The criterion to be optimized is the expected discounted cost incurred during a first passage time to a given target set.We first construct a semi-Markov decision process under a given semi-Markov decision kernel and a policy.Then,we prove that the value function satisfies the optimality equation and there exists an optimal(or e-optimal) stationary policy under suitable conditions by using a minimum nonnegative solution approach.Further we give some properties of optimal policies.In addition,a value iteration algorithm for computing the value function and optimal policies is developed and an example is given.Finally,it is showed that our model is an extension of the first passage models for both discrete-time and continuous-time Markov decision processes.  相似文献   

3.
Decision makers often face the need of performance guarantee with some sufficiently high probability. Such problems can be modelled using a discrete time Markov decision process (MDP) with a probability criterion for the first achieving target value. The objective is to find a policy that maximizes the probability of the total discounted reward exceeding a target value in the preceding stages. We show that our formulation cannot be described by former models with standard criteria. We provide the properties of the objective functions, optimal value functions and optimal policies. An algorithm for computing the optimal policies for the finite horizon case is given. In this stochastic stopping model, we prove that there exists an optimal deterministic and stationary policy and the optimality equation has a unique solution. Using perturbation analysis, we approximate general models and prove the existence of e-optimal policy for finite state space. We give an example for the reliability of the satellite sy  相似文献   

4.
Let p≥7 be an odd prime. Based on the Toda bracket α1βp-11, α1 β1, p, γs,the authors show that the relation α1βp-11h2,0 γs= βp/p-1γs holds. As a result, they can obtain α1βp1h2,0 γs = 0 ∈π*(S0) for 2≤s≤p- 2, even though α1h2,0γs and β1α1h2,0 γs are not trivial. They also prove that βp-11α1 h2,0 γ3 is nontrivial in π*(S0) and conjecture that βp-11α1 h2,0 γs is nontrivial in π*(S0) for 3≤s≤p- 2. Moreover, it is known thatβp/p-1γ3 = 0 ∈ Ext5,*BP*BP(BP*, BP*), but βp/p-1γ3 is nontrivial in π*(S0) and represents the element βp-11α1 h2,0 γ3.  相似文献   

5.
Necessary and sufficient conditions are studied that a bounded operator T_x =(x_1~*x, x_2~*x,···) on the space ?_∞, where x_n~*∈ ?_∞~*, is lower or upper semi-Fredholm; in particular, topological properties of the set {x_1~*, x_2~*,···} are investigated. Various estimates of the defect d(T) = codim R(T), where R(T) is the range of T, are given. The case of x_n~*= d_nx_(tn)~*,where dn ∈ R and x_(tn)~*≥ 0 are extreme points of the unit ball B_?_∞~*, that is, t_n ∈βN, is considered. In terms of the sequence {t_n}, the conditions of the closedness of the range R(T)are given and the value d(T) is calculated. For example, the condition {n:0 |d_n| δ} = Φ for some δ is sufficient and if for large n points tn are isolated elements of the sequence {t_n},then it is also necessary for the closedness of R(T)(t_(n0) is isolated if there is a neighborhood U of t_(n0) satisfying t_n ■ U for all n ≠ n0). If {n:|d_n| δ} =Φ, then d(T) is equal to the defect δ{_tn} of {t_n}. It is shown that if d(T) = ∞ and R(T) is closed, then there exists a sequence {A_n} of pairwise disjoint subsets of N satisfying χ_(A_n)■R(T).  相似文献   

6.
本文讨论Lippman型无界报酬折扣半马氏决策规划ε最优策略的性质,在§2中证明了:若策略π~*=(π_0~*、π_1~*,…)是ε最优的,则对任何自然数n,策略(π_0~*,π_1~*,…,π_(n+)~*)为(1-β~n)~(-1)ε最优;若策略π~*=(f_0,f_1,…,f_n,π_(n+1),…)是ε最优的,则策略f_n~∞为某ε_n最优。在§3中讨论策略的组合与分解,在§4中给出了一个策略π~*为最优的充要条件和为ε最优的充分条件。  相似文献   

7.
A property(C) for permutation pairs is introduced. It is shown that if a pair{π_1, π_2} of permutations of(1,2,…,n) has property(C),then the D-type map Φ_(π_1,π_2) on n× n complex matrices constructed from {π_1,π_2} is positive. A necessary and sufficient condition is obtained for a pair {π_1,π_2} to have property(C),and an easily checked necessary and sufficient condition for the pairs of the form {π~p,π~q} to have property(C) is given, whereπ is the permutation defined by π(i) = i + 1 mod n and 1≤ p q≤ n.  相似文献   

8.
We consider the tensor product π_α ? π_βof complementary series representations π_α and π_β of classical rank one groups SO_0(n, 1), SU(n, 1) and Sp(n, 1). We prove that there is a discrete component π_(α+β)for small parameters α and β(in our parametrization). We prove further that for SO_0(n, 1) there are finitely many complementary series of the form π_(α+β+2j,)j = 0, 1,..., k, appearing in the tensor product π_α ? π_βof two complementary series π_α and π_β, where k = k(α, β, n) depends on α, β and n.  相似文献   

9.
Assume that {Xn} is a strictly stationary β-mixing random sequence with the β-mixing coefficient βk = O(k-r), 0 < r ≤1. Yu (1994) obtained convergence rates of empirical processes of strictly stationary β-mixing random sequence indexed by bounded classes of functions. Here, a new truncation method is proposed and used to study the convergence for empirical processes of strictly stationary β-mixing sequences indexed by an unbounded class of functions. The research results show that if the envelope of the index class of functions is in Lp, p > 2 or p > 4, uniform convergence rates of empirical processes of strictly stationary β-mixing random sequence over the index classes can reach O((nr/(l+r)/logn)-1/2) or O((nr/(1+r)/ log n)-3/4) and that the Central Limit Theorem does not always hold for the empirical processes.``  相似文献   

10.
We give a lower bound for the first gap λ_2—λ_1 of the twolowerst eigenvalues of the Schr(o|¨)dinger operator-△+W(p) with the Dirichletboundary condition and a strictly convex potential W(p)on M in which M is acompact simple Riemannian manifold with smooth strictly convex boundary (?)MHere a compact Riemannian manifold M is said to be simple if M~(?)M istopologically R~2.We prove thatλ_2-λ_1≥(π~2)/(d~2)+min{0,-(n-1)K}where d is the diameter of M and-(n-1)K,(K≥0)the lower bound of theRicci curvature of M.This work generalizes the results in the classical Eucli-dean situation due to Singer,Wong and Yau,Yu and Zhong to a kind of curvedRiemannian manifold.  相似文献   

11.
This paper is concerned with the convergence of a sequence of discrete-time Markov decision processes(DTMDPs)with constraints,state-action dependent discount factors,and possibly unbounded costs.Using the convex analytic approach under mild conditions,we prove that the optimal values and optimal policies of the original DTMDPs converge to those of the"limit"one.Furthermore,we show that any countablestate DTMDP can be approximated by a sequence of finite-state DTMDPs,which are constructed using the truncation technique.Finally,we illustrate the approximation by solving a controlled queueing system numerically,and give the corresponding error bound of the approximation.  相似文献   

12.
We prove that the class of Banach spaces Y such that the pair(?_1, Y) has the Bishop-Phelps-Bollobás property for operators is stable under finite products when the norm of the product is given by an absolute norm. We also provide examples showing that the previous stability results obtained for that property are optimal.  相似文献   

13.
In this paper we discuss the discrete, time non--homogeneous discounted Markovian decisionprogramming, where the state space and all action sets are countable. Suppose that the optimumvalue function is finite. We give the necessary and sufficient conditions for the existence of anoptimal policy. Suppose that the absolute mean of rewards is relatively bounded. We also give thenecessary and sufficient conditions for the existence of an optimal policy.  相似文献   

14.
In this paper,we first show that for a Banach space X,there is a fully order-reversing mapping T from conv(X)(the cone of all the extended real-valued lower semicontinuous proper convex functions defined on X) onto itself if and only if X is reflexive and linearly isomorphic to its dual X~*.Then we further prove the following generalized Artstein-Avidan-Milman representation theorem:For every fully order-reversing mapping T:conv(X)→conv(X),there exist a linear isomorphism U:X→X~*,x_0~*,φ_0∈X~*,α0 and r_0∈R so that(Tf)(x)=α(Ff)(Ux+x_0~*)+φ_0,x+r_0,■x∈X where T:conv(X)→conv(X~*) is the Fenchel transform.Hence,these resolve two open questions.We also show several representation theorems of fully order-preserving mappings defined on certain cones of convex functions.For example,for every fully order-preserving mapping S:semn(X)→ semn(X),there is a linear isomorphism U:X→ X so that(Sf)(x)=f(Ux),■f∈semn(X),x∈X where semn(X) is the cone of all the lower semicontinuous seminorms on X.  相似文献   

15.
王慕秋  王联 《数学学报》1980,23(3):331-340
<正> §1.前言研究方程(d~2φ)/(dt~2)+α(dφ)/(dt)+F(φ)=β(1)其中α>0,β>0,F(φ)满足下列条件:(i)F(φ)=F(φ+2π),F(φ)=-F(-φ),F(0)=0,F(φ)∈C~1.(ii)在[0,π]上,φ=0,φ′_1,…,φ′_(n-1),π为 F(φ)=0之单根.(iii)在[0,π]上,曲线 y=F(φ)在φ_1~*,…,φ_n~*处取极值,不妨设在φ_1~*,φ_3~*,…处取极大值,在φ_2~*,φ_4~*,…处取极小值.  相似文献   

16.
Suppose that a continuous 27r-periodic function f on the real axis changes its monotonicity at points y_1:-π≤ y_(2s)y_(2s-1)… y_1 π,s ∈ IN.In this PaPer,for each n≥N,a trigonometric polynomial P_n of order cn is found such that:P_n has the same monotonicity as f,everywhere except,perhaps,the small intervals(y_i-π/n,y_i+π/n)and‖f-P_n‖c(s)ω_3(f,π/n),where N is a constant depending only on mini=1,...,2s {y_i-y_(i+1)},c,c(s) are constants depending only on s,ω_3(f_1,·) is the modulus of smoothness of the 3-rd order of the function f,and ||·|| is the max-norm.  相似文献   

17.
Consider the following nonlinear programming problem:where f, gi's are sufficiently smooth functions in R~n. LetIt is well known that if x~*∈Ω is a solution of (1) then there exists λ~*=(λ_I~*,…,λ_m~*)∈R~m such that(x~*,λ~*) is a solution of the K-K-T system  相似文献   

18.
For partial linear model Y=X~τβ_0 _(g0)(T) εwith unknown β_0∈R~d and an unknown smooth function go, this paper considers the Huber-Dutter estimators of β_0, scale σfor the errors and the function go respectively, in which the smoothing B-spline function is used. Under some regular conditions, it is shown that the Huber-Dutter estimators of β_0 and σare asymptotically normal with convergence rate n~((-1)/2) and the B-spline Huber-Dutter estimator of go achieves the optimal convergence rate in nonparametric regression. A simulation study demonstrates that the Huber-Dutter estimator of β_0 is competitive with its M-estimator without scale parameter and the ordinary least square estimator. An example is presented after the simulation study.  相似文献   

19.
Suppose that C 1 and C 2 are two simple curves joining 0 to ∞, non-intersecting in the finite plane except at 0 and enclosing a domain D which is such that, for all large r, the set {θ : re iθ∈ D} has measure at most 2α, where 0 α π. Suppose also that u is a non-constant subharmonic function in the plane such that u(z) = Φ(|z|) for all large z ∈ C 1 ∪ C 2 ∪~D, where Φ(|z|) is a convex, non-decreasing function of |z| and ~D is the complement of D. Let A D (r, u) = inf{u(z) : z ∈ D and |z| = r}. It is shown that if A D (r, u) = O(1) then lim inf r→∞ B(r, u)/r π/(2α) 0.  相似文献   

20.
Using the blocking techniques and m-dependent methods,the asymptotic behavior of kernel density estimators for a class of stationary processes,which includes some nonlinear time series models,is investigated.First,the pointwise and uniformly weak convergence rates of the deviation of kernel density estimator with respect to its mean(and the true density function)are derived.Secondly,the corresponding strong convergence rates are investigated.It is showed,under mild conditions on the kernel functions and bandwidths,that the optimal rates for the i.i.d.density models are also optimal for these processes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号