首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
We study an infinite horizon optimal stopping Markov problem which is either undiscounted (total reward) or with a general Markovian discount rate. Using ergodic properties of the underlying Markov process, we establish the feasibility of the stopping problem and prove the existence of optimal and εε-optimal stopping times. We show the continuity of the value function and its variational characterisation (in the viscosity sense) under different sets of assumptions satisfied by large classes of diffusion and jump–diffusion processes. In the case of a general discounted problem we relax a classical assumption that the discount rate is uniformly separated from zero.  相似文献   

2.
This paper deals with the optimal stopping problem under partial observation for piecewise-deterministic Markov processes. We first obtain a recursive formulation of the optimal filter process and derive the dynamic programming equation of the partially observed optimal stopping problem. Then, we propose a numerical method, based on the quantization of the discrete-time filter process and the inter-jump times, to approximate the value function and to compute an ??-optimal stopping time. We prove the convergence of the algorithms and bound the rates of convergence.  相似文献   

3.
4.
One-armed bandit models with continuous and delayed responses   总被引:2,自引:0,他引:2  
One-armed bandit processes with continuous delayed responses are formulated as controlled stochastic processes following the Bayesian approach. It is shown that under some regularity conditions, a Gittins-like index exists which is the limit of a monotonic sequence of break-even values characterizing optimal initial selections of arms for finite horizon bandit processes. Furthermore, there is an optimal stopping solution when all observations on the unknown arm are complete. Results are illustrated with a bandit model having exponentially distributed responses, in which case the controlled stochastic process becomes a Markov decision process, the Gittins-like index is the Gittins index and the Gittins index strategy is optimal. Acknowledgement.We thank an anonymous referee for constructive and insightful comments, especially those related to the notion of the Gittins index.Both authors are funded by the Natural Sciences and Engineering Research Council (NSERC) of Canada.  相似文献   

5.
Suppose you have one unit of stock, currently worth 1, which you must sell before time T. The Optional Sampling Theorem tells us that whatever stopping time we choose to sell, the expected discounted value we get when we sell will be 1. Suppose however that we are able to see a units of time into the future, and base our stopping rule on that; we should be able to do better than expected value 1. But how much better can we do? And how would we exploit the additional information? The optimal solution to this problem will never be found, but in this paper we establish remarkably close bounds on the value of the problem, and we derive a fairly simple exercise rule that manages to extract most of the value of foresight.  相似文献   

6.
We consider the problem of maximizing the long-run average reward in a service facility with dynamic pricing. We investigate sensitivity of optimal pricing policies to the parameters of the service facility which is modelled as an M/M/s/KM/M/s/K queueing system. Arrival process to the facility is Poisson with arrival rate a decreasing function of the price currently being charged by the facility. We prove structural results on the optimal pricing policies when the parameters in the facility change. Namely, we show that optimal prices decrease when the capacity of the facility or the number of servers in the facility increase. Under a reasonable assumption, we also show that optimal prices increase as the overall demand for the service provided by the facility increases or when the service rate of the facility decreases. We illustrate how these structural results simplify the required computational effort while finding the optimal policy.  相似文献   

7.
In this paper, we study probabilistic numerical methods based on optimal quantization algorithms for computing the solution to optimal multiple switching problems with regime-dependent state process. We first consider a discrete-time approximation of the optimal switching problem, and analyse its rate of convergence. Given a time step hh, the error is in general of order (hlog(1/h))1/2(hlog(1/h))1/2, and of order h1/2h1/2 when the switching costs do not depend on the state process. We next propose quantization numerical schemes for the space discretization of the discrete-time Euler state process. A Markovian quantization approach relying on the optimal quantization of the normal distribution arising in the Euler scheme is analysed. In the particular case of uncontrolled state process, we describe an alternative marginal quantization method, which extends the recursive algorithm for optimal stopping problems as in Bally (2003) [1]. A priori LpLp-error estimates are stated in terms of quantization errors. Finally, some numerical tests are performed for an optimal switching problem with two regimes.  相似文献   

8.
We consider a discrete-time constrained Markov decision process under the discounted cost optimality criterion. The state and action spaces are assumed to be Borel spaces, while the cost and constraint functions might be unbounded. We are interested in approximating numerically the optimal discounted constrained cost. To this end, we suppose that the transition kernel of the Markov decision process is absolutely continuous with respect to some probability measure μ  . Then, by solving the linear programming formulation of a constrained control problem related to the empirical probability measure μnμn of μ, we obtain the corresponding approximation of the optimal constrained cost. We derive a concentration inequality which gives bounds on the probability that the estimation error is larger than some given constant. This bound is shown to decrease exponentially in n. Our theoretical results are illustrated with a numerical application based on a stochastic version of the Beverton–Holt population model.  相似文献   

9.
10.
We examine the connections between a novel class of multi-person stopping games with redistribution of payoffs and multi-dimensional reflected BSDEs in discrete- and continuous-time frameworks. Our goal is to provide an essential extension of classic results for two-player stopping games (Dynkin games) to the multi-player framework. We show the link between certain multi-period mm-player stopping games and a new kind of mm-dimensional reflected BSDEs. The existence and uniqueness of a solution to continuous-time reflected BSDEs are established. Continuous-time redistribution games are constructed with the help of reflected BSDEs and a characterization of the value of such stopping games is provided.  相似文献   

11.
In this paper we investigate the distribution of trimmed sums of dependent observations with heavy tails. We consider the case of autoregressive processes of order one with independent innovations in the domain of attraction of a stable law. We show if the d largest (in magnitude) terms are removed from the sample, then the sum of the remaining elements satisfies a functional central limit theorem with random centering provided d=d(n)nγ (for some γ>0) and d(n)/n0. This result is used to get asymptotics for the widely used CUSUM process in case of dependent heavy tailed observations.  相似文献   

12.
This paper proposes two related approximation schemes, based on a discrete grid on a finite time interval [0,T][0,T], and having a finite number of states, for a pure jump Lévy process LtLt. The sequences of discrete processes converge to the original process, as the time interval becomes finer and the number of states grows larger, in various modes of weak and strong convergence, according to the way they are constructed. An important feature is that the filtrations generated at each stage by the approximations are sub-filtrations of the filtration generated by the continuous time Lévy process. This property is useful for applications of these results, especially to optimal stopping problems, as we illustrate with an application to American option pricing. The rates of convergence of the discrete approximations to the underlying continuous time process are assessed in terms of a “complexity” measure for the option pricing algorithm.  相似文献   

13.
14.
We investigate the high resolution quantization and entropy coding problem for solutions of stochastic differential equations under supremum norm distortion. Tight asymptotic formulas are found under mild regularity assumptions. The main technical tool is a decoupling method which allows us to relate the complexity of the diffusion process to that of the Wiener process. The technique is also applicable when considering the Lp[0,1]Lp[0,1]-norm distortion.  相似文献   

15.
Let X be a regular one-dimensional transient diffusion and Ly be its local time at y. The stochastic differential equation (SDE) whose solution corresponds to the process X conditioned on [Ly=a] for a given a0 is constructed and a new path decomposition result for transient diffusions is given. In the course of the construction Bessel-type motions as well as their SDE representations are studied. Moreover, the Engelbert–Schmidt theory for the weak solutions of one dimensional SDEs is extended to the case when the initial condition is an entrance boundary for the diffusion. This extension was necessary for the construction of the Bessel-type motion which played an essential part in the SDE representation of X conditioned on [Ly=a].  相似文献   

16.
Let Z={Zt(h);h∈Rd,t∈R}Z={Zt(h);hRd,tR} be a space–time Gaussian process which is stationary in the time variable tt. We study Mn(h)=supt[0,n]Zt(snh)Mn(h)=supt[0,n]Zt(snh), the supremum of ZZ taken over t∈[0,n]t[0,n] and rescaled by a properly chosen sequence sn→0sn0. Under appropriate conditions on ZZ, we show that for some normalizing sequence bn→∞bn, the process bn(Mnbn)bn(Mnbn) converges as n→∞n to a stationary max-stable process of Brown–Resnick type. Using strong approximation, we derive an analogous result for the empirical process.  相似文献   

17.
In this paper we solve the exit problems for (reflected) spectrally negative Lévy processes, which are exponentially killed with a killing intensity dependent on the present state of the process and analyze respective resolvents. All identities are given in terms of new generalizations of scale functions. For the particular cases ω(x)=q and ω(x)=q1(a,b)(x), we obtain results for the classical exit problems and the Laplace transforms of the occupation times in a given interval, until first passage times, respectively. Our results can also be applied to find the bankruptcy probability in the so-called Omega model, where bankruptcy occurs at rate ω(x) when the Lévy surplus process is at level x<0. Finally, we apply these results to obtain some exit identities for spectrally positive self-similar Markov processes. The main method throughout all the proofs relies on the classical fluctuation identities for Lévy processes, the Markov property and some basic properties of a Poisson process.  相似文献   

18.
19.
This paper develops systematically the stochastic calculus via regularization in the case of jump processes. In particular one continues the analysis of real-valued càdlàg weak Dirichlet processes with respect to a given filtration. Such a process is the sum of a local martingale and an adapted process A such that [N,A]=0, for any continuous local martingale N. Given a function u:[0,T]×RR, which is of class C0,1 (or sometimes less), we provide a chain rule type expansion for u(t,Xt) which stands in applications for a chain Itô type rule.  相似文献   

20.
In this paper, we analyze a real-valued reflected backward stochastic differential equation (RBSDE) with an unbounded obstacle and an unbounded terminal condition when its generator ff has quadratic growth in the zz-variable. In particular, we obtain existence, uniqueness, and stability results, and consider the optimal stopping for quadratic gg-evaluations. As an application of our results we analyze the obstacle problem for semi-linear parabolic PDEs in which the non-linearity appears as the square of the gradient. Finally, we prove a comparison theorem for these obstacle problems when the generator is concave in the zz-variable.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号