期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Nearly optimal policies in risk-sensitive positive dynamic programming on discrete spaces

Rolando Cavazos-Cadena Raúl Montes-de-Oca 《Mathematical Methods of Operations Research》2000,52(1):133-167

相似文献

2.

关于相对紧性质的研究

张洪刚《数学的实践与认识》2012,42(21):253-256

利用有限交性质的集族及网的性质描述了相对紧性质,给出了相对紧性质的两个等价结果. 相似文献

3.

Solving the dynamic ambulance relocation and dispatching problem using approximate dynamic programming

Verena Schmid 《European Journal of Operational Research》2012

Emergency service providers are supposed to locate ambulances such that in case of emergency patients can be reached in a time-efficient manner. Two fundamental decisions and choices need to be made real-time. First of all immediately after a request emerges an appropriate vehicle needs to be dispatched and send to the requests’ site. After having served a request the vehicle needs to be relocated to its next waiting location. We are going to propose a model and solve the underlying optimization problem using approximate dynamic programming (ADP), an emerging and powerful tool for solving stochastic and dynamic problems typically arising in the field of operations research. Empirical tests based on real data from the city of Vienna indicate that by deviating from the classical dispatching rules the average response time can be decreased from 4.60 to 4.01 minutes, which corresponds to an improvement of 12.89%. Furthermore we are going to show that it is essential to consider time-dependent information such as travel times and changes with respect to the request volume explicitly. Ignoring the current time and its consequences thereafter during the stage of modeling and optimization leads to suboptimal decisions. 相似文献

4.

The Weak Compactness of the Sets of L₁(

Ω 《数学学报》1999,42(3)

设( 相似文献

5.

Multicriteria dynamic programming with an application to the integer case

B. Villarreal M. H. Karwan 《Journal of Optimization Theory and Applications》1982,38(1):43-69

Fundamental dynamic programming recursive equations are extended to the multicriteria framework. In particular, a more detailed procedure for a general recursive solution scheme for the multicriteria discrete mathematical programming problem is developed. Definitions of lower and upper bounds are offered for the multicriteria case and are incorporated into the recursive equations to aid problem solution by eliminating inefficient subpolicies. Computational results are reported for a set of 0–1 integer linear programming problems.This research was supported in part by CONACYT (Consejo Nacional de Ciencia y Technologia), Mexico City, Mexico. 相似文献

6.

Existence for the dynamic programming equation of control diffusion processes in Hilbert space

T. Havârneanu 《Nonlinear Analysis: Theory, Methods & Applications》1985,9(6):619-629

相似文献

7.

Structural results for the control of queueing systems using event-based dynamic programming

Koole Ger 《Queueing Systems》1998,30(3-4):323-339

In this paper we study monotonicity results for optimal policies of various queueing and resource sharing models. The standard approach is to propagate, for each specific model, certain properties of the dynamic programming value function. We propose a unified treatment of these models by concentrating on the events and the form of the value function instead of on the value function itself. This is illustrated with the systematic treatment of one and two-dimensional models. This revised version was published online in June 2006 with corrections to the Cover Date. 相似文献

8.

On maximizing the average time at a goal

S. Demko T.P. Hill 《Stochastic Processes and their Applications》1984,17(2):349-357

In a decision process (gambling or dynamic programming problem) with finite state space and arbitrary decision sets (gambles or actions), there is always available a Markov strategy which uniformly (nearly) maximizes the average time spent at a goal. If the decision sets are closed, there is even a stationary strategy with the same property.Examples are given to show that approximations by discounted or finite horizon payoffs are not useful for the general average reward problem. 相似文献

9.

求动态规划最优解的一种简便新方法——图解法

林志红《数学理论与应用》2005,25(1):125-128

离散型确定性的动态规划问题，是运筹学规划论中一个重要的组成部分，其内容包含的问题比较多．求其最优解的方法，叫逆序法(又叫回推法)．本提出一种统一的解法——图解法．其优点是：方便简便，计算简单．相似文献

10.

A dynamic programming approach for the airport capacity allocation problem 总被引：5，自引：0，他引：5

Dell'Olmo Paolo; Lulli Guglielmo 《IMA Journal of Management Mathematics》2003,14(3):235-249

In most of the optimization models developed to manage airportsoperations, arrivals and departures capacities are treated asindependent variables: that is the number of flights allowedto take off does not affect the number of landings in any unitof time, and vice versa. This assumption is seldom verifiedin most of the congested airports, where many interactions betweenarrivals and departures take place. In this paper, we face the problem of finding the optimal trade-offbetween the number of arrivals and departures in order to reducea delay function of all the flights, using a more realisticrepresentation of the airport capacity, i.e. the capacity envelope. Under the assumption of piecewise linear convex capacity envelopesand of the exact interpolation of all the Pareto-optimal operationalpoints, we show that the problem can be formulated as a linearprogramming model. For general airport capacity envelopes, wepropose a dynamic programming formulation with a correspondingbackward solution algorithm, which is robust, easy to implementand has a linear computational complexity. The algorithm performancesare evaluated on different realistic scenarios, and the optimalsolutions are compared with those computed by a greedy algorithm,which can be seen as an approximation of the current decisionprocedures. The percentage deviation of the cost of these twosolutions ranges from 3.98 to 35.64%. 相似文献

11.

A new dynamic programming algorithm for the single item capacitated dynamic lot size model 总被引：1，自引：0，他引：1

Hsin-Der Chen Donald W. Hearn Chung-Yee Lee 《Journal of Global Optimization》1994,4(3):285-300

We develop a new dynamic programming method for the single item capacitated dynamic lot size model with non-negative demands and no backlogging. This approach builds the Optimal value function in piecewise linear segments. It works very well on the test problems, requiring less than 0.3 seconds to solve problems with 48 periods on a VAX 8600. Problems with the time horizon up to 768 periods are solved. Empirically, the computing effort increases only at a quadratic rate relative to the number of periods in the time horizon.This research was supported in part by NSF grants DDM-8814075 and DMC-8504786. 相似文献

12.

Discounted dynamic programming with unbounded returns: Application to economic models

Anna Ja?kiewicz Andrzej S. Nowak 《Journal of Mathematical Analysis and Applications》2011,378(2):450-462

In this paper, we study discounted Markov decision processes on an uncountable state space. We allow a utility (reward) function to be unbounded both from above and below. A new feature in our approach is an easily verifiable rate of growth condition introduced for a positive part of the utility function. This assumption, in turn, enables us to prove the convergence of a value iteration algorithm to a solution to the Bellman equation. Moreover, by virtue of the optimality equation we show the existence of an optimal stationary policy. 相似文献

13.

Linear programming formulation of MDPs in countable state space: The multichain case

Arie Hordijk Jean B. Lasserre 《Mathematical Methods of Operations Research》1994,40(1):91-108

We present an Linear Programming formulation of MDPs with countable state and action spaces and no unichain assumption. This is an extension of the Hordijk and Kallenberg (1979) formulation in finite state and action spaces. We provide sufficient conditions for both existence of optimal solutions to the primal LP program and absence of duality gap. Then, existence of a (possibly randomized) average optimal policy is also guaranteed. Existence of a stationary average optimal deterministic policy is also investigated. 相似文献

14.

Approximate dynamic programming algorithms for optimal dosage decisions in controlled ovarian hyperstimulation

Miao He Lei Zhao Warren B. Powell 《European Journal of Operational Research》2012

In the controlled ovarian hyperstimulation (COH) treatment, clinicians monitor the patients’ physiological responses to gonadotropin administration to tradeoff between pregnancy probability and ovarian hyperstimulation syndrome (OHSS). We formulate the dosage control problem in the COH treatment as a stochastic dynamic program and design approximate dynamic programming (ADP) algorithms to overcome the well-known curses of dimensionality in Markov decision processes (MDP). Our numerical experiments indicate that the piecewise linear (PWL) approximation ADP algorithms can obtain policies that are very close to the one obtained by the MDP benchmark with significantly less solution time. 相似文献

15.

Approximate dynamic programming via direct search in the space of value function approximations

E.F. Arruda M.D. Fragoso 《European Journal of Operational Research》2011,211(2):343-351

This paper deals with approximate value iteration (AVI) algorithms applied to discounted dynamic programming (DP) problems. For a fixed control policy, the span semi-norm of the so-called Bellman residual is shown to be convex in the Banach space of candidate solutions to the DP problem. This fact motivates the introduction of an AVI algorithm with local search that seeks to minimize the span semi-norm of the Bellman residual in a convex value function approximation space. The novelty here is that the optimality of a point in the approximation architecture is characterized by means of convex optimization concepts and necessary and sufficient conditions to local optimality are derived. The procedure employs the classical AVI algorithm direction (Bellman residual) combined with a set of independent search directions, to improve the convergence rate. It has guaranteed convergence and satisfies, at least, the necessary optimality conditions over a prescribed set of directions. To illustrate the method, examples are presented that deal with a class of problems from the literature and a large state space queueing problem setting. 相似文献

16.

An approximate dynamic programming approach for the vehicle routing problem with stochastic demands

Clara Novoa Robert Storer 《European Journal of Operational Research》2009

This paper examines approximate dynamic programming algorithms for the single-vehicle routing problem with stochastic demands from a dynamic or reoptimization perspective. The methods extend the rollout algorithm by implementing different base sequences (i.e. a priori solutions), look-ahead policies, and pruning schemes. The paper also considers computing the cost-to-go with Monte Carlo simulation in addition to direct approaches. The best new method found is a two-step lookahead rollout started with a stochastic base sequence. The routing cost is about 4.8% less than the one-step rollout algorithm started with a deterministic sequence. Results also show that Monte Carlo cost-to-go estimation reduces computation time 65% in large instances with little or no loss in solution quality. Moreover, the paper compares results to the perfect information case from solving exact a posteriori solutions for sampled vehicle routing problems. The confidence interval for the overall mean difference is (3.56%, 4.11%). 相似文献

17.

On the convergence of stochastic dual dynamic programming and related methods 总被引：1，自引：0，他引：1

A.B. Philpott Z. Guan 《Operations Research Letters》2008,36(4):450-455

We discuss the almost-sure convergence of a broad class of sampling algorithms for multistage stochastic linear programs. We provide a convergence proof based on the finiteness of the set of distinct cut coefficients. This differs from existing published proofs in that it does not require a restrictive assumption. 相似文献

18.

Efficient dynamic programming implementations of Newton's method for unconstrained optimal control problems 总被引：1，自引：0，他引：1

J. C. Dunn D. P. Bertsekas 《Journal of Optimization Theory and Applications》1989,63(1):23-38

Naive implementations of Newton's method for unconstrainedN-stage discrete-time optimal control problems with Bolza objective functions tend to increase in cost likeN ³ asN increases. However, if the inherent recursive structure of the Bolza problem is properly exploited, the cost of computing a Newton step will increase only linearly withN. The efficient Newton implementation scheme proposed here is similar to Mayne's DDP (differential dynamic programming) method but produces the Newton step exactly, even when the dynamical equations are nonlinear. The proposed scheme is also related to a Riccati treatment of the linear, two-point boundary-value problems that characterize optimal solutions. For discrete-time problems, the dynamic programming approach and the Riccati substitution differ in an interesting way; however, these differences essentially vanish in the continuous-time limit.This work was supported by the National Science Foundation, Grant No. DMS-85-03746. 相似文献

19.

Recent results on conditions for the existence of average optimal stationary policies

Rolando Cavazos-Cadena 《Annals of Operations Research》1991,28(1):3-27

This paper concerns countable state space Markov decision processes endowed with a (long-run expected)average reward criterion. For these models we summarize and, in some cases,extend some recent results on sufficient conditions to establish the existence of optimal stationary policies. The topics considered are the following: (i) the new assumptions introduced by Sennott in [20–23], (ii)necessary and sufficient conditions for the existence of a bounded solution to the optimality equation, and (iii) equivalence of average optimality criteria. Some problems are posed.This research was partially supported by the Third World Academy of Sciences (TWAS) under Grant No. TWAS RG MP 898-152. 相似文献

20.

The generalized optimality conditions of multiobjective programming problem in topological vector space

Yuda Hu Chen Ling 《Journal of Mathematical Analysis and Applications》2004,290(2):363-372

In this study, an alternative theorem for the subconvexlike mapping in topological vector space is established. With this alternative theorem as an aid, the generalized Fritz John conditions and the generalized Kuhn-Tucker conditions in terms of Gâteaux derivatives of multiobjective programming problem in the ordered topological vector space are given. 相似文献