期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Reduction of total-cost and average-cost MDPs with weakly continuous transition probabilities to discounted MDPs

Eugene A. Feinberg Jefferson Huang 《Operations Research Letters》2018,46(2):179-184

This note describes sufficient conditions under which total-cost and average-cost Markov decision processes (MDPs) with general state and action spaces, and with weakly continuous transition probabilities, can be reduced to discounted MDPs. For undiscounted problems, these reductions imply the validity of optimality equations and the existence of stationary optimal policies. The reductions also provide methods for computing optimal policies. The results are applied to a capacitated inventory control problem with fixed costs and lost sales. 相似文献

2.

Strong convergence and certain control conditions for modified Mann iteration

Yonghong Yao Rudong Chen Jen-Chih Yao 《Nonlinear Analysis: Theory, Methods & Applications》2008

In this paper we propose a new modified Mann iteration for computing fixed points of nonexpansive mappings in a Banach space setting. This new iterative scheme combines the modified Mann iteration introduced by Kim and Xu [T.H. Kim, H.K. Xu, Strong convergence of modified Mann iterations, Nonlinear Anal. 61 (2005) 51–60] and the viscosity approximation method introduced by Moudafi [A. Moudafi, Viscosity approximation methods for fixed-points problems, J. Math. Anal. Appl. 241 (2000) 46–55]. We give certain different control conditions for the modified Mann iteration. Strong convergence in a uniformly smooth Banach space is established. 相似文献

3.

Accelerating the convergence of value iteration by using partial transition functions

Edilson F. Arruda Fabrício O. Ourique Jason LaCombe Anthony Almudevar 《European Journal of Operational Research》2013

This work proposes an algorithm that makes use of partial information to improve the convergence properties of the value iteration algorithm in terms of the overall computational complexity. The algorithm iterates on a series of increasingly refined approximate models that converges to the true model according to an optimal linear rate, which coincides with the convergence rate of the original value iteration algorithm. The paper investigates the properties of the proposed algorithm and features a series of switchover queue examples which illustrates the efficacy of the method. 相似文献

4.

On the rate of convergence of infinite horizon discounted optimal value functions

Lars Grüne Fabian Wirth 《Nonlinear Analysis: Real World Applications》2000,1(4):545

In this paper we investigate the rate of convergence of the optimal value function of an infinite horizon discounted optimal control problem as the discount rate tends to zero. Using the Integration Theorem for Laplace transformations we provide conditions on averaged functionals along suitable trajectories yielding quadratic pointwise convergence. From this we derive under appropriate controllability conditions criteria for linear uniform convergence of the value functions on control sets. Applications of these results are given and an example is discussed in which both linear and slower rates of convergence occur depending on the cost functional. 相似文献

5.

A forward algorithm and planning horizon procedure for the production smoothing problem without inventory

Jay E. Aronson Thomas E. Morton Gerald L. Thompson 《European Journal of Operational Research》1984,15(3):348-365

We develop a forward algorithm for solving and finding planning horizons for the infinite horizon version of the deterministic production smoothing problem without inventory. This model approximates the real world situation for highly obsolescent or perishable commodities such as newspapers and fresh produce. Computational results show that the algorithm is linear in problem length while linear programming is at least quadratic. 相似文献

6.

On Newton-like iteration functions: General convergence theorems and a specific algorithm

Dr. Kenneth M. Brown J. E. Dennis Jr. 《Numerische Mathematik》1968,12(3):186-191

相似文献

7.

On the convergence of policy iteration for controlled diffusions

M. L. Puterman 《Journal of Optimization Theory and Applications》1981,33(1):137-144

The convergence of an approximation scheme known as policy iteration has been demonstrated for controlled diffusions by Fleming, Puterman, and Bismut. In this paper, we show that this approximation scheme is equivalent to the Newton-Kantorovich iteration for solving the optimality equation and exploit this equivalence to obtain a new proof of convergence. Estimates of the rate of convergence of this procedure are also obtained.This research was partially supported by NRC Grant No. A-3609. 相似文献

8.

Improvement of a convergence condition for the Durand-Kerner iteration

《Journal of Computational and Applied Mathematics》1998,96(2):117-125

The Durand-Kerner iteration is a well-known simultaneous method for approximation of (simple) zeros of a polynomial. By relating Weierstrass' correction and the minimal distance between approximations practical conditions for convergence have been obtained. These conditions also ensure the existence of isolating discs for the polynomial roots, i.e. each iteration step gives a refined set of inclusion discs. In this paper refined conditions of convergence are given. 相似文献

9.

A rolling horizon approach for disruption management of railway rolling stock

Lars Kjær Nielsen Leo Kroon Gábor Maróti 《European Journal of Operational Research》2012

This paper deals with real-time disruption management of rolling stock in passenger railway transportation. We describe a generic framework for dealing with disruptions of railway rolling stock schedules. The framework is presented as an online combinatorial decision problem, where the uncertainty of a disruption is modeled by a sequence of information updates. To decompose the problem and to reduce the computation time, we propose a rolling horizon approach: rolling stock decisions are only considered if they are within a certain time horizon from the time of rescheduling. The schedules are then revised as time progresses and new information becomes available. We extend an existing model for rolling stock scheduling to the specific requirements of the real-time situation, and we apply it in the rolling horizon framework. We perform computational tests on instances constructed from real-life cases of Netherlands Railways (NS), the main operator of passenger trains in the Netherlands. We explore the consequences of different settings of the approach for the trade-off between solution quality and computation time. 相似文献

10.

Mann迭代和具误差的Ishikawa迭代收敛性的等价定理

倪仁兴《高校应用数学学报(A辑)》2010,25(3)

在迭代参数仅满足(?) supβ_n(k/L(L+1)),(?)α_n=0和(?)α_n=+∞的条件下,用不同与于已有的方法证明了任意实Banach空间中的Lipschitz强伪压缩算子的Mann迭代和具误差的Ishikawa迭代收敛是等价的.这推广和改进了目前需假设limβ_n=0和两迭代程序的初始点的取值需相同条件下的已知结果. 相似文献

11.

On the convergence of higher-order orthogonal iteration

Yangyang Xu 《Linear and Multilinear Algebra》2018,66(11):2247-2265

The higher-order orthogonal iteration (HOOI) has been popularly used for finding a best low-multilinear rank approximation of a tensor. However, its convergence is still an open question. In this paper, we first analyse a greedy HOOI, which updates each factor matrix by selecting from the best candidates one that is closest to the current iterate. Assuming the existence of a block-nondegenerate cluster point, we establish its global iterate sequence convergence through the so-called Kurdyka–?ojasiewicz property. In addition, we show that if the starting point is sufficiently close to any block-nondegenerate globally optimal solution, the greedy HOOI produces an iterate sequence convergent to a globally optimal solution. Relating the iterate sequence by the original HOOI to that by the greedy HOOI, we then show that the original HOOI has global convergence on the multilinear subspace sequence and thus positively address the open question. 相似文献

12.

Strong convergence of the composite Halpern iteration

Xiaolong Qin Yongfu Su 《Journal of Mathematical Analysis and Applications》2008,339(2):996-1002

Let C be a closed convex subset of a uniformly smooth Banach space E and let T:C→C be a nonexpansive mapping with a nonempty fixed points set. Given a point u∈C, the initial guess x₀∈C is chosen arbitrarily and given sequences , and in (0,1), the following conditions are satisfied:

(i): ;
(ii): αn→0, βn→0 and 0<a?γn, for some a∈(0,1);
(iii): , and . Let be a composite iteration process defined by
相似文献

13.

The value iteration algorithm is not strongly polynomial for discounted dynamic programming

Eugene A. Feinberg Jefferson Huang 《Operations Research Letters》2014

This note provides a simple example demonstrating that, if exact computations are allowed, the number of iterations required for the value iteration algorithm to find an optimal policy for discounted dynamic programming problems may grow arbitrarily quickly with the size of the problem. In particular, the number of iterations can be exponential in the number of actions. Thus, unlike policy iterations, the value iteration algorithm is not strongly polynomial for discounted dynamic programming. 相似文献

14.

Lower bound for the convergence rate of nonstationary Jacobi-like iteration

A. A. Maleev 《Computational Mathematics and Mathematical Physics》2006,46(12):2031-2039

Stationary and nonstationary Jacobi-like iterative processes for solving systems of linear algebraic equations are examined. For a system whose coefficient matrix A is an H-matrix, it is shown that the convergence rate of any Jacobi-like process is at least as high as that of the point Jacobi method as applied to a system with 〈A〉 as the coefficient matrix, where 〈A〉 is a comparison matrix of A. 相似文献

15.

On the quadratic convergence of Kogbetliantz's algorithm for computing the singular value decomposition

《Linear algebra and its applications》1986

It is shown that the cyclic Kogbetliantz algorithm ultimately converges quadratically when no pathologically close singular values are present. 相似文献

16.

A wider convergence area for the MSTMAOR iteration methods for LCP

Ljiljana Cvetković Vladimir Kostić Ernest Šanca 《Numerical Algorithms》2016,73(1):77-89

In this paper, based on the Hermitian and skew-Hermitian splitting, we give a generalized modified Hermitian and skew-Hermitian splitting (GMHSS) method to solve singular complex symmetric linear systems, this method has two parameters. We give the semi-convergent conditions, and some numerical experiments are given to illustrate the efficiency of this method. 相似文献

17.

On the convergence of He's variational iteration method

Mehdi Tatari Mehdi Dehghan 《Journal of Computational and Applied Mathematics》2007

In this work we will consider He's variational iteration method for solving second-order initial value problems. We will discuss the use of this approach for solving several important partial differential equations. This method is based on the use of Lagrange multipliers for identification of optimal value of a parameter in a functional. This procedure is a powerful tool for solving the large amount of problems. Using the variational iteration method, it is possible to find the exact solution or an approximate solution of the problem. This technique provides a sequence of functions which converges to the exact solution of the problem. Our emphasis will be on the convergence of the variational iteration method. In the current paper this scheme will be investigated in details and efficiency of the approach will be shown by applying the procedure on several interesting and important models. 相似文献

18.

Improving the order of convergence od iteration functions

《Journal of Computational and Applied Mathematics》1998,98(2):311-315

Improving methods for the order of convergence of iteration functions are give,. Using these methods new third or fourth-order root-finding methods for a single equation with a multiple root are derived. Numerical examples are given. 相似文献

19.

Analyzing the convergence factor of residual inverse iteration

Elias Jarlebring Wim Michiels 《BIT Numerical Mathematics》2011,51(4):937-957

We will establish here a formula for the convergence factor of the method called residual inverse iteration, which is a method for nonlinear eigenvalue problems and a generalization of the well-known inverse iteration. The formula for the convergence factor is explicit and involves quantities associated with the eigenvalue to which the iteration converges, in particular the eigenvalue and eigenvector. Residual inverse iteration allows for some freedom in the choice of a vector w _k and we can use the formula for the convergence factor to analyze how it depends on the choice of w _k. We also use the formula to illustrate the convergence when the shift is close to the eigenvalue. Finally, we explain the slow convergence for double eigenvalues by showing that under generic conditions, the convergence factor is one, unless the eigenvalue is semisimple. If the eigenvalue is semisimple, it turns out that we can expect convergence similar to the simple case. 相似文献

20.

On the global convergence of Halley's iteration formula

M. Davies B. Dawson 《Numerische Mathematik》1975,24(2):133-135

A point-iterative process similar to, but structurally simpler than, Ostrowski's square root technique is examined. This process is shown to be globally convergent monotonically to the zeros of entire functions of genus 0 and 1 (and in certain cases of genus 2) which are real for real arguments and have only real zeros. 相似文献