期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

杨润生李树君《运筹与管理》2007,16(5):53-57

本文提出了一类带不等式约束和简单边界的非线性优化问题的非单调信赖域算法,在一定的条件下,证明了算法的全局收敛性,并通过数值实验验证了算法的合理性。相似文献

2.

A Stochastic Trust-Region Framework for Policy Optimization

Mingming Zhao Yongfeng Li & Zaiwen Wen 《计算数学(英文版)》2022,40(6):1004-1030

In this paper, we study a few challenging theoretical and numerical issues on the well known trust region policy optimization for deep reinforcement learning. The goal is to find a policy that maximizes the total expected reward when the agent acts according to the policy. The trust region subproblem is constructed with a surrogate function coherent to the total expected reward and a general distance constraint around the latest policy. We solve the subproblem using a reconditioned stochastic gradient method with a line search scheme to ensure that each step promotes the model function and stays in the trust region. To overcome the bias caused by sampling to the function estimations under the random settings, we add the empirical standard deviation of the total expected reward to the predicted increase in a ratio in order to update the trust region radius and decide whether the trial point is accepted. Moreover, for a Gaussian policy which is commonly used for continuous action space, the maximization with respect to the mean and covariance is performed separately to control the entropy loss. Our theoretical analysis shows that the deterministic version of the proposed algorithm tends to generate a monotonic improvement of the total expected reward and the global convergence is guaranteed under moderate assumptions. Comparisons with the state-of-the-art methods demonstrate the effectiveness and robustness of our method over robotic controls and game playings from OpenAI Gym. 相似文献

3.

一类带线搜索的非单调信赖域新算法

王祥陈金梅郭忠海王川龙《数学的实践与认识》2014,(11)

结合非单调信赖域方法,和非单调线搜索技术,提出了一种新的无约束优化算法.信赖域方法的每一步采用线搜索,使得迭代每一步都充分下降加快了迭代速度.在一定条件下,证明了算法具有全局收敛性和局部超线性.收敛速度.数值试验表明算法是十分有效的. 相似文献

4.

关于不等式约束的信赖域算法 总被引：3，自引：0，他引：3

郭文英《运筹学学报》2002,6(4):15-20

对于具有不等式约束的非线性优化问题，本文给出一个依赖域算法，由于算法中依赖区域约束采用向量的∞范数约束的形式，从而使子问题变二次规划，同时使算法变得更实用。在通常假设条件下，证明了算法的整体收敛性和超线性收敛性。相似文献

5.

A Smoothing Method of Global Optimization that Preserves Global Minima

Mark S. K. Lau C. P. Kwong 《Journal of Global Optimization》2006,34(3):369-398

A new smoothing method of global optimization is proposed in the present paper, which prevents shifting of global minima. In this method, smoothed functions are solutions of a heat diffusion equation with external heat source. The source helps to control the diffusion such that a global minimum of the smoothed function is again a global minimum of the cost function. This property, and the existence and uniqueness of the solution are proved using results in theory of viscosity solutions. Moreover, we devise an iterative equation by which smoothed functions can be obtained analytically for a class of cost functions. The effectiveness and potential of our method are then demonstrated with some experimental results. 相似文献

6.

A Simple Multistart Algorithm for Global Optimization 总被引：1，自引：0，他引：1

FRED J. HICKERNELL 《运筹学学报》1997,(2)

1.IntroductionConsidertheunconstrainedoptimizationproblem:findx*suchthatf(x*)~caf(x),(1)wheref(x)isanonlinearfllnctiondefinedonW"andXCR".Ourobjectiveistofindtheglobalminimizeroff(x)inthefeasibleset.Withoutassuminganyconditionsonf(x)globaloptimizationproblemsareunsolvableinthefollowingsensefnoalgorithmcanbeguaranteedtofindaglobalminimizerofageneralnonlinearfunctionwithinfinitelymanyiterations.Supposethatanalgorithmappliedtoanonlinearfunctionf(x)producesiteratesxlandterminatesafterKiterations.… 相似文献

7.

A New Version of the Price's Algorithm for Global Optimization

P. Brachetti M. De Felice Ciccoli G. Di Pillo S. Lucidi 《Journal of Global Optimization》1997,10(2):165-184

We present an algorithm for finding a global minimum of a multimodal,multivariate function whose evaluation is very expensive, affected by noise andwhose derivatives are not available. The proposed algorithm is a new version ofthe well known Price's algorithm and its distinguishing feature is that ittries to employ as much as possible the information about the objectivefunction obtained at previous iterates. The algorithm has been tested on alarge set of standard test problems and it has shown a satisfactorycomputational behaviour. The proposed algorithm has been used to solveefficiently some difficult optimization problems deriving from the study ofeclipsing binary star light curves. 相似文献

8.

一类非光滑总体极值的区间算法 总被引：1，自引：1，他引：0

申培萍张娟王燕军《数学的实践与认识》2001,31(5):539-544

本文利用区间分析知识 ,构造了一类 n维非光滑函数总体极值的区间算法 ,理论分析和实例计算均表明本文算法安全可靠 ;能求出全部总体极小点 ;收敛速度也比以前方法[1] 明显加快相似文献

9.

A New Trust-Region Algorithm for Equality Constrained Optimization 总被引：1，自引：0，他引：1

Thomas F. Coleman Jianguo Liu Wei Yuan 《Computational Optimization and Applications》2002,21(2):177-199

We present a new trust-region algorithm for solving nonlinear equality constrained optimization problems. Quadratic penalty functions are employed to obtain global convergence. At each iteration a local change of variables is performed to improve the ability of the algorithm to follow the constraint level set. Under certain assumptions we prove that this algorithm globally converges to a point satisfying the second-order necessary optimality conditions. Results of preliminary numerical experiments are reported. 相似文献

10.

求总极值的区间压缩方法

谢铁军刘任平王黎明《数学的实践与认识》2002,32(3):427-429

对求解无约束最优化问题 ,本文给出了一个区间压缩方法 .应用此方法能使函数值按几何级数收敛于 f (x)的极小值 ,并且计算量远小于郑权等人方法的计算量相似文献

11.

Value-Estimation Function Method for Constrained Global Optimization 总被引：5，自引：0，他引：5

X. L. Sun D. Li 《Journal of Optimization Theory and Applications》1999,102(2):385-409

A novel value-estimation function method for global optimization problems with inequality constraints is proposed in this paper. The value-estimation function formulation is an auxiliary unconstrained optimization problem with a univariate parameter that represents an estimated optimal value of the objective function of the original optimization problem. A solution is optimal to the original problem if and only if it is also optimal to the auxiliary unconstrained optimization with the parameter set at the optimal objective value of the original problem, which turns out to be the unique root of a basic value-estimation function. A logarithmic-exponential value-estimation function formulation is further developed to acquire computational tractability and efficiency. The optimal objective value of the original problem as well as the optimal solution are sought iteratively by applying either a generalized Newton method or a bisection method to the logarithmic-exponential value-estimation function formulation. The convergence properties of the solution algorithms guarantee the identification of an approximate optimal solution of the original problem, up to any predetermined degree of accuracy, within a finite number of iterations. 相似文献

12.

A Stochastic/Perturbation Global Optimization Algorithm for Distance Geometry Problems

Zhihong Zou Richard H. Bird Robert B. Schnabel 《Journal of Global Optimization》1997,11(1):91-105

We present a new global optimization approach for solving exactly or inexactly constrained distance geometry problems. Distance geometry problems are concerned with determining spatial structures from measurements of internal distances. They arise in the structural interpretation of nuclear magnetic resonance data and in the prediction of protein structure. These problems can be naturally formulated as global optimization problems which generally are large and difficult. The global optimization method that we present is related to our previous stochastic/perturbation global optimization methods for finding minimum energy configurations, but has several key differences that are important to its success. Our computational results show that the method readily solves a set of artificial problems introduced by Moré and Wu that have up to 343 atoms. On a set of considerably more difficult protein fragment problems introduced by Hendrickson, the method solves all the problems with up to 377 atoms exactly, and finds nearly exact solution for all the remaining problems which have up to 777 atoms. These preliminary results indicate that this approach has very good promise for helping to solve distance geometry problems. 相似文献

13.

全局优化问题的一个无参数填充函数算法

吴波高岳林《数学的实践与认识》2017,(4):170-174

填充函数法是求解全局优化问题的一种有效的确定性算法,方法的关键在于填充函数的构造.对于一般无约束优化问题提出了一个新的无参数填充函数,通过定义证明了此填充函数能保持填充性质.利用其理论性质设计了相应的算法并对几个经典的算例进行了数值实验,实验结果表明算法有效可行. 相似文献

14.

一类齐次多项式优化问题的一个全局优化算法

王宜举《应用数学学报》2010,33(2)

对图像与信号处理中遇到的一类齐次多项式优化问题,本文首先借助平移技术将目标函数转化为凸函数,然后结合初始点技术提出了求解该类问题的一个全局优化算法.与求解该类问题的幂方法相比,本文给出的方法不但能在一般情形下保证算法的全局收敛性,而且数值结果表明在多数情况下可以得到问题的一个全局最优值解. 相似文献

15.

A Continuous Genetic Algorithm Designed for the Global Optimization of Multimodal Functions 总被引：10，自引：0，他引：10

R. Chelouah P. Siarry 《Journal of Heuristics》2000,6(2):191-213

Genetic algorithms are stochastic search approaches based on randomized operators, such as selection, crossover and mutation, inspired by the natural reproduction and evolution of the living creatures. However, few published works deal with their application to the global optimization of functions depending on continuous variables.A new algorithm called Continuous Genetic Algorithm (CGA) is proposed for the global optimization of multiminima functions. In order to cover a wide domain of possible solutions, our algorithm first takes care over the choice of the initial population. Then it locates the most promising area of the solution space, and continues the search through an intensification inside this area. The selection, the crossover and the mutation are performed by using the decimal code. The efficiency of CGA is tested in detail through a set of benchmark multimodal functions, of which global and local minima are known. CGA is compared to Tabu Search and Simulated Annealing, as alternative algorithms. 相似文献

16.

非线性优化的非单调信赖域算法及全局收敛性

TongXiaojiao ZhouShuzi 《高校应用数学学报(英文版)》2000,15(2):201-210

Abstract. A trust region algorithm for equality constrained optimization is given in this paper.The algorithm does not enforce strict monotonicity of the merit function for every iteration.Global convergence of the algorithm is proved under the same conditions of usual trust regionmethod. 相似文献

17.

The Hyperbell Algorithm for Global Optimization: A Random Walk Using Cauchy Densities

Pierre Courrieu 《Journal of Global Optimization》1997,10(1):37-55

This article presents a new algorithm, called theHyperbell Algorithm, that searches for the global extrema ofnumerical functions of numerical variables. The algorithm relies on theprinciple of a monotone improving random walk whose steps aregenerated around the current position according to a gradually scaleddown Cauchy distribution. The convergence of the algorithm is provenand its rate of convergence is discussed. Its performance is tested onsome hard test functions and compared to that of other recentalgorithms and possible variants. An experimental study of complexityis also provided, and simple tuning procedures for applications areproposed. 相似文献

18.

非线性比式和问题的全局优化算法

焦红伟郭运瑞陈永强《数学季刊》2008,23(4)

In this paper,a global optimization algorithm is proposed for nonlinear sum of ratios problem(P).The algorithm works by globally solving problem(P1) that is equivalent to problem(P),by utilizing linearization technique a linear relaxation programming of the (P1) is then obtained.The proposed algorithm is convergent to the global minimum of(P1) through the successive refinement of linear relaxation of the feasible region of objective function and solutions of a series of linear relaxation programming.Nume... 相似文献

19.

Convergence of a Simulated Annealing Algorithm for Continuous Global Optimization

M. Locatelli 《Journal of Global Optimization》2000,18(3):219-233

In this paper a simulated annealing algorithm for continuous global optimization will be considered. The algorithm, in which a cooling schedule based on the distance between the function value in the current point and an estimate of the global optimum value is employed, has been first introduced in Bohachevsky, Johnson and Stein (1986) [2], but without any proof of convergence. Here it will be proved that, under suitable assumptions, the algorithm is convergent 相似文献

20.

一类LC1规划问题的超线性收敛ODE信赖域算法

欧宜贵侯定丕《数学季刊》2003,18(2):140-145

In this paper, a new trust region algorithm for unconstrained LC1 optimization problems is given. Compare with those existing trust regiion methods, this algorithm has a different feature: it obtains a stepsize at each iteration not by soloving a quadratic subproblem with a trust region bound, but by solving a system of linear equations. Thus it reduces computational complexity and improves computation efficiency. It is proven that this algorithm is globally convergent and locally superlinear under some conditions. 相似文献