Convergence analysis of gradient descent stochastic algorithms
Authors: A. Shapiro (1), Y. Wardi (2)
Institution: (1) School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, Georgia; (2) School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, Georgia
Abstract: This paper proves convergence of a sample-path based stochastic gradient-descent algorithm for optimizing expected-value performance measures in discrete event systems. The algorithm uses increasing precision at successive iterations, and it moves against the direction of a generalized gradient of the computed sample performance function. Two convergence results are established: one for the case where the expected-value function is continuously differentiable, and the other for the case where that function is nondifferentiable but the sample performance functions are convex. The proofs are based on a version of the uniform law of large numbers which is provable for many discrete event systems where infinitesimal perturbation analysis is known to be strongly consistent.
Keywords: Gradient descent; subdifferentials; uniform laws of large numbers; infinitesimal perturbation analysis; discrete event dynamic systems
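
The scheme described in the abstract (descent along the negative of a generalized gradient of a sample performance function, with precision increasing across iterations) can be illustrated with a small sketch. This is not the authors' algorithm: the abstract does not state the step-size rule or the precision schedule, so the 1/k step size, the sample size 10k, and the toy objective f(x) = E|x - Z| with Z standard normal are all assumptions chosen for illustration. The function names are hypothetical. In this toy problem each sample function |x - z| is convex and nondifferentiable, while the expected value is minimized at the median of Z, which is 0.

```python
import random


def sample_subgradient(x, samples):
    """Return one subgradient of the sample average mean(|x - z_i|) at x.

    Each term contributes sign(x - z_i); at a kink (x == z_i) the
    value 0 is used, which is a valid subgradient of |x - z_i|.
    """
    total = 0.0
    for z in samples:
        if x > z:
            total += 1.0
        elif x < z:
            total -= 1.0
    return total / len(samples)


def sample_path_descent(x0, iterations=200, seed=0):
    """Toy descent with a sample size that grows with the iteration count.

    Assumptions (not taken from the paper): step size 1/k and
    sample size 10*k at iteration k.
    """
    rng = random.Random(seed)
    x = x0
    for k in range(1, iterations + 1):
        n_k = 10 * k  # increasing precision: more samples at later iterations
        samples = [rng.gauss(0.0, 1.0) for _ in range(n_k)]
        g = sample_subgradient(x, samples)
        x -= (1.0 / k) * g  # move against the subgradient direction
    return x


if __name__ == "__main__":
    print(sample_path_descent(3.0))
```

Started from x0 = 3.0, the iterate should drift toward 0, the minimizer of the expected-value function in this toy setup. Growing the sample size reduces the noise in the subgradient estimate at later iterations, which is the role "increasing precision" plays in the abstract's description.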
This article is indexed in SpringerLink and other databases.