首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
We show the existence of almost stationary -equilibria, for all > 0, in zero-sum stochastic games with finite state and action spaces. These are -equilibria with the property that, if neither player deviates, then stationary strategies are played forever with probability almost 1. The proof is based on the construction of specific stationary strategy pairs, with corresponding rewards equal to the value, which can be supplemented with history-dependent -optimal strategies, with small > 0, in order to obtain almost stationary -equilibria.  相似文献   

2.
We present existence and uniqueness results for a hierarchical or Stackelberg equilibrium in a two-player differential game with open-loop information structure. There is a known convexity condition ensuring the existence of a Stackelberg equilibrium, which was derived by Simaan and Cruz (Ref. 1). This condition applies to games with a rather nonconflicting structure of their cost criteria. By another approach, we obtain here new sufficient existence conditions for an open-loop equilibrium in terms of the solvability of a terminal-value problem of two symmetric Riccati differential equations and a coupled system of Riccati matrix differential equations. The latter coupled system appears also in the necessary conditions, but contrary to the above as a boundary-value problem. In case that the convexity condition holds, both symmetric equations are of standard type and admit globally a positive-semidefinite solution. But the conditions apply also to more conflicting situations. Then, the corresponding Riccati differential equations may be of H-type. We obtain also different uniqueness conditions using a Lyapunov-type approach. The case of time-invariant parameters is discussed in more detail and we present a numerical example.  相似文献   

3.
This paper considers a class of risk-sensitive stochastic nonzero-sum differential games with parametrized nonlinear dynamics and parametrized cost functions. The parametrization is such that, if all or some of the parameters are set equal to some nominal values, then the differential game either becomes equivalent to a risk-sensitive stochastic control (RSSC) problem or decouples into several independent RSSC problems, which in turn are equivalent to a class of stochastic zero-sum differential games. This framework allows us to study the sensitivity of the Nash equilibrium (NE) of the original stochastic game to changes in the values of these parameters, and to relate the NE (generally difficult to compute and to establish existence and uniqueness, at least directly) to solutions of RSSC problems, which are relatively easier to obtain. It also allows us to quantify the sensitivity of solutions to RSSC problems (and thereby nonlinear H-control problems) to unmodeled subsystem dynamics controlled by multiple players.  相似文献   

4.
Metric Characterizations of Tikhonov Well-Posedness in Value   总被引:1,自引:0,他引:1  
In this paper, we discuss and give metric characterizations of Tikhonov well-posedness in value for Nash equilibria. Roughly speaking, Tikhonov well-posedness of a problem means that approximate solutions converge to the true solution when the degree of approximation goes to zero. If we add to the condition of -equilibrium that of -closeness in value to some Nash equilibrium, we obtain Tikhonov well-posedness in value, which we have defined in a previous paper. This generalization of Tikhonov well-posedness has the remarkable property of ordinality; namely, it is preserved under monotonic transformations of the payoffs. We show that a metric characterization of Tikhonov well-posedness in value is not possible unless the set of Nash equilibria is compact and nonempty.  相似文献   

5.
In Ref. 1, the author claimed that the problem y=y 3 is soluble only for a certain range of the parameter . An analytic approach, as adopted in the following contribution, reveals that a unique solution exists for any positive value of . The solution is given in closed form by means of Jacobian elliptic functions, which can be numerically computed very efficiently. In the limit 0+, the solutions exhibit boundary-layer behavior at both endpoints. An easily interpretable approximate solution for small is obtained using a three-variable approach.  相似文献   

6.
We are concerned with -mixed solutions for weak Stackelberg problems corresponding to two-player nonzero-sum noncooperative games. Two cases are considered: (i) mixed strategies for only the second player; (ii) mixed strategies for both players. After giving basic results relating convergence of functions and weak convergence of probability measures, we establish existence and stability results for -mixed solutions under general assumptions of minimal character without any convexity assumption. Our results improve previous work of Mallozzi and Morgan (Refs. 1–2).  相似文献   

7.
We shall derive existence, uniqueness and comparison results for the functional differential equationx(t)=f(t, x), a. e.tI, with classical Nicoletti boundary conditionsx i(ti)=y iX, iA, whereI is a real interval,A is a nonempty set andX is a Banach space.  相似文献   

8.
In this paper, the effect on values and optimal strategies of perturbations of game parameters (payoff function, transition probability function, and discount factor) is studied for the class of zero-sum games in normal form and for the class of stationary, discounted, two-person, zero-sum stochastic games.A main result is that, under certain conditions, the value depends on these parameters in a pointwise Lipschitz continuous way and that the sets of -optimal strategies for both players are upper semicontinuous multifunctions of the game parameters.Extensions to general-sum games and nonstationary stochastic games are also indicated.  相似文献   

9.
Most existing interior-point methods for a linear complementarity problem (LCP) require the existence of a strictly feasible point to guarantee that the iterates are bounded. Based on a regularized central path, we present an infeasible interior-point algorithm for LCPs without requiring the strict feasibility condition. The iterates generated by the algorithm are bounded when the problem is a P * LCP and has a solution. Moreover, when the problem is a monotone LCP and has a solution, we prove that the convergence rate is globally linear and it achieves `-feasibility and `-complementarity in at most O(n 2 ln(1/`)) iterations with a properly chosen starting point.  相似文献   

10.
This paper is strictly related to Ref. 1. A pursuit-evasion game described in part by the system and is considered. The state variablesx andy are restricted, in the sense that (x(t),t) N 1 and (y(t),t) N 2. The existence of a value in the sense of Varaiya and Lin is proved under the assumption that the sets of all admissible trajectories for the two players are compact and the lower value is not greater than the upper value.  相似文献   

11.
Using the concept for -efficient solutions introduced in Refs. 1 and 2, we extend the Ekeland variational principle and another variational principle given in Ref. 3 to vector-valued objective functions. This enables us to establish a kind of well-posedness for the resulting perturbed vector optimization problems. Based on this definition of -efficiency, we also formulate a concept for level sets and prove some results about the Kuratowski limits of level sets.This research was supported by the Bulgarian Ministry of Education and Science and by the Deutsche Forschungsgemeinschaft.  相似文献   

12.
Mixed strategy -equilibrium points are given forN-person games with cost functions consisting of quadratic, bilinear, and linear terms and strategy spaces consisting of closed balls in Hilbert spaces. The results are applied to linear-quadratic differential games with no information and quadratic integral constraints on the control functions.This work was supported by a Commonwealth of Australia, Postgraduate Research Award.  相似文献   

13.
We consider a system of two semilinear differential inclusions with infinitesimal generators of C 0-semigroups. The nonlinear terms are of high frequency with respect to time and periodic with a specified period. Moreover, they are condensing in the state variables (x,y) with respect to a suitable measure of noncompactness. The goal of the paper is to give sufficient conditions to guarantee, for >0 sufficiently small, the existence of periodic solutions and to study their behaviour as 0. The main tool to achieve this is the topological degree theory for uppersemicontinuous, condensing vector fields.  相似文献   

14.
New formulations of the inverse nonstationary Stefan problems are considered: (a) forx [0,1] (the inverse problem IP1; (b) forx [0, (t)] with a degenerate initial condition (the inverse problem IP). Necessary conditions for the existence and uniqueness of a solution to these problems are formulated. On the first phase {x [0, y(t)]{, the solution of the inverse problem is found in the form of a series; on the second phase {x [y(t), 1] orx [y(t), (t)]{, it is found as a sum of heat double-layer potentials. By representing the inverse problem in the form of two connected boundary-value problems for the heat conduction equation in the domains with moving boundaries, it can be reduced to the integral Volterra equations of the second kind. An exact solution of the problem IP is found for the self similar motion of the boundariesx=y(t) andx=(t).Translated from Ukrainskii Matematicheskii Zhurnal, Vol. 45, No. 8, pp. 1058–1065, August, 1993.  相似文献   

15.
This paper studies denumerable state continuous-time controlled Markov chains with the discounted reward criterion and a Borel action space. The reward and transition rates are unbounded, and the reward rates are allowed to take positive or negative values. First, we present new conditions for a nonhomogeneous Q(t)-process to be regular. Then, using these conditions, we give a new set of mild hypotheses that ensure the existence of -optimal (0) stationary policies. We also present a martingale characterization of an optimal stationary policy. Our results are illustrated with controlled birth and death processes.  相似文献   

16.
We define the degree of a family of convex functionsf x, wherex , and prove that, if this degree is different from zero, one has at least one Nash equilibrium point in . This is a criterion of localization of such equilibrium points.  相似文献   

17.
Campbell  Alain  Nazarov  Sergueï A. 《Positivity》2001,5(3):275-295
Different types of asymptotic expansions are constructed and justified for eigenvalues of the Dirichlet problem for the biharmonic operator in a plane domain with a small hole of the diameter (the Kirchhoff-Love plate clamped at ). Depending on properties of an eigenfunction of the limiting problem, the expansions happen either to be series in powers of , or to contain terms, holomorphic in | ln |–1  相似文献   

18.
We give efficiency estimates for proximal bundle methods for finding f*minXf, where f and X are convex. We show that, for any accuracy <0, these methods find a point xkX such that f(xk)–f* after at most k=O(1/3) objective and subgradient evaluations.  相似文献   

19.
LetX denote a reflexive Banach space and {A(t)|t[0,T]} a time dependent family of accretive operators defined onX. Conditions are placed on {A(t)|t[0,T]} which are sufficient to guarantee the existence of solutions to the Cauchy initial value problem:u(t,x)+A(t)u(t,x)=0; u(0,x)=x. These solutions are obtained via the method of product integration; however limits for the infinite product are taken with respect to the weak topology.  相似文献   

20.
LetY = (X, {R i } oid) denote aP-polynomial association scheme. By a kite of lengthi (2 i d) inY, we mean a 4-tuplexyzu (x, y, z, u X) such that(x, y) R 1,(x, z) R 1,(y, z) R 1,(u, y) R i–1,(u, z) R i–1,(u, x) R i. Our main result in this paper is the following.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号