期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

On-line solution of a stochastic pursuit-evasion game

C. T. Leondes B. Mons 《Journal of Optimization Theory and Applications》1979,28(3):411-428

In this paper, the concepts of prior-commitment and delayed-commitment strategies for zero-sum, linear-quadratic differential games with noise-corrupted measurements are applied to a detailed example of a pursuit-evasion game. Quasilinearization is used to solve the nonlinear two-point boundary-value problem of the prior-commitment game. A closed form solution is obtained for the delayed-commitment strategies. Comparison of the payoff functionals confirms the relationships discussed in Ref. 1. 相似文献

2.

Stochastic Discrete-Time Nash Games with Constrained State Estimators 总被引：3，自引：0，他引：3

Kian A.R. Cruz J.B. Simaan M. A. 《Journal of Optimization Theory and Applications》2002,114(1):171-188

In this paper, we consider stochastic linear-quadratic discrete-time Nash games in which two players have access only to noise-corrupted output measurements. We assume that each player is constrained to use a linear Kalman filter-like state estimator to implement his optimal strategies. Two information structures available to the players in their state estimators are investigated. The first has access to one-step delayed output and a one-step delayed control input of the player. The second has access to the current output and a one-step delayed control input of the player. In both cases, statistics of the process and statistics of the measurements of each player are known to both players. A simple example of a two-zone energy trading system is considered to illustrate the developed Nash strategies. In this example, the Nash strategies are calculated for the two cases of unlimited and limited transmission capacity constraints. 相似文献

3.

A pursuit-evasion differential game with noisy measurements of the evader's bearing from the pursuer

Y. Yavin 《Journal of Optimization Theory and Applications》1986,51(1):161-177

A stochastic pursuit-evasion differential game involving two players, E and P, moving in the plane is considered. It is assumed that player E (the evader) has complete observation of the position and velocity of player P, whereas player P (the pursuer) can measure the distanced (P, E) between P and E but receives noise-corrupted measurements of the bearing of E from P. Three cases are dealt with: (a) using the noise-corrupted measurements of , player P applies the proportional navigation guidance law; (b) P has complete observation ofd (P, E) and (this case is treated for the sake of completeness); (c) using the noise-corrupted measurements of , P applies an erroneous line-of sight guidance law. For each of the cases, sufficient conditions on optimal strategies are derived. In each of the cases, these conditions require the solution of a nonlinear partial differential equation on a in ². Finally, optimal strategies are computed by solving the corresponding equations numerically. 相似文献

4.

State transformations and the derivation of Nash closed-loop equilibria for non-zero-sum differential games

A Mehlmann 《Applied Mathematical Modelling》1985,9(5):353-357

In this paper the usefulness of state transformations in differential games is demonstrated. It is shown that different (admissible) state transformations give rise to different closed-loop Nash equilibrium candidates, which may all be found by solving systems of ordinary differential equations. A linear-quadratic duopoly differential game is solved to illustrate the results. 相似文献

5.

Upper Bounds of the Pursuer Control Based on a Linear-Quadratic Differential Game 总被引：2，自引：2，他引：0

Turetsky V. 《Journal of Optimization Theory and Applications》2004,121(1):163-191

Differential game formulations provide an adequate basis for a guidance law synthesis against highly maneuvering targets. This paper deals with a guidance law based on a linear-quadratic differential game formulation. This guidance law has many attractive properties: it is continuous, linear with respect to the state variables, and its gain coefficients can be precalculated offline. Nevertheless, due to the lack of hard control constraints in the formulation, the magnitude of the control can exceed the admissible level imposed by the nature of the problem. In this paper, the upper bound of the interceptor control is obtained depending on the system parameters and the penalty coefficients of the game performance index. It is shown that the interceptor can guarantee an arbitrarily small miss distance without exceeding the control constraints if it has sufficient maneuverability and if the penalty coefficients are chosen properly. By manipulating the penalty coefficients, it is possible to reduce significantly the maneuverability requirements compared to the case of zero interceptor penalty coefficient. 相似文献

6.

Linear-quadratic two-person zero-sum differential games,necessary and sufficient conditions: Comment

P. Bernhard 《Journal of Optimization Theory and Applications》1980,31(2):283-284

We give a simpler, easier-to-check, version of the theorem of the paper referred to, i.e., a necessary and sufficient condition for the existence of a saddle point to the linear-quadratic two-person zero-sum perfect information differential game. 相似文献

7.

On the Isaacs equation of differential games for descriptor systems

H. Xu K. Mizukami 《Journal of Optimization Theory and Applications》1994,83(2):405-419

相似文献

8.

Mixed strategy solutions for quadratic games

D. J. Wilson 《Journal of Optimization Theory and Applications》1974,13(3):319-333

Mixed strategy solutions are given for two-person, zero-sum games with payoff functions consisting of quadratic, bilinear, and linear terms, and strategy spaces consisting of closed balls in a Hilbert space. The results are applied to linear-quadratic differential games with no information, and with quadratic integral constraints on the control functions. 相似文献

9.

Robust Equilibria in Indefinite Linear-Quadratic Differential Games 总被引：1，自引：0，他引：1

van den Broek W. A. Engwerda J. C. Schumacher J. M. 《Journal of Optimization Theory and Applications》2003,119(3):565-595

Equilibria in dynamic games are formulated often under the assumption that the players have full knowledge of the dynamics to which they are subject. Here, we formulate equilibria in which players are looking for robustness and take model uncertainty explicitly into account in their decisions. Specifically, we consider feedback Nash equilibria in indefinite linear-quadratic differential games on an infinite time horizon. Model uncertainty is represented by a malevolent input which is subject to a cost penalty or to a direct bound. We derive conditions for the existence of robust equilibria in terms of solutions of sets of algebraic Riccati equations. 相似文献

10.

Asymptotic Analysis of Linear Feedback Nash Equilibria in Nonzero-Sum Linear-Quadratic Differential Games 总被引：2，自引：0，他引：2

A. J. T. M. Weeren J. M. Schumacher J. C. Engwerda 《Journal of Optimization Theory and Applications》1999,101(3):693-722

In this paper, we discuss nonzero-sum linear-quadratic differential games. For this kind of games, the Nash equilibria for different kinds of information structures were first studied by Starr and Ho. Most of the literature on the topic of nonzero-sum linear-quadratic differential games is concerned with games of fixed, finite duration; i.e., games are studied over a finite time horizon t _f. In this paper, we study the behavior of feedback Nash equilibria for t _f.In the case of memoryless perfect-state information, we study the so-called feedback Nash equilibrium. Contrary to the open-loop case, we note that the coupled Riccati equations for the feedback Nash equilibrium are inherently nonlinear. Therefore, we limit the dynamic analysis to the scalar case. For the special case that all parameters are scalar, a detailed dynamical analysis is given for the quadratic system of coupled Riccati equations. We show that the asymptotic behavior of the solutions of the Riccati equations depends strongly on the specified terminal values. Finally, we show that, although the feedback Nash equilibrium over any fixed finite horizon is generically unique, there can exist several different feedback Nash equilibria in stationary strategies for the infinite-horizon problem, even when we restrict our attention to Nash equilibria that are stable in the dynamical sense. 相似文献

11.

均值-方差准则下具有负债的随机微分博弈

杨鹏王震孙卫《经济数学》2016,(1):25-29

研究了均值-方差准则下,具有负债的随机微分博弈.研究目标是:在终值财富的均值等于k的限制下,在市场出现最坏的情况下找到最优的投资策略使终值财富的方差最小.即:基于均值-方差随机微分博弈的投资组合选择问题.使用线性-二次控制的理论解决了该问题,获得了最优的投资策略、最优市场策略和有效边界的显示解.并通过对所得结果进行进一步分析,在经济上给出了进一步的解释.通过本文的研究,可以指导金融公司在面临负债和金融市场情况恶劣时,选择恰当的投资策略使自身获得一定的财富而面临的风险最小. 相似文献

12.

基于再保险和投资的随机微分博弈

下载免费PDF全文

杨鹏《数学杂志》2014,34(4):779-786

本文研究了具有再保险和投资的随机微分博弈.应用线性-二次控制的理论,在指数效用和幂效用下,求得了最优再保险策略、最优投资策略、最优市场策略和值函数的显示解,推广了文[8]的结果.通过本文的研究,当市场出现最坏的情况时,可以指导保险公司选择恰当的再保险和投资策略使自身所获得的财富最大化. 相似文献

13.

Some thoughts on saddle-point conditions and information structures in zero-sum differential games

Basar T. 《Journal of Optimization Theory and Applications》1976,18(1):165-170

For a very simple two-stage, linear-quadratic, zero-sum difference game with dynamic information structure, we show that (i) there exist nonlinear saddle-point strategies which require the same existence conditions as the well-known linear, closed-loop, no-memory solution and (ii) there exist both linear and nonlinear saddle-point strategies which require more stringent conditions than the unique open-loop solution. We then discuss the implication of this result with respect to the existence of saddle points in zero-sum differential games for different information patterns. 相似文献

14.

Robust State-Feedback Controllability of Linear Systems to a Hyperplane in a Class of Bounded Controls

V. Turetsky V. Y. Glizer 《Journal of Optimization Theory and Applications》2004,123(3):639-667

The paper deals with a linear time-dependent dynamic system with scalar control and input uncertainty (disturbance). Two admissible classes of input uncertainty realizations are considered: the class of measurable bounded functions and the class of measurable quadratically integrable functions. The problem to be studied is the existence of a state feedback control with measurable bounded time realizations transferring the system to a given hyperplane (a target set) from any initial position in a prescribed time for any admissible input uncertainty realization. Necessary and sufficient conditions for the existence of such a control are derived, based on the explicit construction of this control by using an auxilary zero-sum linear-quadratic differential game with a cheap control for the minimizing player. Examples illustrting the theoritical results are presented. 相似文献

15.

Saddle-point equilibrium sequence in one class of singular infinite horizon zero-sum linear-quadratic differential games with state delays

Valery Y. Glizer 《Optimization》2019,68(1):349-384

ABSTRACT

We consider an infinite horizon zero-sum linear-quadratic differential game with state delays in the dynamics. The cost functional of this game does not contain a control cost of the minimizing player (the minimizer), meaning that the considered game is singular. For this game, definitions of the saddle-point equilibrium and the game value are proposed. These saddle-point equilibrium and game value are obtained by a regularization of the singular game. Namely, we associate this game with a new differential game for the same equation of dynamics. The cost functional in the new game is the sum of the original cost functional and an infinite horizon integral of the square of the minimizer's control with a small positive weight coefficient. This new game is regular, and it is a cheap control game. An asymptotic analysis of this cheap control game is carried out. Using this asymptotic analysis, the existence of the saddle-point equilibrium and the value of the original game is established, and their expressions are derived. Illustrative example is presented. 相似文献

16.

ͨ��Ӱ��»��΢�ֲ��ĵ��ٱ��պ�Ͷ��

�� 《应用概率统计》2016,32(2):147-156

??Under inflation influence, this paper investigate a stochastic differential game with reinsurance and investment. Insurance company chose a strategy to minimizing the variance of the final wealth, and the financial markets as a game ``virtual hand' chosen a probability measure represents the economic ``environment' to maximize the variance of the final wealth. Through this double game between the insurance companies and the financial markets, get optimal portfolio strategies. When investing, we consider inflation, the method of dealing with inflation is: Firstly, the inflation is converted to the risky assets, and then constructs the wealth process. Through change the original based on the mean-variance criteria stochastic differential game into unrestricted cases, then application linear-quadratic control theory obtain optimal reinsurance strategy and investment strategy and optimal market strategy as well as the closed form expression of efficient frontier are obtained; finally get reinsurance strategy and optimal investment strategy and optimal market strategy as well as the closed form expression of efficient frontier for the original stochastic differential game. 相似文献

17.

Magnus integrators for solving linear-quadratic differential games

Sergio Blanes Enrique Ponsoda 《Journal of Computational and Applied Mathematics》2012

We consider Magnus integrators to solve linear-quadratic N

N

-player differential games. These problems require to solve, backward in time, non-autonomous matrix Riccati differential equations which are coupled with the linear differential equations for the dynamic state of the game, to be integrated forward in time. We analyze different Magnus integrators which can provide either analytical or numerical approximations to the equations. They can be considered as time-averaging methods and frequently are used as exponential integrators. We show that they preserve some of the most relevant qualitative properties of the solution for the matrix Riccati differential equations as well as for the remaining equations. The analytical approximations allow us to study the problem in terms of the parameters involved. Some numerical examples are also considered which show that exponential methods are, in general, superior to standard methods. 相似文献

18.

A note on clemhout and Wan's dynamic games of common property resources

C. Plourde D. Yeung 《Journal of Optimization Theory and Applications》1987,55(2):327-331

Recently, Clemhout and Wan (Ref. 1) provided solutions to a class of stochastic differential game models concerning common property resources. However, they only established closed-loop strategies for the infinite-horizon problem using the Kushner test. In this paper, we provide closed-loop feedback solutions to their game in a finite-time horizon with deterministic evolution dynamics.The helpful comments of an anonymous referee are gratefully acknowledged. 相似文献

19.

Linear-quadratic,two-person,zero-sum differential games: Necessary and sufficient conditions

P. Bernhard 《Journal of Optimization Theory and Applications》1979,27(1):51-69

We consider linear-quadratic, two-person, zero-sum perfect information differential games, possibly with a linear target set. We show a necessary and sufficient condition for the existence of a saddle point, within a wide class of causal strategies (including, but not restricted to, pure state feedbacks). The main result is that, when they exist, the optimal strategies are pure feedbacks, given by the classical formulas suitably extended, and that existence may be obtained even in the presence of a conjugate point within the time interval, provided it is of a special type that we calleven.The partial support of the Trieste Unit of the GNAS, Italian CNR, is gratefully acknowledged. 相似文献

20.

Application of stochastic differential games to medium-range air-to-air missiles

Y. Yavin R. De Villiers 《Journal of Optimization Theory and Applications》1990,67(2):355-367

Stochastic differential game techniques are applied to compare the performance of a medium-range air-to-air missile for three different thrust-mass profiles. The measure of the performance of the missile is the probability that it will reach a lock-on point with a favorable range of guidance and flight parameters during a fixed time interval [0,t _f]. 相似文献