Similar documents
Found 20 similar documents (search time: 15 ms)
1.
Zero-Sum Stochastic Games with Partial Information (total citations: 1; self-citations: 0; citations by others: 1)
We study a zero-sum stochastic game on a Borel state space where the state of the game is not known to the players. Both players take their decisions based on an observation process. We transform this into an equivalent problem with complete information. Then, we establish the existence of a value and optimal strategies for both players.

2.
In this paper, we study the impact of informativeness on the performance of linear quadratic Gaussian Nash and Stackelberg games. We first show that, in two-person static Nash games, if one of the players acquires more information, then this extra information is beneficial to him, provided that it is orthogonal to both players' information. As a special case, when one of the players is informationally stronger than the other, any new information is beneficial to him. We then show that a similar result holds for dynamic Nash games. In the dynamic games, the players use strategies that are linear functions of the current estimates of the state, generated by two Kalman filters. The same properties are proved to hold in static and feedback Stackelberg games as well. This work was partially supported by the US Air Force Office of Scientific Research under Grant No. AFOSR-82-0174.

3.
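Building on the descriptions in [1,2] of the Shapley function for cooperative games with static and dynamic structures on matroids, this paper studies the Banzhaf function on two classes of matroids. By giving the corresponding axiom systems, we establish the existence and uniqueness of the Banzhaf function on both classes of matroids, which extends the scope of research on allocation indices on matroids. We also discuss related properties of the Banzhaf function on the two classes of cooperative games. Finally, a numerical example illustrates the players' Banzhaf indices in this kind of cooperative game.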
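As a rough illustration of the index this abstract refers to, the sketch below computes the classical (non-normalized) Banzhaf value of each player in a small weighted majority game. It is only a sketch under stated assumptions: it averages marginal contributions over all coalitions, whereas the paper restricts attention to coalitions that are feasible in a matroid. The function names, the weights, and the quota are hypothetical and do not come from the paper.

```python
from itertools import combinations


def banzhaf_values(players, v):
    """Classical Banzhaf value: average marginal contribution of each
    player over all coalitions of the remaining players.

    Assumption: every coalition is feasible (no matroid restriction).
    `v` maps a frozenset of players to the coalition's worth.
    """
    n = len(players)
    values = {}
    for i in players:
        others = [p for p in players if p != i]
        total = 0.0
        for r in range(len(others) + 1):
            for coalition in combinations(others, r):
                s = frozenset(coalition)
                total += v(s | {i}) - v(s)
        values[i] = total / 2 ** (n - 1)
    return values


def majority_game(weights, quota):
    """Hypothetical weighted majority game: a coalition is worth 1
    if its total weight reaches the quota, and 0 otherwise."""
    def v(s):
        return 1.0 if sum(weights[p] for p in s) >= quota else 0.0
    return v


# Illustrative data only: three players with weights 2, 1, 1 and quota 3.
weights = {"a": 2, "b": 1, "c": 1}
result = banzhaf_values(list(weights), majority_game(weights, quota=3))
print(result)
```

In this toy game player a is decisive in three of the four coalitions of the other players, while b and c are each decisive in one, which is what the computed values reflect.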

4.
In this paper, a large class of time-varying Riccati equations arising in stochastic dynamic games is considered. The problem of the existence and uniqueness of a globally defined solution, namely the bounded and stabilizing solution, is investigated. As an application of the existence results obtained, we then address the infinite-horizon two-player zero-sum linear quadratic (LQ) dynamic game for a stochastic discrete-time dynamical system subject to both random switching of its coefficients and multiplicative noise. We show that the unique bounded and stabilizing solution of the considered class of generalized Riccati equations plays a crucial role in the solution of this optimal control problem.

5.
Robust Equilibria in Indefinite Linear-Quadratic Differential Games (total citations: 1; self-citations: 0; citations by others: 1)
Equilibria in dynamic games are often formulated under the assumption that the players have full knowledge of the dynamics to which they are subject. Here, we formulate equilibria in which players are looking for robustness and take model uncertainty explicitly into account in their decisions. Specifically, we consider feedback Nash equilibria in indefinite linear-quadratic differential games on an infinite time horizon. Model uncertainty is represented by a malevolent input which is subject to a cost penalty or to a direct bound. We derive conditions for the existence of robust equilibria in terms of solutions of sets of algebraic Riccati equations.

6.
7.
A searchlight game is a two-person zero-sum dynamic game of the pursuit-evasion type in which at least one of the two players has a searchlight. A searchlight can be flashed a given number of times within a fixed time period and the objective is to catch the opponent in the region illuminated by the flash. Olsder and Papavassilopoulos instituted the study of these games and, in this paper, we supplement their results, obtaining a closed formula for the value and optimal strategies for the players in their basic game.

8.
We consider a multi-objective control problem of time-discrete systems with given starting and final states. The dynamics of the system are controlled by p actors (players). Each of the players intends to minimize his own integral-time cost of the system's transitions using a certain admissible trajectory. Nash equilibrium conditions are derived and algorithms for solving dynamic games in positional form are proposed in this paper. The existence theorem for Nash equilibria is related to the introduction of an auxiliary dynamic c-game. Stationary and non-stationary cases are described. The paper concludes with a complexity analysis for that decision process.

9.
A differential game of prescribed duration with general-type phase constraints is investigated. The existence of a value in the Varaiya-Lin sense and an optimal strategy for one of the players is obtained under assumptions ensuring that the sets of all admissible trajectories for the two players are compact in the Banach space of all continuous functions. These results are then extended to more general games, examined earlier by Varaiya. The author wishes to express his thanks to an anonymous reviewer for his many valuable comments.

10.
For a noncooperative differential game, the value functions of the various players satisfy a system of Hamilton-Jacobi equations. In the present paper, we consider a class of infinite horizon games with nonlinear costs exponentially discounted in time. By the analysis of the value functions, we establish the existence of Nash equilibrium solutions in feedback form and provide results and counterexamples on their uniqueness and stability.

11.
12.
Stackelberg games, which were originally introduced by Stackelberg, are widely applied in such fields as economics, management, politics and behavioral sciences. Stackelberg games can be modelled as a bi-level optimization problem. There exists an extensive literature about static bi-level optimization problems. However, studies of dynamic bi-level optimization problems are fairly scarce in spite of their importance in explaining and predicting some phenomena rationally. In this paper, we consider discrete time dynamic Stackelberg games with feedback information. In general, the lower-level strategies are non-unique in practice. For a unique solution, dynamic programming algorithms have been presented with multiple players. We revisit dynamic programming for feedback information dynamic Stackelberg games with non-unique lower-level solutions. First, we define several kinds of solutions related to the decision styles. Then, we analyze each of them in turn. Moreover, the dynamic programming algorithm is successful in solving feedback information dynamic Stackelberg games with non-unique lower-level solutions.
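To make the issue of non-unique lower-level solutions concrete, the sketch below solves a single stage of a finite leader-follower game in which the follower may be indifferent between several best responses. It assumes two tie-breaking conventions that are common in the bi-level optimization literature, optimistic (ties resolved in the leader's favor) and pessimistic (ties resolved against the leader); the abstract does not say which solution kinds the paper defines, so these names, the function signatures, and the payoff tables are all hypothetical. A dynamic programming scheme would apply such a stage solver backward in time, which is not shown here.

```python
def follower_best_responses(follower_payoff, leader_action, follower_actions):
    """Return every follower action that maximizes the follower's payoff
    against the given leader action (the set may contain ties)."""
    best = max(follower_payoff[(leader_action, b)] for b in follower_actions)
    return [b for b in follower_actions
            if follower_payoff[(leader_action, b)] == best]


def stackelberg_stage(leader_payoff, follower_payoff,
                      leader_actions, follower_actions, optimistic=True):
    """Solve one stage of a finite Stackelberg game.

    optimistic=True : ties among follower best responses favor the leader.
    optimistic=False: ties are resolved against the leader (pessimistic).
    Returns (leader_action, guaranteed_leader_payoff).
    """
    pick = max if optimistic else min
    best_action, best_value = None, float("-inf")
    for a in leader_actions:
        responses = follower_best_responses(follower_payoff, a, follower_actions)
        value = pick(leader_payoff[(a, b)] for b in responses)
        if value > best_value:
            best_action, best_value = a, value
    return best_action, best_value


# Illustrative payoffs only: the follower is indifferent after "L1".
leader_actions = ["L1", "L2"]
follower_actions = ["x", "y"]
follower_payoff = {("L1", "x"): 1, ("L1", "y"): 1,
                   ("L2", "x"): 0, ("L2", "y"): 2}
leader_payoff = {("L1", "x"): 5, ("L1", "y"): 0,
                 ("L2", "x"): 4, ("L2", "y"): 3}

optimistic = stackelberg_stage(leader_payoff, follower_payoff,
                               leader_actions, follower_actions, optimistic=True)
pessimistic = stackelberg_stage(leader_payoff, follower_payoff,
                                leader_actions, follower_actions, optimistic=False)
print(optimistic, pessimistic)
```

With these payoffs the two conventions select different leader actions, which shows why the choice of solution concept matters once the follower's best response is not unique.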

13.
A dynamic process is an approach to cooperative games, and it can be defined as a process that leads the players to a solution for cooperative games. Hwang et al. (2005) adopted Hamiache's associated game (2001) to provide a dynamic process leading to the Shapley value. In this paper, we propose a dynamic transfer scheme on the basis of the dual similar associated game, to lead to any solution satisfying both the inessential game property and continuity, starting from an arbitrary efficient payoff vector.

14.
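We study the existence of Nash equilibria in noncooperative games with arbitrarily many players (large games). Ma's 1969 section theorem is generalized to obtain a new section theorem. Using this new section theorem, we further prove: 1) the existence of Nash equilibria in large games; 2) the existence of mixed-strategy Nash equilibria in continuous large games when the pure strategy sets are compact metric spaces and the payoff functions are continuous. Moreover, the existence theorems imply Salonen's 2010 result; that is, the results of this study are more general than Salonen's conclusions.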

15.
Systems that involve more than one decision maker are often optimized using the theory of games. In traditional game theory, it is assumed that each player has a well-defined quantitative utility function over the player's decision space. Each player attempts to maximize/minimize his/her own expected utility and each is assumed to know the extensive game in full. At present, it cannot be claimed that the first assumption has been shown to be true in a wide variety of situations involving complex problems in economics, engineering, social and political sciences due to the difficulty inherent in defining an adequate utility function for each player in these types of problems. On the other hand, in many such complex problems, each player has a heuristic knowledge of the desires of the other players and a heuristic knowledge of the control choices that they will make in order to meet their ends. In this paper, we utilize fuzzy set theory in order to incorporate the players' heuristic knowledge of decision making into the framework of conventional game theory or ordinal game theory. We define a new approach to N-person static fuzzy noncooperative games and develop a solution concept such as the Nash equilibrium for these types of games. We show that this general formulation of fuzzy noncooperative games can be applied to solve multidecision-making problems where no objective function is specified. The computational procedure is illustrated via application to a multiagent optimization problem dealing with the design and operation of future military operations.

16.
This paper considers nonzero-sum multicriteria games with continuous kernels. Solution concepts based on the notions of Pareto optimality, equilibrium, and security are extended to these games. Separate necessary and sufficient conditions and existence results are presented for equilibrium, Pareto-optimal response, and Pareto-optimal security strategies of the players. This paper is based partially on research supported by the Council of Scientific and Industrial Research, India, through a Research Associateship Grant to the first author. The authors are grateful to two anonymous referees for suggesting useful changes and pointing out some errors in a previous draft.

17.
In this article, we study three aspects of mean field games (MFG). The first one is the case when the dynamics of each player depend on the strategies of the other players. The second one concerns the modeling of “noise” in discrete space models and the formulation of the Master Equation in this case. Finally, we show how MFG reduce to agent based models when the intertemporal preference rate goes to infinity, i.e. when the anticipation of the players vanishes.

18.
Stopping games (without simultaneous stopping) are sequential games in which at every stage one of the players is chosen, who decides whether to continue the interaction or stop it, whereby a terminal payoff vector is obtained. Periodic stopping games are stopping games in which both of the processes that define them, the payoff process as well as the process by which players are chosen, are periodic and do not depend on the past choices. We prove that every periodic stopping game without simultaneous stopping has either a periodic subgame perfect ϵ-equilibrium or a subgame perfect 0-equilibrium in pure strategies. This work is part of the master thesis of the author done under the supervision of Prof. Eilon Solan. I am thankful to Prof. Solan for his inspiring guidance. I also thank two anonymous referees of the International Journal of Game Theory for their comments.

19.
Qualitative (game of kind) outcomes of two-target games are analyzed in this paper, under both the zero-sum and nonzero-sum preference ordering of outcomes by the players. The outcome regions of each player are defined from a security standpoint. The secured draw and mutual-kill regions of a player depend explicitly on his preference ordering of outcomes and should be constructed separately for each player, especially in a nonzero-sum game. General guidelines are presented for identifying the secured outcome regions of players in a class of two-target games that satisfy an Isaacs-like condition, in terms of the qualitative solutions of the two underlying single-target pursuit-evasion games. A construction has been proposed for obtaining the qualitative solution of a large class of two-target games. Illustrative examples are included. This work was done while the first author was a Research Associate in the Department of Electrical Engineering at the Indian Institute of Science, Bangalore, and was financially supported by the Council of Scientific and Industrial Research, Delhi, India.

20.
This paper obtains the Stackelberg solution to a class of two-player stochastic differential games described by linear state dynamics and quadratic objective functionals. The information structure of the problem is such that the players make independent noisy measurements of the initial state and are permitted to utilize only this information in constructing their controls. Furthermore, by the very nature of the Stackelberg solution concept, one of the players is assumed to know, in advance, the strategy of the other player (the leader). For this class of problems, we first establish existence and uniqueness of the Stackelberg solution and then relate the derivation of the leader's Stackelberg solution to the optimal solution of a nonstandard stochastic control problem. This stochastic control problem is solved in a more general context, and its solution is utilized in constructing the Stackelberg strategy of the leader. For the special case of Gaussian statistics, it is shown that this optimal strategy is affine in the leader's observation. The paper also discusses numerical aspects of the Stackelberg solution under general statistics and develops algorithms which converge to the unique Stackelberg solution. This work was performed while the second author was on sabbatical leave at the Department of Applied Mathematics, Twente University of Technology, Enschede, Holland.
