期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Perturbation theory for games in normal form and stochastic games

S. H. Tijs O. J. Vrieze 《Journal of Optimization Theory and Applications》1980,30(4):549-567

In this paper, the effect on values and optimal strategies of perturbations of game parameters (payoff function, transition probability function, and discount factor) is studied for the class of zero-sum games in normal form and for the class of stationary, discounted, two-person, zero-sum stochastic games.A main result is that, under certain conditions, the value depends on these parameters in a pointwise Lipschitz continuous way and that the sets of -optimal strategies for both players are upper semicontinuous multifunctions of the game parameters.Extensions to general-sum games and nonstationary stochastic games are also indicated. 相似文献

2.

Evolutionary games on graphs and discrete dynamical systems

Jeremias Epperlein Petr Stehlík 《Journal of Difference Equations and Applications》2013,19(2):72-95

相似文献

3.

Preferred coalitions in cooperative differential games

H. L. Logan Jr. 《Journal of Optimization Theory and Applications》1974,13(2):186-202

There are many interesting situations which can be described by anN-person general-sum differential game. Such games are characterized by the fact that the strategy of each player depends upon reasonable assumptions about the strategies of the remaining players; and, thus, these games cannot be considered asN uncoupled optimal control problems. In such cases, we say that the game is not strictly competitive, but involves a mutual interest which makes it possible for all of the players to reduce their costs by cooperating with one another, provided the resulting agreement can be enforced. When cooperation is allowed and there are more than two players, there is always the question of whether all possible subcoalitions will be formed with equal ease. This work considers the situation in which a particular subcoalition is preferred. A theory of general-sum games with preferred coalitions is presented, together with constructive examples of alternative approaches which are unsatisfactory. 相似文献

4.

Onn-person noncooperative multicriteria games described in strategic form

Lech Kruś Piotr Bronisz 《Annals of Operations Research》1994,51(2):83-97

This paper undertakes the problem of multicriteria decision support in conflict situations described as a noncooperative game. Construction of such a decision support requires the development of the noncooperative game theory to be generalized for the multicriteria case. New theoretical results in this case are presented. Features of the multicriteria noncooperative games are shown with use of a duopoly model in case of two goods and two criteria of each player. Concepts of the decision support are discussed. 相似文献

5.

Convex semi-infinite games

M. A. Lopez D. E. Vercher 《Journal of Optimization Theory and Applications》1986,50(2):289-312

This paper introduces a generalization of semi-infinite games. The pure strategies for player I involve choosing one function from an infinite family of convex functions, while the set of mixed strategies for player II is a closed convex setC inR ⁿ. The minimax theorem applies under a condition which limits the directions of recession ofC. Player II always has optimal strategies. These are shown to exist for player I also if a certain infinite system verifies the property of Farkas-Minkowski. The paper also studies certain conditions that guarantee the finiteness of the value of the game and the existence of optimal pure strategies for player I.Many thanks are due to the referees for their detailed comments. 相似文献

6.

Learning algorithms for repeated bimatrix Nash games with incomplete information

G. P. Papavassilopoulos 《Journal of Optimization Theory and Applications》1989,62(3):467-488

The purpose of this paper is to study a particular recursive scheme for updating the actions of two players involved in a Nash game, who do not know the parameters of the game, so that the resulting costs and strategies converge to (or approach a neighborhood of) those that could be calculated in the known parameter case. We study this problem in the context of a matrix Nash game, where the elements of the matrices are unknown to both players. The essence of the contribution of this paper is twofold. On the one hand, it shows that learning algorithms which are known to work for zero-sum games or team problems can also perform well for Nash games. On the other hand, it shows that, if two players act without even knowing that they are involved in a game, but merely thinking that they try to maximize their output using the learning algorithm proposed, they end up being in Nash equilibrium.This research was supported in part by NSF Grant No. ECS-87-14777. 相似文献

7.

Values for strategic games in which players cooperate

Luisa Carpente Balbina Casas-Méndez Ignacio García-Jurado Anne van den Nouwel 《International Journal of Game Theory》2005,33(3):397-419

In this paper we propose a new method to associate a coalitional game with each strategic game. The method is based on the lower value of finite two-player zero-sum games. We axiomatically characterize this new method, as well as the method that was described in Von Neumann and Morgenstern (1944). As an intermediate step, we provide axiomatic characterizations of the value and the lower value of matrix games and finite two-player zero-sum games, respectively.The authors acknowledge the financial support of Ministerio de Ciencia y Tecnologia, FEDER andXunta de Galicia through projects BEC2002-04102-C02-02 and PGIDIT03PXIC20701PN.We wish to thank Professor William Thomson as well as an anonymous referee for useful comments. 相似文献

8.

Effective learning in the presence of adaptive counterparts

Andriy Burkov Brahim Chaib-draa 《Journal of Algorithms in Cognition, Informatics and Logic》2009,64(4):127-138

Adaptive learning algorithms (ALAs) is an important class of agents that learn the utilities of their strategies jointly with the maintenance of the beliefs about their counterparts' future actions. In this paper, we propose an approach of learning in the presence of adaptive counterparts. Our Q-learning based algorithm, called Adaptive Dynamics Learner (ADL), assigns Q-values to the fixed-length interaction histories. This makes it capable of exploiting the strategy update dynamics of the adaptive learners. By so doing, ADL usually obtains higher utilities than those of equilibrium solutions. We tested our algorithm on a substantial representative set of the most known and demonstrative matrix games. We observed that ADL is highly effective in the presence of such ALAs as Adaptive Play Q-learning, Infinitesimal Gradient Ascent, Policy Hill-Climbing and Fictitious Play Q-learning. Further, in self-play ADL usually converges to a Pareto efficient average utility. 相似文献

9.

Existence of minmax points in discontinuous strategic games

Vincenzo Scalzo 《Journal of Mathematical Analysis and Applications》2011,374(1):331-333

In this note we prove the existence of minmax points for strategic form games where the sets of strategies are topological spaces and the payoff functions satisfy conditions weaker than continuity. The employed tools are the class of transfer weakly upper continuous functions and the class of weakly lower pseudocontinuous functions. An example shows that our result is of minimal character. 相似文献

10.

Almost global convergence to p-dominant equilibrium

William H. Sandholm 《International Journal of Game Theory》2001,30(1):107-116

A population of players repeatedly plays an n strategy symmetric game. Players update their strategies by sampling the behavior of k opponents and playing a best response to the distribution of strategies in the sample. Suppose the game possesses a -dominant strategy which is initially played by a positive fraction of the population. Then if the population size is large enough, play converges to the -dominant equilibrium with arbitrarily high probability. Received December 1999/Revised version November 2000 相似文献

11.

Construction of Nash equilibria in symmetric stochastic games of capital accumulation

Łukasz Balbus Andrzej S. Nowak 《Mathematical Methods of Operations Research》2004,60(2):267-277

相似文献

12.

Deposit games with reinvestment

Gerwald van Gulick Peter Borm Anja De Waegenaere Ruud Hendrickx 《European Journal of Operational Research》2010,200(3):788-799

In a deposit game coalitions are formed by players combining their capital. The proceeds of their investments then have to be divided among those players. The current model extends earlier work on capital deposits by allowing reinvestment of returns. Two specific subclasses of deposit games are introduced. These subclasses provide insight in two extreme cases. It is seen that each term dependent deposit game possesses a core element. Capital dependent deposit games are also shown to have a core element and even a population monotonic allocation scheme if the revenue function exhibits increasing returns to scale. Furthermore, it is shown that all superadditive games are deposit games if one allows for debt. 相似文献

13.

A solution for noncooperative games

P. P. Shenoy 《Journal of Optimization Theory and Applications》1982,38(4):565-579

In this paper, we study solutions of strict noncooperative games that are played just once. The players are not allowed to communicate with each other. The main ingredient of our theory is the concept of rationalizing a set of strategies for each player of a game. We state an axiom based on this concept that every solution of a noncooperative game is required to satisfy. Strong Nash solvability is shown to be a sufficient condition for the rationalizing set to exist, but it is not necessary. Also, Nash solvability is neither necessary nor sufficient for the existence of the rationalizing set of a game. For a game with no solution (in our sense), a player is assumed to recourse to a standard of behavior. Some standards of behavior are examined and discussed.This work was sponsored by the United States Army under Contract No. DAAG29-75-C-0024 and by the National Science Foundation under Grant No. MCS-75-17385-A01. The author is grateful to J. C. Harsanyi for his comments and to S. M. Robinson for suggesting the problem. 相似文献

14.

Stay-in-a-set games

Piercesare Secchi William D. Sudderth 《International Journal of Game Theory》2002,30(4):479-490

There exists a Nash equilibrium (ε-Nash equilibrium) for every n-person stochastic game with a finite (countable) state space and finite action sets for the players if the payoff to each player i is one when the process of states remains in a given set of states G _i and is zero otherwise. Received: December 2000 相似文献

15.

A reinforcement learning process in extensive form games

Jean-François Laslier Bernard Walliser 《International Journal of Game Theory》2005,33(2):219-227

The CPR (“cumulative proportional reinforcement”) learning rule stipulates that an agent chooses a move with a probability proportional to the cumulative payoff she obtained in the past with that move. Previously considered for strategies in normal form games (Laslier, Topol and Walliser, Games and Econ. Behav., 2001), the CPR rule is here adapted for actions in perfect information extensive form games. The paper shows that the action-based CPR process converges with probability one to the (unique) subgame perfect equilibrium.Received: October 2004 相似文献

16.

Recall in extensive form games

Klaus Ritzberger 《International Journal of Game Theory》1999,28(1):69-87

This paper considers characterizations of perfect recall in extensive form games. It is shown that perfect recall can be expressed in terms of choices without any reference to infomation sets. When information sets are taken into account, it is decomposable into an ordering of information sets and that players do not forget what they knew nor what they did. Thus, if information sets are partially ordered, then perfect recall is implied by the player's inability to refine her information from the memory. Received: August 1997/final version: September 1998 相似文献

17.

Differential games without pure strategy saddle-point solutions

W. E. Schmitendorf 《Journal of Optimization Theory and Applications》1976,18(1):81-92

In some two-player, zero-sum differential games, pure strategy saddle-point solutions do not exist. For such games, the concept of a minmax strategy is examined, and sufficient conditions for a control to be a minmax control are presented. Both the open-loop and the closed-loop cases are considered.The research was partially supported by ONR under Contract No. N00014-69-A-0200-12. An earlier version of this paper was presented at the Eleventh Annual Allerton Conference on Circuit and System Theory, Monticello, Illinois, 1973.The author wishes to acknowledge his many valuable discussions of this problem with Professor G. Leitmann and also to thank one of the reviewers for his suggestions for simplifying the proof of Theorem 2.1. 相似文献

18.

再制造市场OEM与UOEM的博弈与学习研究 总被引：1，自引：0，他引：1

石岿然孙玉玲吴鸽《运筹与管理》2015,24(1):129-136

在再制造利益的驱动下,一些非原始设备制造商(UOEM)欲进入再制造市场。为探究UOEM参与再制造的进入博弈,应用演化博弈理论构建了原始设备制造商(OEM)和UOEM策略选择的复制动态。研究表明:博弈双方的回收价格、UOEM排除障碍的成本会影响UOEM的策略选择;OEM选择默许而潜在的UOEM进入再制造品市场是二维动态系统唯一的演化稳定策略。进一步考虑了参与人的学习行为,将噪声项引入复制动态方程中,得到了一个非子博弈完美均衡,即当带着噪声项的OEM采取竞争策略时,进入者的最优策略是置身于市场之外。相似文献

19.

Using spatial games to model dynamic evolutionary systems

Yan Kuang Chih-Hang Wu Songnian Zhao 《Mathematical and Computer Modelling of Dynamical Systems: Methods, Tools and Applications in Engineering and Related Sciences》2018,24(3):296-313

Modelling the dynamics of evolutionary competing species on a physical grid is a challenging modelling problem. This paper presents a novel modelling approach for synthesizing evolutionary dynamics of competing species using a spatial game perspective. This modelling approach describes the movement of players (‘species’ in our context) across a lattice. The model is based on a payoff function which controls the move likelihood and direction of the players (‘predators’ and ‘preys’). Using simulated results, the paper provides a comparison between the spatial game model and an existing predator-prey dynamic model. Finally, a case study is performed to illustrate the application of this formalism and validate the model. 相似文献

20.

Self-Adaptive Genetic Algorithm for Clustering 总被引：6，自引：0，他引：6

Juha Kivijärvi Pasi Fränti Olli Nevalainen 《Journal of Heuristics》2003,9(2):113-129

Clustering is a hard combinatorial problem which has many applications in science and practice. Genetic algorithms (GAs) have turned out to be very effective in solving the clustering problem. However, GAs have many parameters, the optimal selection of which depends on the problem instance. We introduce a new self-adaptive GA that finds the parameter setup on-line during the execution of the algorithm. In this way, the algorithm is able to find the most suitable combination of the available components. The method is robust and achieves results comparable to or better than a carefully fine-tuned non-adaptive GA. 相似文献