fbpx
Wikipedia

Epsilon-equilibrium

In game theory, an epsilon-equilibrium, or near-Nash equilibrium, is a strategy profile that approximately satisfies the condition of Nash equilibrium. In a Nash equilibrium, no player has an incentive to change his behavior. In an approximate Nash equilibrium, this requirement is weakened to allow the possibility that a player may have a small incentive to do something different. This may still be considered an adequate solution concept, assuming for example status quo bias. This solution concept may be preferred to Nash equilibrium due to being easier to compute, or alternatively due to the possibility that in games of more than 2 players, the probabilities involved in an exact Nash equilibrium need not be rational numbers.[1]

Epsilon-equilibrium
A solution concept in game theory
Relationship
Superset ofNash Equilibrium
Significance
Used forstochastic games

Definition edit

There is more than one alternative definition.

The standard definition edit

Given a game and a real non-negative parameter  , a strategy profile is said to be an  -equilibrium if it is not possible for any player to gain more than   in expected payoff by unilaterally deviating from his strategy.[2]: 45  Every Nash Equilibrium is equivalent to an  -equilibrium where  .

Formally, let   be an  -player game with action sets   for each player   and utility function  . Let   denote the payoff to player   when strategy profile   is played. Let   be the space of probability distributions over  . A vector of strategies   is an  -Nash Equilibrium for   if

  for all  

Well-supported approximate equilibrium edit

The following definition[3] imposes the stronger requirement that a player may only assign positive probability to a pure strategy   if the payoff of   has expected payoff at most   less than the best response payoff. Let   be the probability that strategy profile   is played. For player   let   be strategy profiles of players other than  ; for   and a pure strategy   of   let   be the strategy profile where   plays   and other players play  . Let   be the payoff to   when strategy profile   is used. The requirement can be expressed by the formula

 

Results edit

The existence of a polynomial-time approximation scheme (PTAS) for ε-Nash equilibria is equivalent to the question of whether there exists one for ε-well-supported approximate Nash equilibria,[4] but the existence of a PTAS remains an open problem. For constant values of ε, polynomial-time algorithms for approximate equilibria are known for lower values of ε than are known for well-supported approximate equilibria. For games with payoffs in the range [0,1] and ε=0.3393, ε-Nash equilibria can be computed in polynomial time[5] For games with payoffs in the range [0,1] and ε=2/3, ε-well-supported equilibria can be computed in polynomial time[6]

Example edit

The notion of ε-equilibria is important in the theory of stochastic games of potentially infinite duration. There are simple examples of stochastic games with no Nash equilibrium but with an ε-equilibrium for any ε strictly bigger than 0.

Perhaps the simplest such example is the following variant of Matching Pennies, suggested by Everett. Player 1 hides a penny and Player 2 must guess if it is heads up or tails up. If Player 2 guesses correctly, he wins the penny from Player 1 and the game ends. If Player 2 incorrectly guesses that the penny is heads up, the game ends with payoff zero to both players. If he incorrectly guesses that it is tails up, the game repeats. If the play continues forever, the payoff to both players is zero.

Given a parameter ε > 0, any strategy profile where Player 2 guesses heads up with probability ε and tails up with probability 1 − ε (at every stage of the game, and independently from previous stages) is an ε-equilibrium for the game. The expected payoff of Player 2 in such a strategy profile is at least 1 − ε. However, it is easy to see that there is no strategy for Player 2 that can guarantee an expected payoff of exactly 1. Therefore, the game has no Nash equilibrium.

Another simple example is the finitely repeated prisoner's dilemma for T periods, where the payoff is averaged over the T periods. The only Nash equilibrium of this game is to choose Defect in each period. Now consider the two strategies tit-for-tat and grim trigger. Although neither tit-for-tat nor grim trigger are Nash equilibria for the game, both of them are  -equilibria for some positive  . The acceptable values of   depend on the payoffs of the constituent game and on the number T of periods.

In economics, the concept of a pure strategy epsilon-equilibrium is used when the mixed-strategy approach is seen as unrealistic. In a pure-strategy epsilon-equilibrium, each player chooses a pure-strategy that is within epsilon of its best pure-strategy. For example, in the Bertrand–Edgeworth model, where no pure-strategy equilibrium exists, a pure-strategy epsilon equilibrium may exist.

References edit

Inline citations
  1. ^ V. Bubelis (1979). "On equilibria in finite games". International Journal of Game Theory. 8 (2): 65–79. doi:10.1007/bf01768703. S2CID 122843303.
  2. ^ Vazirani, Vijay V.; Nisan, Noam; Roughgarden, Tim; Tardos, Éva (2007). Algorithmic Game Theory (PDF). Cambridge, UK: Cambridge University Press. ISBN 0-521-87282-0.
  3. ^ P.W. Goldberg and C.H. Papadimitriou (2006). "Reducibility Among Equilibrium Problems". 38th Symposium on Theory of Computing. pp. 61–70. doi:10.1145/1132516.1132526.
  4. ^ C. Daskalakis, P.W. Goldberg and C.H. Papadimitriou (2009). "The Complexity of Computing a Nash Equilibrium". SIAM Journal on Computing. 39 (3): 195–259. CiteSeerX 10.1.1.68.6111. doi:10.1137/070699652.
  5. ^ H. Tsaknakis and Paul G. Spirakis (2008). "An optimisation approach for approximate Nash equilibria". Internet Mathematics. 5 (4): 365–382. doi:10.1080/15427951.2008.10129172.
  6. ^ Spyros C. Kontogiannis and Paul G. Spirakis (2010). "Well Supported Approximate Equilibria in Bimatrix Games". Algorithmica. 57 (4): 653–667. doi:10.1007/s00453-008-9227-6. S2CID 15968419.
Sources
  • H Dixon Approximate Bertrand Equilibrium in a Replicated Industry, Review of Economic Studies, 54 (1987), pages 47–62.
  • H. Everett. "Recursive Games". In H.W. Kuhn and A.W. Tucker, editors. Contributions to the theory of games, vol. III, volume 39 of Annals of Mathematical Studies. Princeton University Press, 1957.
  • Leyton-Brown, Kevin; Shoham, Yoav (2008), Essentials of Game Theory: A Concise, Multidisciplinary Introduction, San Rafael, CA: Morgan & Claypool Publishers, ISBN 978-1-59829-593-1. An 88-page mathematical introduction; see Section 3.7. Free online at many universities.
  • R. Radner. Collusive behavior in non-cooperative epsilon equilibria of oligopolies with long but finite lives, Journal of Economic Theory, 22, 121–157, 1980.
  • Shoham, Yoav; Leyton-Brown, Kevin (2009), Multiagent Systems: Algorithmic, Game-Theoretic, and Logical Foundations, New York: Cambridge University Press, ISBN 978-0-521-89943-7. A comprehensive reference from a computational perspective; see Section 3.4.7. Downloadable free online.
  • S.H. Tijs. Nash equilibria for noncooperative n-person games in normal form, SIAM Review, 23, 225–237, 1981.

epsilon, equilibrium, game, theory, epsilon, equilibrium, near, nash, equilibrium, strategy, profile, that, approximately, satisfies, condition, nash, equilibrium, nash, equilibrium, player, incentive, change, behavior, approximate, nash, equilibrium, this, re. In game theory an epsilon equilibrium or near Nash equilibrium is a strategy profile that approximately satisfies the condition of Nash equilibrium In a Nash equilibrium no player has an incentive to change his behavior In an approximate Nash equilibrium this requirement is weakened to allow the possibility that a player may have a small incentive to do something different This may still be considered an adequate solution concept assuming for example status quo bias This solution concept may be preferred to Nash equilibrium due to being easier to compute or alternatively due to the possibility that in games of more than 2 players the probabilities involved in an exact Nash equilibrium need not be rational numbers 1 Epsilon equilibriumA solution concept in game theoryRelationshipSuperset ofNash EquilibriumSignificanceUsed forstochastic games Contents 1 Definition 1 1 The standard definition 1 2 Well supported approximate equilibrium 2 Results 3 Example 4 ReferencesDefinition editThere is more than one alternative definition The standard definition edit Given a game and a real non negative parameter e displaystyle varepsilon nbsp a strategy profile is said to be an e displaystyle varepsilon nbsp equilibrium if it is not possible for any player to gain more than e displaystyle varepsilon nbsp in expected payoff by unilaterally deviating from his strategy 2 45 Every Nash Equilibrium is equivalent to an e displaystyle varepsilon nbsp equilibrium where e 0 displaystyle varepsilon 0 nbsp Formally let G N A A 1 A N u A R N displaystyle G N A A 1 times dotsb times A N u colon A to R N nbsp be an N displaystyle N nbsp player game with action sets A i displaystyle A i nbsp for each player i displaystyle i nbsp and utility function u displaystyle u nbsp Let u i s displaystyle u i s nbsp denote the payoff to player i displaystyle i nbsp when strategy profile s displaystyle s nbsp is played Let D i displaystyle Delta i nbsp be the space of probability distributions over A i displaystyle A i nbsp A vector of strategies s D D 1 D N displaystyle sigma in Delta Delta 1 times dotsb times Delta N nbsp is an e displaystyle varepsilon nbsp Nash Equilibrium for G displaystyle G nbsp if u i s u i s i s i e displaystyle u i sigma geq u i sigma i sigma i varepsilon nbsp for all s i D i i N displaystyle sigma i in Delta i i in N nbsp Well supported approximate equilibrium edit The following definition 3 imposes the stronger requirement that a player may only assign positive probability to a pure strategy a displaystyle a nbsp if the payoff of a displaystyle a nbsp has expected payoff at most e displaystyle varepsilon nbsp less than the best response payoff Let x s displaystyle x s nbsp be the probability that strategy profile s displaystyle s nbsp is played For player p displaystyle p nbsp let S p displaystyle S p nbsp be strategy profiles of players other than p displaystyle p nbsp for s S p displaystyle s in S p nbsp and a pure strategy j displaystyle j nbsp of p displaystyle p nbsp let j s displaystyle js nbsp be the strategy profile where p displaystyle p nbsp plays j displaystyle j nbsp and other players play s displaystyle s nbsp Let u p s displaystyle u p s nbsp be the payoff to p displaystyle p nbsp when strategy profile s displaystyle s nbsp is used The requirement can be expressed by the formula s S p u p j s x s gt e s S p u p j s x s x j p 0 displaystyle sum s in S p u p js x s gt varepsilon sum s in S p u p j s x s Longrightarrow x j p 0 nbsp Results editThe existence of a polynomial time approximation scheme PTAS for e Nash equilibria is equivalent to the question of whether there exists one for e well supported approximate Nash equilibria 4 but the existence of a PTAS remains an open problem For constant values of e polynomial time algorithms for approximate equilibria are known for lower values of e than are known for well supported approximate equilibria For games with payoffs in the range 0 1 and e 0 3393 e Nash equilibria can be computed in polynomial time 5 For games with payoffs in the range 0 1 and e 2 3 e well supported equilibria can be computed in polynomial time 6 Example editThe notion of e equilibria is important in the theory of stochastic games of potentially infinite duration There are simple examples of stochastic games with no Nash equilibrium but with an e equilibrium for any e strictly bigger than 0 Perhaps the simplest such example is the following variant of Matching Pennies suggested by Everett Player 1 hides a penny and Player 2 must guess if it is heads up or tails up If Player 2 guesses correctly he wins the penny from Player 1 and the game ends If Player 2 incorrectly guesses that the penny is heads up the game ends with payoff zero to both players If he incorrectly guesses that it is tails up the game repeats If the play continues forever the payoff to both players is zero Given a parameter e gt 0 any strategy profile where Player 2 guesses heads up with probability e and tails up with probability 1 e at every stage of the game and independently from previous stages is an e equilibrium for the game The expected payoff of Player 2 in such a strategy profile is at least 1 e However it is easy to see that there is no strategy for Player 2 that can guarantee an expected payoff of exactly 1 Therefore the game has no Nash equilibrium Another simple example is the finitely repeated prisoner s dilemma for T periods where the payoff is averaged over the T periods The only Nash equilibrium of this game is to choose Defect in each period Now consider the two strategies tit for tat and grim trigger Although neither tit for tat nor grim trigger are Nash equilibria for the game both of them are ϵ displaystyle epsilon nbsp equilibria for some positive ϵ displaystyle epsilon nbsp The acceptable values of ϵ displaystyle epsilon nbsp depend on the payoffs of the constituent game and on the number T of periods In economics the concept of a pure strategy epsilon equilibrium is used when the mixed strategy approach is seen as unrealistic In a pure strategy epsilon equilibrium each player chooses a pure strategy that is within epsilon of its best pure strategy For example in the Bertrand Edgeworth model where no pure strategy equilibrium exists a pure strategy epsilon equilibrium may exist References editInline citations V Bubelis 1979 On equilibria in finite games International Journal of Game Theory 8 2 65 79 doi 10 1007 bf01768703 S2CID 122843303 Vazirani Vijay V Nisan Noam Roughgarden Tim Tardos Eva 2007 Algorithmic Game Theory PDF Cambridge UK Cambridge University Press ISBN 0 521 87282 0 P W Goldberg and C H Papadimitriou 2006 Reducibility Among Equilibrium Problems 38th Symposium on Theory of Computing pp 61 70 doi 10 1145 1132516 1132526 C Daskalakis P W Goldberg and C H Papadimitriou 2009 The Complexity of Computing a Nash Equilibrium SIAM Journal on Computing 39 3 195 259 CiteSeerX 10 1 1 68 6111 doi 10 1137 070699652 H Tsaknakis and Paul G Spirakis 2008 An optimisation approach for approximate Nash equilibria Internet Mathematics 5 4 365 382 doi 10 1080 15427951 2008 10129172 Spyros C Kontogiannis and Paul G Spirakis 2010 Well Supported Approximate Equilibria in Bimatrix Games Algorithmica 57 4 653 667 doi 10 1007 s00453 008 9227 6 S2CID 15968419 SourcesH Dixon Approximate Bertrand Equilibrium in a Replicated Industry Review of Economic Studies 54 1987 pages 47 62 H Everett Recursive Games In H W Kuhn and A W Tucker editors Contributions to the theory of games vol III volume 39 of Annals of Mathematical Studies Princeton University Press 1957 Leyton Brown Kevin Shoham Yoav 2008 Essentials of Game Theory A Concise Multidisciplinary Introduction San Rafael CA Morgan amp Claypool Publishers ISBN 978 1 59829 593 1 An 88 page mathematical introduction see Section 3 7 Free online at many universities R Radner Collusive behavior in non cooperative epsilon equilibria of oligopolies with long but finite lives Journal of Economic Theory 22 121 157 1980 Shoham Yoav Leyton Brown Kevin 2009 Multiagent Systems Algorithmic Game Theoretic and Logical Foundations New York Cambridge University Press ISBN 978 0 521 89943 7 A comprehensive reference from a computational perspective see Section 3 4 7 Downloadable free online S H Tijs Nash equilibria for noncooperativen person games in normal form SIAM Review 23 225 237 1981 Retrieved from https en wikipedia org w index php title Epsilon equilibrium amp oldid 1187493130, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.