fbpx
Wikipedia

Stackelberg competition

The Stackelberg leadership model is a strategic game in economics in which the leader firm moves first and then the follower firms move sequentially (hence, it is sometimes described as the "leader-follower game"). It is named after the German economist Heinrich Freiherr von Stackelberg who published Marktform und Gleichgewicht [Market Structure and Equilibrium] in 1934, which described the model. In game theory terms, the players of this game are a leader and a follower and they compete on quantity. The Stackelberg leader is sometimes referred to as the Market Leader.

There are some further constraints upon the sustaining of a Stackelberg equilibrium. The leader must know ex ante that the follower observes its action. The follower must have no means of committing to a future non-Stackelberg leader's action and the leader must know this. Indeed, if the 'follower' could commit to a Stackelberg leader action and the 'leader' knew this, the leader's best response would be to play a Stackelberg follower action.

Firms may engage in Stackelberg competition if one has some sort of advantage enabling it to move first. More generally, the leader must have commitment power. Moving observably first is the most obvious means of commitment: once the leader has made its move, it cannot undo it—it is committed to that action. Moving first may be possible if the leader was the incumbent monopoly of the industry and the follower is a new entrant. Holding excess capacity is another means of commitment.

Subgame perfect Nash equilibrium edit

The Stackelberg model can be solved to find the subgame perfect Nash equilibrium or equilibria (SPNE), i.e. the strategy profile that serves best each player, given the strategies of the other player and that entails every player playing in a Nash equilibrium in every subgame.

In very general terms, let the price function for the (duopoly) industry be  ; price is simply a function of total (industry) output, so is   where the subscript   represents the leader and   represents the follower. Suppose firm   has the cost structure  . The model is solved by backward induction. The leader considers what the best response of the follower is, i.e. how it will respond once it has observed the quantity of the leader. The leader then picks a quantity that maximises its payoff, anticipating the predicted response of the follower. The follower actually observes this and in equilibrium picks the expected quantity as a response.

To calculate the SPNE, the best response functions of the follower must first be calculated (calculation moves 'backwards' because of backward induction).

The profit of firm   (the follower) is revenue minus cost. Revenue is the product of price and quantity and cost is given by the firm's cost structure, so profit is:  . The best response is to find the value of   that maximises   given  , i.e. given the output of the leader (firm  ), the output that maximises the follower's profit is found. Hence, the maximum of   with respect to   is to be found. First differentiate   with respect to  :

 

Setting this to zero for maximisation:

 

The values of   that satisfy this equation are the best responses. Now the best response function of the leader is considered. This function is calculated by considering the follower's output as a function of the leader's output, as just computed.

The profit of firm   (the leader) is  , where   is the follower's quantity as a function of the leader's quantity, namely the function calculated above. The best response is to find the value of   that maximises   given  , i.e. given the best response function of the follower (firm  ), the output that maximises the leader's profit is found. Hence, the maximum of   with respect to   is to be found. First, differentiate   with respect to  :

 

Setting this to zero for maximisation:

 

Examples edit

The following example is very general. It assumes a generalised linear demand structure

 

and imposes some restrictions on cost structures for simplicity's sake so the problem can be resolved.

  and  

for ease of computation.

The follower's profit is:

 

The maximisation problem resolves to (from the general case):

 
 
 

Consider the leader's problem:

 

Substituting for   from the follower's problem:

 
 

The maximisation problem resolves to (from the general case):

 

Now solving for   yields  , the leader's optimal action:

 

This is the leader's best response to the reaction of the follower in equilibrium. The follower's actual can now be found by feeding this into its reaction function calculated earlier:

 
 

The Nash equilibria are all  . It is clear (if marginal costs are assumed to be zero - i.e. cost is essentially ignored) that the leader has a significant advantage. Intuitively, if the leader was no better off than the follower, it would simply adopt a Cournot competition strategy.

Plugging the follower's quantity  , back into the leader's best response function will not yield  . This is because once leader has committed to an output and observed the followers it always wants to reduce its output ex-post. However its inability to do so is what allows it to receive higher profits than under Cournot.

Economic analysis edit

An extensive-form representation is often used to analyze the Stackelberg leader-follower model. Also referred to as a “decision tree”, the model shows the combination of outputs and payoffs both firms have in the Stackelberg game.

 
A Stackelberg game represented in extensive form

The image on the left depicts in extensive form a Stackelberg game. The payoffs are shown on the right. This example is fairly simple. There is a basic cost structure involving only marginal cost (there is no fixed cost). The demand function is linear and price elasticity of demand is 1. However, it illustrates the leader's advantage.

The follower wants to choose   to maximise its payoff  . Taking the first order derivative and equating it to zero (for maximisation) yields   as the maximum value of  .

The leader wants to choose   to maximise its payoff  . However, in equilibrium, it knows the follower will choose   as above. So in fact the leader wants to maximise its payoff   (by substituting   for the follower's best response function). By differentiation, the maximum payoff is given by  . Feeding this into the follower's best response function yields  . Suppose marginal costs were equal for the firms (so the leader has no market advantage other than first move) and in particular  . The leader would produce 2000 and the follower would produce 1000. This would give the leader a profit (payoff) of two million and the follower a profit of one million. Simply by moving first, the leader has accrued twice the profit of the follower. However, Cournot profits here are 1.78 million apiece (strictly,   apiece), so the leader has not gained much, but the follower has lost. However, this is example-specific. There may be cases where a Stackelberg leader has huge gains beyond Cournot profit that approach monopoly profits (for example, if the leader also had a large cost structure advantage, perhaps due to a better production function). There may also be cases where the follower actually enjoys higher profits than the leader, but only because it, say, has much lower costs. This behaviour consistently work on duopoly markets even if the firms are asymmetrical.

Credible and non-credible threats by the follower edit

If, after the leader had selected its equilibrium quantity, the follower deviated from the equilibrium and chose some non-optimal quantity it would not only hurt itself, but it could also hurt the leader. If the follower chose a much larger quantity than its best response, the market price would lower and the leader's profits would be stung, perhaps below Cournot level profits. In this case, the follower could announce to the leader before the game starts that unless the leader chooses a Cournot equilibrium quantity, the follower will choose a deviant quantity that will hit the leader's profits. After all, the quantity chosen by the leader in equilibrium is only optimal if the follower also plays in equilibrium. The leader is, however, in no danger. Once the leader has chosen its equilibrium quantity, it would be irrational for the follower to deviate because it too would be hurt. Once the leader has chosen, the follower is better off by playing on the equilibrium path. Hence, such a threat by the follower would not be credible.

However, in an (indefinitely) repeated Stackelberg game, the follower might adopt a punishment strategy where it threatens to punish the leader in the next period unless it chooses a non-optimal strategy in the current period. This threat may be credible because it could be rational for the follower to punish in the next period so that the leader chooses Cournot quantities thereafter.

Stackelberg compared with Cournot edit

The Stackelberg and Cournot models are similar because in both competition is on quantity. However, as seen, the first move gives the leader in Stackelberg a crucial advantage. There is also the important assumption of perfect information in the Stackelberg game: the follower must observe the quantity chosen by the leader, otherwise the game reduces to Cournot. With imperfect information, the threats described above can be credible. If the follower cannot observe the leader's move, it is no longer irrational for the follower to choose, say, a Cournot level of quantity (in fact, that is the equilibrium action). However, it must be that there is imperfect information and the follower is unable to observe the leader's move because it is irrational for the follower not to observe if it can once the leader has moved. If it can observe, it will so that it can make the optimal decision. Any threat by the follower claiming that it will not observe even if it can is as uncredible as those above. This is an example of too much information hurting a player. In Cournot competition, it is the simultaneity of the game (the imperfection of knowledge) that results in neither player (ceteris paribus) being at a disadvantage.

Game theoretic considerations edit

As mentioned, imperfect information in a leadership game reduces to Cournot competition. However, some Cournot strategy profiles are sustained as Nash equilibria but can be eliminated as incredible threats (as described above) by applying the solution concept of subgame perfection. Indeed, it is the very thing that makes a Cournot strategy profile a Nash equilibrium in a Stackelberg game that prevents it from being subgame perfect.

Consider a Stackelberg game (i.e. one which fulfills the requirements described above for sustaining a Stackelberg equilibrium) in which, for some reason, the leader believes that whatever action it takes, the follower will choose a Cournot quantity (perhaps the leader believes that the follower is irrational). If the leader played a Stackelberg action, (it believes) that the follower will play Cournot. Hence it is non-optimal for the leader to play Stackelberg. In fact, its best response (by the definition of Cournot equilibrium) is to play Cournot quantity. Once it has done this, the best response of the follower is to play Cournot.

Consider the following strategy profiles: the leader plays Cournot; the follower plays Cournot if the leader plays Cournot and the follower plays Stackelberg if the leader plays Stackelberg and if the leader plays something else, the follower plays an arbitrary strategy (hence this actually describes several profiles). This profile is a Nash equilibrium. As argued above, on the equilibrium path play is a best response to a best response. However, playing Cournot would not have been the best response of the leader were it that the follower would play Stackelberg if it (the leader) played Stackelberg. In this case, the best response of the leader would be to play Stackelberg. Hence, what makes this profile (or rather, these profiles) a Nash equilibrium (or rather, Nash equilibria) is the fact that the follower would play non-Stackelberg if the leader were to play Stackelberg.

However, this very fact (that the follower would play non-Stackelberg if the leader were to play Stackelberg) means that this profile is not a Nash equilibrium of the subgame starting when the leader has already played Stackelberg (a subgame off the equilibrium path). If the leader has already played Stackelberg, the best response of the follower is to play Stackelberg (and therefore it is the only action that yields a Nash equilibrium in this subgame). Hence the strategy profile - which is Cournot - is not subgame perfect.

Comparison with other oligopoly models edit

In comparison with other oligopoly models,

  • The aggregate Stackelberg output is greater than the aggregate Cournot output, but less than the aggregate Bertrand output.
  • The Stackelberg price is lower than the Cournot price, but greater than the Bertrand price.
  • The Stackelberg consumer surplus is greater than the Cournot consumer surplus, but lower than the Bertrand consumer surplus.
  • The aggregate Stackelberg output is greater than pure monopoly or cartel, but less than the perfectly competitive output.
  • The Stackelberg price is lower than the pure monopoly or cartel price, but greater than the perfectly competitive price.

Applications edit

The Stackelberg concept has been extended to dynamic Stackelberg games.[1][2] With the addition of time as a dimension, phenomena not found in static games were discovered, such as violation of the principle of optimality by the leader.[2]

In recent years, Stackelberg games have been applied in the security domain.[3] In this context, the defender (leader) designs a strategy to protect a resource, such that the resource remains safe irrespective of the strategy adopted by the attacker (follower). Stackelberg differential games are also used to model supply chains and marketing channels.[4] Other applications of Stackelberg games include heterogeneous networks,[5] genetic privacy,[6][7] robotics,[8][9] autonomous driving,[10][11] electrical grids,[12][13] and integrated energy systems.[14]

See also edit

References edit

  1. ^ Simaan, M.; Cruz, J. B. (May 1973). "On the Stackelberg strategy in nonzero-sum games". Journal of Optimization Theory and Applications. 11 (5): 533–555. doi:10.1007/BF00935665. ISSN 0022-3239. S2CID 121400147.
  2. ^ a b Simaan, M.; Cruz, J. B. (June 1973). "Additional aspects of the Stackelberg strategy in nonzero-sum games". Journal of Optimization Theory and Applications. 11 (6): 613–626. doi:10.1007/BF00935561. ISSN 0022-3239.
  3. ^ Brown, Gerald (2006). "Defending critical infrastructure". Interfaces. 36 (6): 530–544. doi:10.1287/inte.1060.0252. hdl:10945/36732. S2CID 16223037.
  4. ^ He, Xiuli; Prasad, Ashutosh; Sethi, Suresh P.; Gutierrez, Genaro J. (December 2007). "A survey of Stackelberg differential game models in supply and marketing channels". Journal of Systems Science and Systems Engineering. 16 (4): 385–413. CiteSeerX 10.1.1.727.2952. doi:10.1007/s11518-007-5058-2. ISSN 1004-3756. S2CID 11443159.
  5. ^ Ghosh, Subha; De, Debashis (2021-04-28). "E²M³: energy-efficient massive MIMO–MISO 5G HetNet using Stackelberg game". The Journal of Supercomputing. 77 (11): 13549–13583. doi:10.1007/s11227-021-03809-1. ISSN 0920-8542. S2CID 235569547.
  6. ^ Wan, Zhiyu; Vorobeychik, Yevgeniy; Xia, Weiyi; Clayton, Ellen Wright; Kantarcioglu, Murat; Malin, Bradley (2017-02-02). "Expanding Access to Large-Scale Genomic Data While Promoting Privacy: A Game Theoretic Approach". The American Journal of Human Genetics. 100 (2): 316–322. doi:10.1016/j.ajhg.2016.12.002. ISSN 0002-9297. PMC 5294764. PMID 28065469.
  7. ^ Wan, Zhiyu; Vorobeychik, Yevgeniy; Xia, Weiyi; Liu, Yongtai; Wooders, Myrna; Guo, Jia; Yin, Zhijun; Clayton, Ellen Wright; Kantarcioglu, Murat; Malin, Bradley A. (2021). "Using game theory to thwart multistage privacy intrusions when sharing data". Science Advances. 7 (50): eabe9986. Bibcode:2021SciA....7.9986W. doi:10.1126/sciadv.abe9986. PMC 8664254. PMID 34890225.
  8. ^ Koh, Joewie J.; Ding, Guohui; Heckman, Christoffer; Chen, Lijun; Roncone, Alessandro (2020-10-24). "Cooperative Control of Mobile Robots with Stackelberg Learning". 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). Las Vegas, NV, USA: IEEE. pp. 7985–7992. arXiv:2008.00679. doi:10.1109/IROS45743.2020.9341376. ISBN 978-1-7281-6212-6. S2CID 220935562.
  9. ^ Ranjbar-Sahraei, Bijan; Stankova, Katerina; Tuyls, Karl; Weiss, Gerhard (2013-09-02). "Stackelberg-based Coverage Approach in Nonconvex Environments". Advances in Artificial Life, ECAL 2013. MIT Press: 462–469. CiteSeerX 10.1.1.650.4481. doi:10.7551/978-0-262-31709-2-ch066. ISBN 978-0-262-31709-2. S2CID 11668402.
  10. ^ Yoo, Jehong; Langari, Reza (2020). "A Stackelberg Game Theoretic Model of Lane-Merging". arXiv:2003.09786. {{cite journal}}: Cite journal requires |journal= (help)
  11. ^ Cooper, Matt; Lee, Jun Ki; Beck, Jacob; Fishman, Joshua D.; Gillett, Michael; Papakipos, Zoë; Zhang, Aaron; Ramos, Jerome; Shah, Aansh (2019), Salichs, Miguel A.; Ge, Shuzhi Sam; Barakova, Emilia Ivanova; Cabibihan, John-John (eds.), "Stackelberg Punishment and Bully-Proofing Autonomous Vehicles", Social Robotics, vol. 11876, Cham: Springer International Publishing, pp. 368–377, arXiv:1908.08641, doi:10.1007/978-3-030-35888-4_34, ISBN 978-3-030-35887-7, S2CID 201645147, retrieved 2021-05-03
  12. ^ Qiu, Haifeng; Gu, Wei; Wang, Lu; Pan, Guangsheng; Xu, Yinliang; Wu, Zhi (June 2021). "Trilayer Stackelberg Game Approach for Robustly Power Management in Community Grids". IEEE Transactions on Industrial Informatics. 17 (6): 4073–4083. doi:10.1109/TII.2020.3015733. ISSN 1551-3203. S2CID 226558914.
  13. ^ An, Lu; Chakrabortty, Aranya; Duel-Hallen, Alexandra (2020-12-14). "A Stackelberg Security Investment Game for Voltage Stability of Power Systems". 2020 59th IEEE Conference on Decision and Control (CDC). Jeju, Korea (South): IEEE. pp. 3359–3364. arXiv:2006.11665. doi:10.1109/CDC42340.2020.9304301. ISBN 978-1-7281-7447-1. S2CID 219965779.
  14. ^ Zheng, Weiye; Hill, David J. (2021-03-01). "Incentive-based coordination mechanism for distributed operation of integrated electricity and heat systems". Applied Energy. 285: 116373. doi:10.1016/j.apenergy.2020.116373. ISSN 0306-2619. S2CID 233833095.
  • H. von Stackelberg, Market Structure and Equilibrium: 1st Edition Translation into English, Bazin, Urch & Hill, Springer 2011, XIV, 134 p., ISBN 978-3-642-12585-0
  • Fudenberg, D. and Tirole, J. (1993) Game Theory, MIT Press. (see Chapter 3, sect 1)
  • Gibbons, R. (1992) A primer in game theory, Harvester-Wheatsheaf. (see Chapter 2, section 1B)
  • Osborne, M.J. and Rubenstein, A. (1994) A Course in Game Theory, MIT Press (see p 97-98)
  • Oligoply Theory made Simple, Chapter 6 of Surfing Economics by Huw Dixon.

stackelberg, competition, this, article, technical, most, readers, understand, please, help, improve, make, understandable, experts, without, removing, technical, details, january, 2022, learn, when, remove, this, template, message, stackelberg, leadership, mo. This article may be too technical for most readers to understand Please help improve it to make it understandable to non experts without removing the technical details January 2022 Learn how and when to remove this template message The Stackelberg leadership model is a strategic game in economics in which the leader firm moves first and then the follower firms move sequentially hence it is sometimes described as the leader follower game It is named after the German economist Heinrich Freiherr von Stackelberg who published Marktform und Gleichgewicht Market Structure and Equilibrium in 1934 which described the model In game theory terms the players of this game are a leader and a follower and they compete on quantity The Stackelberg leader is sometimes referred to as the Market Leader There are some further constraints upon the sustaining of a Stackelberg equilibrium The leader must know ex ante that the follower observes its action The follower must have no means of committing to a future non Stackelberg leader s action and the leader must know this Indeed if the follower could commit to a Stackelberg leader action and the leader knew this the leader s best response would be to play a Stackelberg follower action Firms may engage in Stackelberg competition if one has some sort of advantage enabling it to move first More generally the leader must have commitment power Moving observably first is the most obvious means of commitment once the leader has made its move it cannot undo it it is committed to that action Moving first may be possible if the leader was the incumbent monopoly of the industry and the follower is a new entrant Holding excess capacity is another means of commitment Contents 1 Subgame perfect Nash equilibrium 1 1 Examples 2 Economic analysis 3 Credible and non credible threats by the follower 4 Stackelberg compared with Cournot 4 1 Game theoretic considerations 5 Comparison with other oligopoly models 6 Applications 7 See also 8 ReferencesSubgame perfect Nash equilibrium editThe Stackelberg model can be solved to find the subgame perfect Nash equilibrium or equilibria SPNE i e the strategy profile that serves best each player given the strategies of the other player and that entails every player playing in a Nash equilibrium in every subgame In very general terms let the price function for the duopoly industry be P displaystyle P nbsp price is simply a function of total industry output so is P q1 q2 displaystyle P q 1 q 2 nbsp where the subscript 1 displaystyle 1 nbsp represents the leader and 2 displaystyle 2 nbsp represents the follower Suppose firm i displaystyle i nbsp has the cost structure Ci qi displaystyle C i q i nbsp The model is solved by backward induction The leader considers what the best response of the follower is i e how it will respond once it has observed the quantity of the leader The leader then picks a quantity that maximises its payoff anticipating the predicted response of the follower The follower actually observes this and in equilibrium picks the expected quantity as a response To calculate the SPNE the best response functions of the follower must first be calculated calculation moves backwards because of backward induction The profit of firm 2 displaystyle 2 nbsp the follower is revenue minus cost Revenue is the product of price and quantity and cost is given by the firm s cost structure so profit is P2 P q1 q2 q2 C2 q2 displaystyle Pi 2 P q 1 q 2 cdot q 2 C 2 q 2 nbsp The best response is to find the value of q2 displaystyle q 2 nbsp that maximises P2 displaystyle Pi 2 nbsp given q1 displaystyle q 1 nbsp i e given the output of the leader firm 1 displaystyle 1 nbsp the output that maximises the follower s profit is found Hence the maximum of P2 displaystyle Pi 2 nbsp with respect to q2 displaystyle q 2 nbsp is to be found First differentiate P2 displaystyle Pi 2 nbsp with respect to q2 displaystyle q 2 nbsp P2 q2 P q1 q2 q2 q2 P q1 q2 C2 q2 q2 displaystyle frac partial Pi 2 partial q 2 frac partial P q 1 q 2 partial q 2 cdot q 2 P q 1 q 2 frac partial C 2 q 2 partial q 2 nbsp Setting this to zero for maximisation P2 q2 P q1 q2 q2 q2 P q1 q2 C2 q2 q2 0 displaystyle frac partial Pi 2 partial q 2 frac partial P q 1 q 2 partial q 2 cdot q 2 P q 1 q 2 frac partial C 2 q 2 partial q 2 0 nbsp The values of q2 displaystyle q 2 nbsp that satisfy this equation are the best responses Now the best response function of the leader is considered This function is calculated by considering the follower s output as a function of the leader s output as just computed The profit of firm 1 displaystyle 1 nbsp the leader is P1 P q1 q2 q1 q1 C1 q1 displaystyle Pi 1 P q 1 q 2 q 1 q 1 C 1 q 1 nbsp where q2 q1 displaystyle q 2 q 1 nbsp is the follower s quantity as a function of the leader s quantity namely the function calculated above The best response is to find the value of q1 displaystyle q 1 nbsp that maximises P1 displaystyle Pi 1 nbsp given q2 q1 displaystyle q 2 q 1 nbsp i e given the best response function of the follower firm 2 displaystyle 2 nbsp the output that maximises the leader s profit is found Hence the maximum of P1 displaystyle Pi 1 nbsp with respect to q1 displaystyle q 1 nbsp is to be found First differentiate P1 displaystyle Pi 1 nbsp with respect to q1 displaystyle q 1 nbsp P1 q1 P q1 q2 q2 q2 q1 q1 q1 P q1 q2 q1 q1 P q1 q2 q1 C1 q1 q1 displaystyle frac partial Pi 1 partial q 1 frac partial P q 1 q 2 partial q 2 cdot frac partial q 2 q 1 partial q 1 cdot q 1 frac partial P q 1 q 2 partial q 1 cdot q 1 P q 1 q 2 q 1 frac partial C 1 q 1 partial q 1 nbsp Setting this to zero for maximisation P1 q1 P q1 q2 q2 q2 q1 q1 q1 P q1 q2 q1 q1 P q1 q2 q1 C1 q1 q1 0 displaystyle frac partial Pi 1 partial q 1 frac partial P q 1 q 2 partial q 2 cdot frac partial q 2 q 1 partial q 1 cdot q 1 frac partial P q 1 q 2 partial q 1 cdot q 1 P q 1 q 2 q 1 frac partial C 1 q 1 partial q 1 0 nbsp Examples edit The following example is very general It assumes a generalised linear demand structure p q1 q2 a b q1 q2 displaystyle p q 1 q 2 bigg a b q 1 q 2 bigg nbsp and imposes some restrictions on cost structures for simplicity s sake so the problem can be resolved 2Ci qi qi qj 0 j displaystyle frac partial 2 C i q i partial q i cdot partial q j 0 forall j nbsp and Ci qi qj 0 j i displaystyle frac partial C i q i partial q j 0 j neq i nbsp for ease of computation The follower s profit is p2 a b q1 q2 q2 C2 q2 displaystyle pi 2 bigg a b q 1 q 2 bigg cdot q 2 C 2 q 2 nbsp The maximisation problem resolves to from the general case a b q1 q2 q2 q2 a b q1 q2 C2 q2 q2 0 displaystyle frac partial bigg a b q 1 q 2 bigg partial q 2 cdot q 2 a b q 1 q 2 frac partial C 2 q 2 partial q 2 0 nbsp bq2 a b q1 q2 C2 q2 q2 0 displaystyle Rightarrow bq 2 a b q 1 q 2 frac partial C 2 q 2 partial q 2 0 nbsp q2 a bq1 C2 q2 q22b displaystyle Rightarrow q 2 frac a bq 1 frac partial C 2 q 2 partial q 2 2b nbsp Consider the leader s problem P1 a b q1 q2 q1 q1 C1 q1 displaystyle Pi 1 bigg a b q 1 q 2 q 1 bigg cdot q 1 C 1 q 1 nbsp Substituting for q2 q1 displaystyle q 2 q 1 nbsp from the follower s problem P1 a b q1 a bq1 C2 q2 q22b q1 C1 q1 displaystyle Pi 1 bigg a b bigg q 1 frac a bq 1 frac partial C 2 q 2 partial q 2 2b bigg bigg cdot q 1 C 1 q 1 nbsp P1 a b q1 C2 q2 q22 q1 C1 q1 displaystyle Rightarrow Pi 1 bigg frac a b q 1 frac partial C 2 q 2 partial q 2 2 bigg cdot q 1 C 1 q 1 nbsp The maximisation problem resolves to from the general case p1 q1 a 2bq1 C2 q2 q22 C1 q1 q1 0 displaystyle frac partial pi 1 partial q 1 bigg frac a 2bq 1 frac partial C 2 q 2 partial q 2 2 bigg frac partial C 1 q 1 partial q 1 0 nbsp Now solving for q1 displaystyle q 1 nbsp yields q1 displaystyle q 1 nbsp the leader s optimal action q1 a C2 q2 q2 2 C1 q1 q12b displaystyle q 1 frac a frac partial C 2 q 2 partial q 2 2 cdot frac partial C 1 q 1 partial q 1 2b nbsp This is the leader s best response to the reaction of the follower in equilibrium The follower s actual can now be found by feeding this into its reaction function calculated earlier q2 a b a C2 q2 q2 2 C1 q1 q12b C2 q2 q22b displaystyle q 2 frac a b cdot frac a frac partial C 2 q 2 partial q 2 2 cdot frac partial C 1 q 1 partial q 1 2b frac partial C 2 q 2 partial q 2 2b nbsp q2 a 3 C2 q2 q2 2 C1 q1 q14b displaystyle Rightarrow q 2 frac a 3 cdot frac partial C 2 q 2 partial q 2 2 cdot frac partial C 1 q 1 partial q 1 4b nbsp The Nash equilibria are all q1 q2 displaystyle q 1 q 2 nbsp It is clear if marginal costs are assumed to be zero i e cost is essentially ignored that the leader has a significant advantage Intuitively if the leader was no better off than the follower it would simply adopt a Cournot competition strategy Plugging the follower s quantity q2 displaystyle q 2 nbsp back into the leader s best response function will not yield q1 displaystyle q 1 nbsp This is because once leader has committed to an output and observed the followers it always wants to reduce its output ex post However its inability to do so is what allows it to receive higher profits than under Cournot Economic analysis editAn extensive form representation is often used to analyze the Stackelberg leader follower model Also referred to as a decision tree the model shows the combination of outputs and payoffs both firms have in the Stackelberg game nbsp A Stackelberg game represented in extensive formThe image on the left depicts in extensive form a Stackelberg game The payoffs are shown on the right This example is fairly simple There is a basic cost structure involving only marginal cost there is no fixed cost The demand function is linear and price elasticity of demand is 1 However it illustrates the leader s advantage The follower wants to choose q2 displaystyle q 2 nbsp to maximise its payoff q2 5000 q1 q2 c2 displaystyle q 2 times 5000 q 1 q 2 c 2 nbsp Taking the first order derivative and equating it to zero for maximisation yields q2 5000 q1 c22 displaystyle q 2 frac 5000 q 1 c 2 2 nbsp as the maximum value of q2 displaystyle q 2 nbsp The leader wants to choose q1 displaystyle q 1 nbsp to maximise its payoff q1 5000 q1 q2 c1 displaystyle q 1 times 5000 q 1 q 2 c 1 nbsp However in equilibrium it knows the follower will choose q2 displaystyle q 2 nbsp as above So in fact the leader wants to maximise its payoff q1 5000 q1 5000 q1 c22 c1 displaystyle q 1 times 5000 q 1 frac 5000 q 1 c 2 2 c 1 nbsp by substituting q2 displaystyle q 2 nbsp for the follower s best response function By differentiation the maximum payoff is given by q1 5000 2c1 c22 displaystyle q 1 frac 5000 2c 1 c 2 2 nbsp Feeding this into the follower s best response function yields q2 5000 2c1 3c24 displaystyle q 2 frac 5000 2c 1 3c 2 4 nbsp Suppose marginal costs were equal for the firms so the leader has no market advantage other than first move and in particular c1 c2 1000 displaystyle c 1 c 2 1000 nbsp The leader would produce 2000 and the follower would produce 1000 This would give the leader a profit payoff of two million and the follower a profit of one million Simply by moving first the leader has accrued twice the profit of the follower However Cournot profits here are 1 78 million apiece strictly 16 9 106 displaystyle 16 9 10 6 nbsp apiece so the leader has not gained much but the follower has lost However this is example specific There may be cases where a Stackelberg leader has huge gains beyond Cournot profit that approach monopoly profits for example if the leader also had a large cost structure advantage perhaps due to a better production function There may also be cases where the follower actually enjoys higher profits than the leader but only because it say has much lower costs This behaviour consistently work on duopoly markets even if the firms are asymmetrical Credible and non credible threats by the follower editIf after the leader had selected its equilibrium quantity the follower deviated from the equilibrium and chose some non optimal quantity it would not only hurt itself but it could also hurt the leader If the follower chose a much larger quantity than its best response the market price would lower and the leader s profits would be stung perhaps below Cournot level profits In this case the follower could announce to the leader before the game starts that unless the leader chooses a Cournot equilibrium quantity the follower will choose a deviant quantity that will hit the leader s profits After all the quantity chosen by the leader in equilibrium is only optimal if the follower also plays in equilibrium The leader is however in no danger Once the leader has chosen its equilibrium quantity it would be irrational for the follower to deviate because it too would be hurt Once the leader has chosen the follower is better off by playing on the equilibrium path Hence such a threat by the follower would not be credible However in an indefinitely repeated Stackelberg game the follower might adopt a punishment strategy where it threatens to punish the leader in the next period unless it chooses a non optimal strategy in the current period This threat may be credible because it could be rational for the follower to punish in the next period so that the leader chooses Cournot quantities thereafter Stackelberg compared with Cournot editThe Stackelberg and Cournot models are similar because in both competition is on quantity However as seen the first move gives the leader in Stackelberg a crucial advantage There is also the important assumption of perfect information in the Stackelberg game the follower must observe the quantity chosen by the leader otherwise the game reduces to Cournot With imperfect information the threats described above can be credible If the follower cannot observe the leader s move it is no longer irrational for the follower to choose say a Cournot level of quantity in fact that is the equilibrium action However it must be that there is imperfect information and the follower is unable to observe the leader s move because it is irrational for the follower not to observe if it can once the leader has moved If it can observe it will so that it can make the optimal decision Any threat by the follower claiming that it will not observe even if it can is as uncredible as those above This is an example of too much information hurting a player In Cournot competition it is the simultaneity of the game the imperfection of knowledge that results in neither player ceteris paribus being at a disadvantage Game theoretic considerations edit As mentioned imperfect information in a leadership game reduces to Cournot competition However some Cournot strategy profiles are sustained as Nash equilibria but can be eliminated as incredible threats as described above by applying the solution concept of subgame perfection Indeed it is the very thing that makes a Cournot strategy profile a Nash equilibrium in a Stackelberg game that prevents it from being subgame perfect Consider a Stackelberg game i e one which fulfills the requirements described above for sustaining a Stackelberg equilibrium in which for some reason the leader believes that whatever action it takes the follower will choose a Cournot quantity perhaps the leader believes that the follower is irrational If the leader played a Stackelberg action it believes that the follower will play Cournot Hence it is non optimal for the leader to play Stackelberg In fact its best response by the definition of Cournot equilibrium is to play Cournot quantity Once it has done this the best response of the follower is to play Cournot Consider the following strategy profiles the leader plays Cournot the follower plays Cournot if the leader plays Cournot and the follower plays Stackelberg if the leader plays Stackelberg and if the leader plays something else the follower plays an arbitrary strategy hence this actually describes several profiles This profile is a Nash equilibrium As argued above on the equilibrium path play is a best response to a best response However playing Cournot would not have been the best response of the leader were it that the follower would play Stackelberg if it the leader played Stackelberg In this case the best response of the leader would be to play Stackelberg Hence what makes this profile or rather these profiles a Nash equilibrium or rather Nash equilibria is the fact that the follower would play non Stackelberg if the leader were to play Stackelberg However this very fact that the follower would play non Stackelberg if the leader were to play Stackelberg means that this profile is not a Nash equilibrium of the subgame starting when the leader has already played Stackelberg a subgame off the equilibrium path If the leader has already played Stackelberg the best response of the follower is to play Stackelberg and therefore it is the only action that yields a Nash equilibrium in this subgame Hence the strategy profile which is Cournot is not subgame perfect Comparison with other oligopoly models editIn comparison with other oligopoly models The aggregate Stackelberg output is greater than the aggregate Cournot output but less than the aggregate Bertrand output The Stackelberg price is lower than the Cournot price but greater than the Bertrand price The Stackelberg consumer surplus is greater than the Cournot consumer surplus but lower than the Bertrand consumer surplus The aggregate Stackelberg output is greater than pure monopoly or cartel but less than the perfectly competitive output The Stackelberg price is lower than the pure monopoly or cartel price but greater than the perfectly competitive price Applications editThe Stackelberg concept has been extended to dynamic Stackelberg games 1 2 With the addition of time as a dimension phenomena not found in static games were discovered such as violation of the principle of optimality by the leader 2 In recent years Stackelberg games have been applied in the security domain 3 In this context the defender leader designs a strategy to protect a resource such that the resource remains safe irrespective of the strategy adopted by the attacker follower Stackelberg differential games are also used to model supply chains and marketing channels 4 Other applications of Stackelberg games include heterogeneous networks 5 genetic privacy 6 7 robotics 8 9 autonomous driving 10 11 electrical grids 12 13 and integrated energy systems 14 See also editEconomic theory Cournot competition Bertrand competition Extensive form game Industrial organization Mathematical programming with equilibrium constraintsReferences edit Simaan M Cruz J B May 1973 On the Stackelberg strategy in nonzero sum games Journal of Optimization Theory and Applications 11 5 533 555 doi 10 1007 BF00935665 ISSN 0022 3239 S2CID 121400147 a b Simaan M Cruz J B June 1973 Additional aspects of the Stackelberg strategy in nonzero sum games Journal of Optimization Theory and Applications 11 6 613 626 doi 10 1007 BF00935561 ISSN 0022 3239 Brown Gerald 2006 Defending critical infrastructure Interfaces 36 6 530 544 doi 10 1287 inte 1060 0252 hdl 10945 36732 S2CID 16223037 He Xiuli Prasad Ashutosh Sethi Suresh P Gutierrez Genaro J December 2007 A survey of Stackelberg differential game models in supply and marketing channels Journal of Systems Science and Systems Engineering 16 4 385 413 CiteSeerX 10 1 1 727 2952 doi 10 1007 s11518 007 5058 2 ISSN 1004 3756 S2CID 11443159 Ghosh Subha De Debashis 2021 04 28 E M energy efficient massive MIMO MISO 5G HetNet using Stackelberg game The Journal of Supercomputing 77 11 13549 13583 doi 10 1007 s11227 021 03809 1 ISSN 0920 8542 S2CID 235569547 Wan Zhiyu Vorobeychik Yevgeniy Xia Weiyi Clayton Ellen Wright Kantarcioglu Murat Malin Bradley 2017 02 02 Expanding Access to Large Scale Genomic Data While Promoting Privacy A Game Theoretic Approach The American Journal of Human Genetics 100 2 316 322 doi 10 1016 j ajhg 2016 12 002 ISSN 0002 9297 PMC 5294764 PMID 28065469 Wan Zhiyu Vorobeychik Yevgeniy Xia Weiyi Liu Yongtai Wooders Myrna Guo Jia Yin Zhijun Clayton Ellen Wright Kantarcioglu Murat Malin Bradley A 2021 Using game theory to thwart multistage privacy intrusions when sharing data Science Advances 7 50 eabe9986 Bibcode 2021SciA 7 9986W doi 10 1126 sciadv abe9986 PMC 8664254 PMID 34890225 Koh Joewie J Ding Guohui Heckman Christoffer Chen Lijun Roncone Alessandro 2020 10 24 Cooperative Control of Mobile Robots with Stackelberg Learning 2020 IEEE RSJ International Conference on Intelligent Robots and Systems IROS Las Vegas NV USA IEEE pp 7985 7992 arXiv 2008 00679 doi 10 1109 IROS45743 2020 9341376 ISBN 978 1 7281 6212 6 S2CID 220935562 Ranjbar Sahraei Bijan Stankova Katerina Tuyls Karl Weiss Gerhard 2013 09 02 Stackelberg based Coverage Approach in Nonconvex Environments Advances in Artificial Life ECAL 2013 MIT Press 462 469 CiteSeerX 10 1 1 650 4481 doi 10 7551 978 0 262 31709 2 ch066 ISBN 978 0 262 31709 2 S2CID 11668402 Yoo Jehong Langari Reza 2020 A Stackelberg Game Theoretic Model of Lane Merging arXiv 2003 09786 a href Template Cite journal html title Template Cite journal cite journal a Cite journal requires journal help Cooper Matt Lee Jun Ki Beck Jacob Fishman Joshua D Gillett Michael Papakipos Zoe Zhang Aaron Ramos Jerome Shah Aansh 2019 Salichs Miguel A Ge Shuzhi Sam Barakova Emilia Ivanova Cabibihan John John eds Stackelberg Punishment and Bully Proofing Autonomous Vehicles Social Robotics vol 11876 Cham Springer International Publishing pp 368 377 arXiv 1908 08641 doi 10 1007 978 3 030 35888 4 34 ISBN 978 3 030 35887 7 S2CID 201645147 retrieved 2021 05 03 Qiu Haifeng Gu Wei Wang Lu Pan Guangsheng Xu Yinliang Wu Zhi June 2021 Trilayer Stackelberg Game Approach for Robustly Power Management in Community Grids IEEE Transactions on Industrial Informatics 17 6 4073 4083 doi 10 1109 TII 2020 3015733 ISSN 1551 3203 S2CID 226558914 An Lu Chakrabortty Aranya Duel Hallen Alexandra 2020 12 14 A Stackelberg Security Investment Game for Voltage Stability of Power Systems 2020 59th IEEE Conference on Decision and Control CDC Jeju Korea South IEEE pp 3359 3364 arXiv 2006 11665 doi 10 1109 CDC42340 2020 9304301 ISBN 978 1 7281 7447 1 S2CID 219965779 Zheng Weiye Hill David J 2021 03 01 Incentive based coordination mechanism for distributed operation of integrated electricity and heat systems Applied Energy 285 116373 doi 10 1016 j apenergy 2020 116373 ISSN 0306 2619 S2CID 233833095 H von Stackelberg Market Structure and Equilibrium 1st Edition Translation into English Bazin Urch amp Hill Springer 2011 XIV 134 p ISBN 978 3 642 12585 0 Fudenberg D and Tirole J 1993 Game Theory MIT Press see Chapter 3 sect 1 Gibbons R 1992 A primer in game theory Harvester Wheatsheaf see Chapter 2 section 1B Osborne M J and Rubenstein A 1994 A Course in Game Theory MIT Press see p 97 98 Oligoply Theory made Simple Chapter 6 of Surfing Economics by Huw Dixon Retrieved from https en wikipedia org w index php title Stackelberg competition amp oldid 1180090126, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.