# 纳什均衡

{{Infobox equilibrium

{{Infobox equilibrium

}}

}}

In game theory, the Nash equilibrium, named after the mathematician John Forbes Nash Jr., is a proposed solution of a non-cooperative game involving two or more players in which each player is assumed to know the equilibrium strategies of the other players, and no player has anything to gain by changing only their own strategy.

In game theory, the Nash equilibrium, named after the mathematician John Forbes Nash Jr., is a proposed solution of a non-cooperative game involving two or more players in which each player is assumed to know the equilibrium strategies of the other players, and no player has anything to gain by changing only their own strategy.

If each player has chosen a strategy—an action plan choosing its own action based on what it has seen happen so far in the game—and no player can increase its own expected payoff by changing its strategy while the other players keep theirs unchanged, then the current set of strategy choices constitutes a Nash equilibrium.

If two players Alice and Bob choose strategies A and B, (A, B) is a Nash equilibrium if Alice has no other strategy available that does better than A at maximizing her payoff in response to Bob choosing B, and Bob has no other strategy available that does better than B at maximizing his payoff in response to Alice choosing A. In a game in which Carol and Dan are also players, (A, B, C, D) is a Nash equilibrium if A is Alice's best response to (B, C, D), B is Bob's best response to (A, C, D), and so forth.

Nash showed that there is a Nash equilibrium for every finite game: see further the article on strategy.

## 应用

Game theorists use Nash equilibrium to analyze the outcome of the strategic interaction of several decision makers. In a strategic interaction, the outcome for each decision-maker depends on the decisions of the others as well as their own. The simple insight underlying Nash's idea is that one cannot predict the choices of multiple decision makers if one analyzes those decisions in isolation. Instead, one must ask what each player would do taking into account what she/he expects the others to do. Nash equilibrium requires that their choices be consistent: no player wishes to undo their decision given what the others are deciding.

The concept has been used to analyze hostile situations like wars and arms races (see prisoner's dilemma), and also how conflict may be mitigated by repeated interaction (see tit-for-tat). It has also been used to study to what extent people with different preferences can cooperate (see battle of the sexes), and whether they will take risks to achieve a cooperative outcome (see stag hunt). It has been used to study the adoption of technical standards,[citation needed] and also the occurrence of bank runs and currency crises (see coordination game). Other applications include traffic flow (see Wardrop's principle), how to organize auctions (see auction theory), the outcome of efforts exerted by multiple parties in the education process, regulatory legislation such as environmental regulations (see tragedy of the commons), natural resource management, analysing strategies in marketing, even penalty kicks in football (see matching pennies), energy systems, transportation systems, evacuation problems and wireless communications.

## 历史

Nash equilibrium is named after American mathematician John Forbes Nash, Jr. The same idea was used in a particular application in 1838 by Antoine Augustin Cournot in his theory of oligopoly. In Cournot's theory, each of several firms choose how much output to produce to maximize its profit. The best output for one firm depends on the outputs of the others. A Cournot equilibrium occurs when each firm's output maximizes its profits given the output of the other firms, which is a pure-strategy Nash equilibrium. Cournot also introduced the concept of best response dynamics in his analysis of the stability of equilibrium. Cournot did not use the idea in any other applications, however, or define it generally.

The modern game-theoretic concept of Nash equilibrium is instead defined in terms of mixed strategies, where players choose a probability distribution over possible actions (rather than choosing a deterministic action to be played with certainty). The concept of a mixed-strategy equilibrium was introduced by John von Neumann and Oskar Morgenstern in their 1944 book The Theory of Games and Economic Behavior. However, their analysis was restricted to the special case of zero-sum games. They showed that a mixed-strategy Nash equilibrium will exist for any zero-sum game with a finite set of actions. The contribution of Nash in his 1951 article "Non-Cooperative Games" was to define a mixed-strategy Nash equilibrium for any game with a finite set of actions and prove that at least one (mixed-strategy) Nash equilibrium must exist in such a game. The key to Nash's ability to prove existence far more generally than von Neumann lay in his definition of equilibrium. According to Nash, "an equilibrium point is an n-tuple such that each player's mixed strategy maximizes his payoff if the strategies of the others are held fixed. Thus each player's strategy is optimal against those of the others." Just putting the problem in this framework allowed Nash to employ the Kakutani fixed-point theorem in his 1950 paper, and a variant upon it in his 1951 paper used the Brouwer fixed-point theorem to prove that there had to exist at least one mixed strategy profile that mapped back into itself for finite-player (not necessarily zero-sum) games; namely, a strategy profile that did not call for a shift in strategies that could improve payoffs.

Since the development of the Nash equilibrium concept, game theorists have discovered that it makes misleading predictions (or fails to make a unique prediction) in certain circumstances. They have proposed many related solution concepts (also called 'refinements' of Nash equilibria) designed to overcome perceived flaws in the Nash concept. One particularly important issue is that some Nash equilibria may be based on threats that are not 'credible'. In 1965 Reinhard Selten proposed subgame perfect equilibrium as a refinement that eliminates equilibria which depend on non-credible threats. Other extensions of the Nash equilibrium concept have addressed what happens if a game is repeated, or what happens if a game is played in the absence of complete information. However, subsequent refinements and extensions of Nash equilibrium share the main insight on which Nash's concept rests: the equilibrium is a set of strategies such that each player's strategy is optimal given the choices of the others.

## 定义

### 纳什均衡

Informally, a strategy profile is a Nash equilibrium if no player can do better by unilaterally changing their strategy. To see what this means, imagine that each player is told the strategies of the others. Suppose then that each player asks themselves: "Knowing the strategies of the other players, and treating the strategies of the other players as set in stone, can I benefit by changing my strategy?"

If any player could answer "Yes", then that set of strategies is not a Nash equilibrium. But if every player prefers not to switch (or is indifferent between switching and not) then the strategy profile is a Nash equilibrium. Thus, each strategy in a Nash equilibrium is a best response to all other strategies in that equilibrium.

The Nash equilibrium may sometimes appear non-rational in a third-person perspective. This is because a Nash equilibrium is not necessarily Pareto optimal.

The Nash equilibrium may also have non-rational consequences in sequential games because players may "threaten" each other with non-rational moves. For such games the subgame perfect Nash equilibrium may be more meaningful as a tool of analysis.

### 严格/弱均衡

Suppose that in the Nash equilibrium, each player asks themselves: "Knowing the strategies of the other players, and treating the strategies of the other players as set in stone, would I suffer a loss by changing my strategy?"

If every player's answer is "Yes", then the equilibrium is classified as a strict Nash equilibrium.

If instead, for some player, there is exact equality between the strategy in Nash equilibrium and some other strategy that gives exactly the same payout (i.e. this player is indifferent between switching and not), then the equilibrium is classified as a weak Nash equilibrium.

A game can have a pure-strategy or a mixed-strategy Nash equilibrium. (In the latter a pure strategy is chosen stochastically with a fixed probability).

### 纳什均衡点的存在性定理

Nash proved that if we allow mixed strategies (where a player chooses probabilities of using various pure strategies), then every game with a finite number of players in which each player can choose from finitely many pure strategies has at least one Nash equilibrium (which might be a pure strategy for each player or might be a probability distribution over strategies for each player).

Nash equilibria need not exist if the set of choices is infinite and noncompact. An example is a game where two players simultaneously name a number and the player naming the larger number wins. Another example is where each of two players chooses a real number strictly less than 5 and the winner is whoever has the biggest number; no biggest number strictly less than 5 exists (if the number could equal 5, the Nash equilibrium would have both players choosing 5 and tying the game). However, a Nash equilibrium exists if the set of choices is compact with each player's payoff continuous in the strategies of all the players. An example, in which the equilibrium is a mixture of continuously many pure strategies, is a game where two players simultaneously pick a real number between 0 and 1 (inclusive) and player one's winnings (paid by the second player) equal the square root of the distance between the two numbers.

## 示例

### 协调博弈

{ | align = right border = “1” cellpadding = “4” cellspacing = “0” style = “ margin: 1em 1em 1em 1em 1em; background: # f9f9f9; border: 1px # solid; border-collapse: collapse; font-size: 95% ; class = “ wikable”

The coordination game is a classic (symmetric) two player, two strategy game, with an example payoff matrix shown to the right. The players should thus coordinate, both adopting strategy A, to receive the highest payoff; i.e., 4. If both players chose strategy B though, there is still a Nash equilibrium. Although each player is awarded less than optimal payoff, neither player has incentive to change strategy due to a reduction in the immediate payoff (from 2 to 1).

A famous example of this type of game was called the stag hunt; in the game two players may choose to hunt a stag or a rabbit, the former providing more meat (4 utility units) than the latter (1 utility unit). The caveat is that the stag must be cooperatively hunted, so if one player attempts to hunt the stag, while the other hunts the rabbit, she/he will fail in hunting (0 utility units), whereas if they both hunt it they will split the payoff (2, 2). The game hence exhibits two equilibria at (stag, stag) and (rabbit, rabbit) and hence the players' optimal strategy depend on their expectation on what the other player may do. If one hunter trusts that the other will hunt the stag, they should hunt the stag; however if they suspect that the other will hunt the rabbit, they should hunt the rabbit. This game was used as an analogy for social cooperation, since much of the benefit that people gain in society depends upon people cooperating and implicitly trusting one another to act in a manner corresponding with cooperation.

Another example of a coordination game is the setting where two technologies are available to two firms with comparable products, and they have to elect a strategy to become the market standard. If both firms agree on the chosen technology, high sales are expected for both firms. If the firms do not agree on the standard technology, few sales result. Both strategies are Nash equilibria of the game.

Driving on a road against an oncoming car, and having to choose either to swerve on the left or to swerve on the right of the road, is also a coordination game. For example, with payoffs 10 meaning no crash and 0 meaning a crash, the coordination game can be defined with the following payoff matrix:

{ | align = left border = “1” cellpadding = “4” cellspacing = “0” style = “ margin: 1em 1em 1em 1em 1em; background: # f9f9f9; border: 1px # solid; border-collapse: collapse; font-size: 95% ” class = “ wikable”

In this case there are two pure-strategy Nash equilibria, when both choose to either drive on the left or on the right. If we admit mixed strategies (where a pure strategy is chosen at random, subject to some fixed probability), then there are three Nash equilibria for the same case: two we have seen from the pure-strategy form, where the probabilities are (0%, 100%) for player one, (0%, 100%) for player two; and (100%, 0%) for player one, (100%, 0%) for player two respectively. We add another where the probabilities for each player are (50%, 50%).

### 囚徒困境

{ | align = right border = “1” cellpadding = “4” cellspacing = “0” style = “ margin: 1em 1em 1em 1em 1em; background: # f9f9f9; border: 1px # solid; border-collapse: collapse; font-size: 95% ; text-align: center = “ wikable”
 模板:Diagonal split header Cooperate (with other) 合作 Cooperate (with other) 示例 PD 支付矩阵 −1, −1 −1, −1 −1, −1 −3, 0 −3, 0 −3, 0 0, −3 0, −3 0, −3 −2, −2 −2, −2 −2, −2

Imagine two prisoners held in separate cells, interrogated simultaneously, and offered deals (lighter jail sentences) for betraying their fellow criminal. They can "cooperate" (with the other prisoner) by not snitching, or "defect" by betraying the other. However, there is a catch; if both players defect, then they both serve a longer sentence than if neither said anything. Lower jail sentences are interpreted as higher payoffs (shown in the table).

The prisoner's dilemma has a similar matrix as depicted for the coordination game, but the maximum reward for each player (in this case, a minimum loss of 0) is obtained only when the players' decisions are different. Each player improves their own situation by switching from "cooperating" to "defecting", given knowledge that the other player's best decision is to "defect". The prisoner's dilemma thus has a single Nash equilibrium: both players choosing to defect.

What has long made this an interesting case to study is the fact that this scenario is globally inferior to "both cooperating". That is, both players would be better off if they both chose to "cooperate" instead of both choosing to defect. However, each player could improve their own situation by breaking the mutual cooperation, no matter how the other player possibly (or certainly) changes their decision.

### Network traffic 网络流量

Sample network graph. Values on edges are the travel time experienced by a 'car' traveling down that edge. is the number of cars traveling via that edge.

Sample network graph. Values on edges are the travel time experienced by a 'car' traveling down that edge. x is the number of cars traveling via that edge.

An application of Nash equilibria is in determining the expected flow of traffic in a network. Consider the graph on the right. If we assume that there are "cars" traveling from A to D, what is the expected distribution of traffic in the network?

An application of Nash equilibria is in determining the expected flow of traffic in a network. Consider the graph on the right. If we assume that there are x "cars" traveling from A to D, what is the expected distribution of traffic in the network?

This situation can be modeled as a "game" where every traveler has a choice of 3 strategies, where each strategy is a route from A to D (either , , or ). The "payoff" of each strategy is the travel time of each route. In the graph on the right, a car travelling via experiences travel time of , where is the number of cars traveling on edge . Thus, payoffs for any given strategy depend on the choices of the other players, as is usual. However, the goal, in this case, is to minimize travel time, not maximize it. Equilibrium will occur when the time on all paths is exactly the same. When that happens, no single driver has any incentive to switch routes, since it can only add to their travel time. For the graph on the right, if, for example, 100 cars are travelling from A to D, then equilibrium will occur when 25 drivers travel via , 50 via , and 25 via . Every driver now has a total travel time of 3.75 (to see this, note that a total of 75 cars take the edge, and likewise, 75 cars take the edge).

This situation can be modeled as a "game" where every traveler has a choice of 3 strategies, where each strategy is a route from A to D (either ABD, ABCD, or ACD). The "payoff" of each strategy is the travel time of each route. In the graph on the right, a car travelling via ABD experiences travel time of (1+x/100)+2, where x is the number of cars traveling on edge AB. Thus, payoffs for any given strategy depend on the choices of the other players, as is usual. However, the goal, in this case, is to minimize travel time, not maximize it. Equilibrium will occur when the time on all paths is exactly the same. When that happens, no single driver has any incentive to switch routes, since it can only add to their travel time. For the graph on the right, if, for example, 100 cars are travelling from A to D, then equilibrium will occur when 25 drivers travel via ABD, 50 via ABCD, and 25 via ACD. Every driver now has a total travel time of 3.75 (to see this, note that a total of 75 cars take the AB edge, and likewise, 75 cars take the CD edge).

Notice that this distribution is not, actually, socially optimal. If the 100 cars agreed that 50 travel via and the other 50 through , then travel time for any single car would actually be 3.5, which is less than 3.75. This is also the Nash equilibrium if the path between B and C is removed, which means that adding another possible route can decrease the efficiency of the system, a phenomenon known as Braess's paradox.

Notice that this distribution is not, actually, socially optimal. If the 100 cars agreed that 50 travel via ABD and the other 50 through ACD, then travel time for any single car would actually be 3.5, which is less than 3.75. This is also the Nash equilibrium if the path between B and C is removed, which means that adding another possible route can decrease the efficiency of the system, a phenomenon known as Braess's paradox.

### Competition game 竞赛游戏

{ | align = right border = “1” cellpadding = “1” cellspacing = “0” style = “ margin: 1em 1em 1em 1em 1em; background: # f9f9f9; border: 1px # solid; border-collapse: collapse; font-size: 95% ; class = “ wikable”
 模板:Diagonal split header Choose '0' 选择「0」 Choose '0' Choose '1' 选择「1」 Choose '1' Choose '2' 选择「2」 Choose '2' a competition game 0, 0 0,0 0, 0 2, −2 2, −2 2, −2 2, −2 2, −2 2, −2 2, −2 2, −2 2, −2 −2, 2 −2, 2 −2, 2 1, 1 1,1 1, 1 3, −1 3, −1 3, −1 3, −1 3, −1 3, −1 −2, 2 −2, 2 −2, 2 −1, 3 −1, 3 −1, 3 2, 2 2,2 2, 2 4, 0 4,0 4, 0 −2, 2 −2, 2 −2, 2 −1, 3 −1, 3 −1, 3 0, 4 0,4 0, 4 3, 3 3,3 3, 3

This can be illustrated by a two-player game in which both players simultaneously choose an integer from 0 to 3 and they both win the smaller of the two numbers in points. In addition, if one player chooses a larger number than the other, then they have to give up two points to the other.

This can be illustrated by a two-player game in which both players simultaneously choose an integer from 0 to 3 and they both win the smaller of the two numbers in points. In addition, if one player chooses a larger number than the other, then they have to give up two points to the other.

This game has a unique pure-strategy Nash equilibrium: both players choosing 0 (highlighted in light red). Any other strategy can be improved by a player switching their number to one less than that of the other player. In the adjacent table, if the game begins at the green square, it is in player 1's interest to move to the purple square and it is in player 2's interest to move to the blue square. Although it would not fit the definition of a competition game, if the game is modified so that the two players win the named amount if they both choose the same number, and otherwise win nothing, then there are 4 Nash equilibria: (0,0), (1,1), (2,2), and (3,3).

This game has a unique pure-strategy Nash equilibrium: both players choosing 0 (highlighted in light red). Any other strategy can be improved by a player switching their number to one less than that of the other player. In the adjacent table, if the game begins at the green square, it is in player 1's interest to move to the purple square and it is in player 2's interest to move to the blue square. Although it would not fit the definition of a competition game, if the game is modified so that the two players win the named amount if they both choose the same number, and otherwise win nothing, then there are 4 Nash equilibria: (0,0), (1,1), (2,2), and (3,3).

### Nash equilibria in a payoff matrix 收益矩阵中的纳什均衡

There is an easy numerical way to identify Nash equilibria on a payoff matrix. It is especially helpful in two-person games where players have more than two strategies. In this case formal analysis may become too long. This rule does not apply to the case where mixed (stochastic) strategies are of interest. The rule goes as follows: if the first payoff number, in the payoff pair of the cell, is the maximum of the column of the cell and if the second number is the maximum of the row of the cell - then the cell represents a Nash equilibrium.

There is an easy numerical way to identify Nash equilibria on a payoff matrix. It is especially helpful in two-person games where players have more than two strategies. In this case formal analysis may become too long. This rule does not apply to the case where mixed (stochastic) strategies are of interest. The rule goes as follows: if the first payoff number, in the payoff pair of the cell, is the maximum of the column of the cell and if the second number is the maximum of the row of the cell - then the cell represents a Nash equilibrium.

{ | align = “ left” border = “1” cellpadding = “4” cellspacing = “0” style = “ margin: 1em; background: # f9f9f9; border: 1px # aaa solid; border-collapse: collapse; font-size: 95% ; class = “ wikable”
 模板:Diagonal split header Option A 选项 a Option A 一个回报矩阵-纳什均衡用粗体表示 0, 0 0,0 0, 0 25, 40 25,40 25, 40 5, 10 5,10 5, 10 40, 25 40,25 40, 25 0, 0 0,0 0, 0 5, 15 5,15 5, 15 10, 5 10,5 10, 5 15, 5 15,5 15, 5 10, 10 10,10 10, 10

We can apply this rule to a 3×3 matrix:

We can apply this rule to a 3×3 matrix:

Using the rule, we can very quickly (much faster than with formal analysis) see that the Nash equilibria cells are (B,A), (A,B), and (C,C). Indeed, for cell (B,A) 40 is the maximum of the first column and 25 is the maximum of the second row. For (A,B) 25 is the maximum of the second column and 40 is the maximum of the first row. Same for cell (C,C). For other cells, either one or both of the duplet members are not the maximum of the corresponding rows and columns.

Using the rule, we can very quickly (much faster than with formal analysis) see that the Nash equilibria cells are (B,A), (A,B), and (C,C). Indeed, for cell (B,A) 40 is the maximum of the first column and 25 is the maximum of the second row. For (A,B) 25 is the maximum of the second column and 40 is the maximum of the first row. Same for cell (C,C). For other cells, either one or both of the duplet members are not the maximum of the corresponding rows and columns.

This said, the actual mechanics of finding equilibrium cells is obvious: find the maximum of a column and check if the second member of the pair is the maximum of the row. If these conditions are met, the cell represents a Nash equilibrium. Check all columns this way to find all NE cells. An N×N matrix may have between 0 and N×N pure-strategy Nash equilibria.

This said, the actual mechanics of finding equilibrium cells is obvious: find the maximum of a column and check if the second member of the pair is the maximum of the row. If these conditions are met, the cell represents a Nash equilibrium. Check all columns this way to find all NE cells. An N×N matrix may have between 0 and N×N pure-strategy Nash equilibria.

< ! -- 宽屏布局修正 -- >

## Stability 稳定性

The concept of stability, useful in the analysis of many kinds of equilibria, can also be applied to Nash equilibria.

The concept of stability, useful in the analysis of many kinds of equilibria, can also be applied to Nash equilibria.

A Nash equilibrium for a mixed-strategy game is stable if a small change (specifically, an infinitesimal change) in probabilities for one player leads to a situation where two conditions hold:

A Nash equilibrium for a mixed-strategy game is stable if a small change (specifically, an infinitesimal change) in probabilities for one player leads to a situation where two conditions hold:

the player who did not change has no better strategy in the new circumstance


1. the player who did not change has no better strategy in the new circumstance
the player who did change is now playing with a strictly worse strategy.


1. the player who did change is now playing with a strictly worse strategy.

If these cases are both met, then a player with the small change in their mixed strategy will return immediately to the Nash equilibrium. The equilibrium is said to be stable. If condition one does not hold then the equilibrium is unstable. If only condition one holds then there are likely to be an infinite number of optimal strategies for the player who changed.

If these cases are both met, then a player with the small change in their mixed strategy will return immediately to the Nash equilibrium. The equilibrium is said to be stable. If condition one does not hold then the equilibrium is unstable. If only condition one holds then there are likely to be an infinite number of optimal strategies for the player who changed.

In the "driving game" example above there are both stable and unstable equilibria. The equilibria involving mixed strategies with 100% probabilities are stable. If either player changes their probabilities slightly, they will be both at a disadvantage, and their opponent will have no reason to change their strategy in turn. The (50%,50%) equilibrium is unstable. If either player changes their probabilities (which would neither benefit or damage the expectation of the player who did the change, if the other player's mixed strategy is still (50%,50%)), then the other player immediately has a better strategy at either (0%, 100%) or (100%, 0%).

In the "driving game" example above there are both stable and unstable equilibria. The equilibria involving mixed strategies with 100% probabilities are stable. If either player changes their probabilities slightly, they will be both at a disadvantage, and their opponent will have no reason to change their strategy in turn. The (50%,50%) equilibrium is unstable. If either player changes their probabilities (which would neither benefit or damage the expectation of the player who did the change, if the other player's mixed strategy is still (50%,50%)), then the other player immediately has a better strategy at either (0%, 100%) or (100%, 0%).

Stability is crucial in practical applications of Nash equilibria, since the mixed strategy of each player is not perfectly known, but has to be inferred from statistical distribution of their actions in the game. In this case unstable equilibria are very unlikely to arise in practice, since any minute change in the proportions of each strategy seen will lead to a change in strategy and the breakdown of the equilibrium.

Stability is crucial in practical applications of Nash equilibria, since the mixed strategy of each player is not perfectly known, but has to be inferred from statistical distribution of their actions in the game. In this case unstable equilibria are very unlikely to arise in practice, since any minute change in the proportions of each strategy seen will lead to a change in strategy and the breakdown of the equilibrium.

The Nash equilibrium defines stability only in terms of unilateral deviations. In cooperative games such a concept is not convincing enough. Strong Nash equilibrium allows for deviations by every conceivable coalition. Formally, a strong Nash equilibrium is a Nash equilibrium in which no coalition, taking the actions of its complements as given, can cooperatively deviate in a way that benefits all of its members. However, the strong Nash concept is sometimes perceived as too "strong" in that the environment allows for unlimited private communication. In fact, strong Nash equilibrium has to be Pareto efficient. As a result of these requirements, strong Nash is too rare to be useful in many branches of game theory. However, in games such as elections with many more players than possible outcomes, it can be more common than a stable equilibrium.

The Nash equilibrium defines stability only in terms of unilateral deviations. In cooperative games such a concept is not convincing enough. Strong Nash equilibrium allows for deviations by every conceivable coalition. Formally, a strong Nash equilibrium is a Nash equilibrium in which no coalition, taking the actions of its complements as given, can cooperatively deviate in a way that benefits all of its members. However, the strong Nash concept is sometimes perceived as too "strong" in that the environment allows for unlimited private communication. In fact, strong Nash equilibrium has to be Pareto efficient. As a result of these requirements, strong Nash is too rare to be useful in many branches of game theory. However, in games such as elections with many more players than possible outcomes, it can be more common than a stable equilibrium.

A refined Nash equilibrium known as coalition-proof Nash equilibrium (CPNE) Further, it is possible for a game to have a Nash equilibrium that is resilient against coalitions less than a specified size, k. CPNE is related to the theory of the core.

A refined Nash equilibrium known as coalition-proof Nash equilibrium (CPNE) occurs when players cannot do better even if they are allowed to communicate and make "self-enforcing" agreement to deviate. Every correlated strategy supported by iterated strict dominance and on the Pareto frontier is a CPNE. Further, it is possible for a game to have a Nash equilibrium that is resilient against coalitions less than a specified size, k. CPNE is related to the theory of the core.

Finally in the eighties, building with great depth on such ideas Mertens-stable equilibria were introduced as a solution concept. Mertens stable equilibria satisfy both forward induction and backward induction. In a game theory context stable equilibria now usually refer to Mertens stable equilibria.

Finally in the eighties, building with great depth on such ideas Mertens-stable equilibria were introduced as a solution concept. Mertens stable equilibria satisfy both forward induction and backward induction. In a game theory context stable equilibria now usually refer to Mertens stable equilibria.

## Occurrence发生

If a game has a unique Nash equilibrium and is played among players under certain conditions, then the NE strategy set will be adopted. Sufficient conditions to guarantee that the Nash equilibrium is played are:

If a game has a unique Nash equilibrium and is played among players under certain conditions, then the NE strategy set will be adopted. Sufficient conditions to guarantee that the Nash equilibrium is played are:

The players all will do their utmost to maximize their expected payoff as described by the game.


1. The players all will do their utmost to maximize their expected payoff as described by the game.
The players are flawless in execution.


1. The players are flawless in execution.
The players have sufficient intelligence to deduce the solution.


1. The players have sufficient intelligence to deduce the solution.
The players know the planned equilibrium strategy of all of the other players.


1. The players know the planned equilibrium strategy of all of the other players.
The players believe that a deviation in their own strategy will not cause deviations by any other players.


1. The players believe that a deviation in their own strategy will not cause deviations by any other players.
There is common knowledge that all players meet these conditions, including this one. So, not only must each player know the other players meet the conditions, but also they must know that they all know that they meet them, and know that they know that they know that they meet them, and so on.


1. There is common knowledge that all players meet these conditions, including this one. So, not only must each player know the other players meet the conditions, but also they must know that they all know that they meet them, and know that they know that they know that they meet them, and so on.

### Where the conditions are not met 当条件无法满足

Examples of game theory problems in which these conditions are not met:

Examples of game theory problems in which these conditions are not met:

The first condition is not met if the game does not correctly describe the quantities a player wishes to maximize. In this case there is no particular reason for that player to adopt an equilibrium strategy. For instance, the prisoner's dilemma is not a dilemma if either player is happy to be jailed indefinitely.


1. The first condition is not met if the game does not correctly describe the quantities a player wishes to maximize. In this case there is no particular reason for that player to adopt an equilibrium strategy. For instance, the prisoner's dilemma is not a dilemma if either player is happy to be jailed indefinitely.
Intentional or accidental imperfection in execution. For example, a computer capable of flawless logical play facing a second flawless computer will result in equilibrium. Introduction of imperfection will lead to its disruption either through loss to the player who makes the mistake, or through negation of the common knowledge criterion leading to possible victory for the player. (An example would be a player suddenly putting the car into reverse in the game of chicken, ensuring a no-loss no-win scenario).


1. Intentional or accidental imperfection in execution. For example, a computer capable of flawless logical play facing a second flawless computer will result in equilibrium. Introduction of imperfection will lead to its disruption either through loss to the player who makes the mistake, or through negation of the common knowledge criterion leading to possible victory for the player. (An example would be a player suddenly putting the car into reverse in the game of chicken, ensuring a no-loss no-win scenario).
In many cases, the third condition is not met because, even though the equilibrium must exist, it is unknown due to the complexity of the game, for instance in Chinese chess. Or, if known, it may not be known to all players, as when playing tic-tac-toe with a small child who desperately wants to win (meeting the other criteria).


1. In many cases, the third condition is not met because, even though the equilibrium must exist, it is unknown due to the complexity of the game, for instance in Chinese chess. Or, if known, it may not be known to all players, as when playing tic-tac-toe with a small child who desperately wants to win (meeting the other criteria).
The criterion of common knowledge may not be met even if all players do, in fact, meet all the other criteria. Players wrongly distrusting each other's rationality may adopt counter-strategies to expected irrational play on their opponents’ behalf. This is a major consideration in "chicken" or an arms race, for example.


1. The criterion of common knowledge may not be met even if all players do, in fact, meet all the other criteria. Players wrongly distrusting each other's rationality may adopt counter-strategies to expected irrational play on their opponents’ behalf. This is a major consideration in "chicken" or an arms race, for example.

### Where the conditions are met 当条件满足

In his Ph.D. dissertation, John Nash proposed two interpretations of his equilibrium concept, with the objective of showing how equilibrium points

In his Ph.D. dissertation, John Nash proposed two interpretations of his equilibrium concept, with the objective of showing how equilibrium points

/* Styling for Template:Quote */ .templatequote { overflow: hidden; margin: 1em 0; padding: 0 40px; } .templatequote .templatequotecite {

   line-height: 1.5em;
/* @noflip */
text-align: left;
/* @noflip */
margin-top: 0;


}

/* Styling for Template:Quote */ .templatequote { overflow: hidden; margin: 1em 0; padding: 0 40px; } .templatequote .templatequotecite {

   line-height: 1.5em;
/* @noflip */
text-align: left;
/* @noflip */
margin-top: 0;


}

A second interpretation, that Nash referred to by the mass action interpretation, is less demanding on players:

A second interpretation, that Nash referred to by the mass action interpretation, is less demanding on players:

/* Styling for Template:Quote */ .templatequote { overflow: hidden; margin: 1em 0; padding: 0 40px; } .templatequote .templatequotecite {

   line-height: 1.5em;
/* @noflip */
text-align: left;
/* @noflip */
margin-top: 0;


}

/* Styling for Template:Quote */ .templatequote { overflow: hidden; margin: 1em 0; padding: 0 40px; } .templatequote .templatequotecite {

   line-height: 1.5em;
/* @noflip */
text-align: left;
/* @noflip */
margin-top: 0;


}

For a formal result along these lines, see Kuhn, H. and et al., 1996, "The Work of John Nash in Game Theory," Journal of Economic Theory, 69, 153–185.

For a formal result along these lines, see Kuhn, H. and et al., 1996, "The Work of John Nash in Game Theory," Journal of Economic Theory, 69, 153–185.

Due to the limited conditions in which NE can actually be observed, they are rarely treated as a guide to day-to-day behaviour, or observed in practice in human negotiations. However, as a theoretical concept in economics and evolutionary biology, the NE has explanatory power. The payoff in economics is utility (or sometimes money), and in evolutionary biology is gene transmission; both are the fundamental bottom line of survival. Researchers who apply games theory in these fields claim that strategies failing to maximize these for whatever reason will be competed out of the market or environment, which are ascribed the ability to test all strategies. This conclusion is drawn from the "stability" theory above. In these situations the assumption that the strategy observed is actually a NE has often been borne out by research.

Due to the limited conditions in which NE can actually be observed, they are rarely treated as a guide to day-to-day behaviour, or observed in practice in human negotiations. However, as a theoretical concept in economics and evolutionary biology, the NE has explanatory power. The payoff in economics is utility (or sometimes money), and in evolutionary biology is gene transmission; both are the fundamental bottom line of survival. Researchers who apply games theory in these fields claim that strategies failing to maximize these for whatever reason will be competed out of the market or environment, which are ascribed the ability to test all strategies. This conclusion is drawn from the "stability" theory above. In these situations the assumption that the strategy observed is actually a NE has often been borne out by research.

## NE and non-credible threats NE（纳什均衡）与不可信的威胁

Extensive and Normal form illustrations that show the difference between SPNE and other NE. The blue equilibrium is not subgame perfect because player two makes a non-credible threat at 2(2) to be unkind (U).

Extensive and Normal form illustrations that show the difference between SPNE and other NE. The blue equilibrium is not subgame perfect because player two makes a non-credible threat at 2(2) to be unkind (U).

The Nash equilibrium is a superset of the subgame perfect Nash equilibrium. The subgame perfect equilibrium in addition to the Nash equilibrium requires that the strategy also is a Nash equilibrium in every subgame of that game. This eliminates all non-credible threats, that is, strategies that contain non-rational moves in order to make the counter-player change their strategy.

The Nash equilibrium is a superset of the subgame perfect Nash equilibrium. The subgame perfect equilibrium in addition to the Nash equilibrium requires that the strategy also is a Nash equilibrium in every subgame of that game. This eliminates all non-credible threats, that is, strategies that contain non-rational moves in order to make the counter-player change their strategy.

The image to the right shows a simple sequential game that illustrates the issue with subgame imperfect Nash equilibria. In this game player one chooses left(L) or right(R), which is followed by player two being called upon to be kind (K) or unkind (U) to player one, However, player two only stands to gain from being unkind if player one goes left. If player one goes right the rational player two would de facto be kind to her/him in that subgame. However, The non-credible threat of being unkind at 2(2) is still part of the blue (L, (U,U)) Nash equilibrium. Therefore, if rational behavior can be expected by both parties the subgame perfect Nash equilibrium may be a more meaningful solution concept when such dynamic inconsistencies arise.

The image to the right shows a simple sequential game that illustrates the issue with subgame imperfect Nash equilibria. In this game player one chooses left(L) or right(R), which is followed by player two being called upon to be kind (K) or unkind (U) to player one, However, player two only stands to gain from being unkind if player one goes left. If player one goes right the rational player two would de facto be kind to her/him in that subgame. However, The non-credible threat of being unkind at 2(2) is still part of the blue (L, (U,U)) Nash equilibrium. Therefore, if rational behavior can be expected by both parties the subgame perfect Nash equilibrium may be a more meaningful solution concept when such dynamic inconsistencies arise.

## Proof of existence存在的证明

### Proof using the Kakutani fixed-point theorem Kakutani不动点定理的证明

Nash's original proof (in his thesis) used Brouwer's fixed-point theorem (e.g., see below for a variant). We give a simpler proof via the Kakutani fixed-point theorem, following Nash's 1950 paper (he credits David Gale with the observation that such a simplification is possible).

Nash 的原始证明(在他的论文中)使用了 Brouwer 的不动点定理。在1950年 Nash 的论文之后，我们通过角谷静夫不动点定理给出了一个更简单的证明(他相信 David Gale 的观察，这样的简化是可能的)。

Nash's original proof (in his thesis) used Brouwer's fixed-point theorem (e.g., see below for a variant). We give a simpler proof via the Kakutani fixed-point theorem, following Nash's 1950 paper (he credits David Gale with the observation that such a simplification is possible).

To prove the existence of a Nash equilibrium, let $\displaystyle{ r_i(\sigma_{-i}) }$ be the best response of player i to the strategies of all other players.

To prove the existence of a Nash equilibrium, let $\displaystyle{ r_i(\sigma_{-i}) }$ be the best response of player i to the strategies of all other players.

$\displaystyle{ r_i(\sigma_{-i}) = \mathop{\underset{\sigma_i}{\operatorname{arg\,max}}} u_i (\sigma_i,\sigma_{-i}) }$

= mathop { underset { sigma _ i }{ operatorname { arg，max } u _ i (sigma _ i，sigma _ {-i }) </math >

$\displaystyle{ r_i(\sigma_{-i}) = \mathop{\underset{\sigma_i}{\operatorname{arg\,max}}} u_i (\sigma_i,\sigma_{-i}) }$

Here, $\displaystyle{ \sigma \in \Sigma }$, where $\displaystyle{ \Sigma = \Sigma_i \times \Sigma_{-i} }$, is a mixed-strategy profile in the set of all mixed strategies and $\displaystyle{ u_i }$ is the payoff function for player i. Define a set-valued function $\displaystyle{ r\colon \Sigma \rightarrow 2^\Sigma }$ such that $\displaystyle{ r = r_i(\sigma_{-i})\times r_{-i}(\sigma_{i}) }$. The existence of a Nash equilibrium is equivalent to $\displaystyle{ r }$ having a fixed point.

Here, $\displaystyle{ \sigma \in \Sigma }$, where $\displaystyle{ \Sigma = \Sigma_i \times \Sigma_{-i} }$, is a mixed-strategy profile in the set of all mixed strategies and $\displaystyle{ u_i }$ is the payoff function for player i. Define a set-valued function $\displaystyle{ r\colon \Sigma \rightarrow 2^\Sigma }$ such that $\displaystyle{ r = r_i(\sigma_{-i})\times r_{-i}(\sigma_{i}) }$. The existence of a Nash equilibrium is equivalent to $\displaystyle{ r }$ having a fixed point.

Kakutani's fixed point theorem guarantees the existence of a fixed point if the following four conditions are satisfied.

Kakutani's fixed point theorem guarantees the existence of a fixed point if the following four conditions are satisfied.

$\displaystyle{ \Sigma }$ is compact, convex, and nonempty.


1. $\displaystyle{ \Sigma }$ is compact, convex, and nonempty.
$\displaystyle{ r(\sigma) }$ is nonempty.


1. $\displaystyle{ r(\sigma) }$ is nonempty.
$\displaystyle{ r(\sigma) }$ is upper hemicontinuous


1. $\displaystyle{ r(\sigma) }$ is upper hemicontinuous
$\displaystyle{ r(\sigma) }$ is convex.


1. $\displaystyle{ r(\sigma) }$ is convex.

Condition 1. is satisfied from the fact that $\displaystyle{ \Sigma }$ is a simplex and thus compact. Convexity follows from players' ability to mix strategies. $\displaystyle{ \Sigma }$ is nonempty as long as players have strategies.

Condition 1. is satisfied from the fact that $\displaystyle{ \Sigma }$ is a simplex and thus compact. Convexity follows from players' ability to mix strategies. $\displaystyle{ \Sigma }$ is nonempty as long as players have strategies.

Condition 2. and 3. are satisfied by way of Berge's maximum theorem. Because $\displaystyle{ u_i }$ is continuous and compact, $\displaystyle{ r(\sigma_i) }$ is non-empty and upper hemicontinuous.

Condition 2. and 3. are satisfied by way of Berge's maximum theorem. Because $\displaystyle{ u_i }$ is continuous and compact, $\displaystyle{ r(\sigma_i) }$ is non-empty and upper hemicontinuous.

Condition 4. is satisfied as a result of mixed strategies. Suppose $\displaystyle{ \sigma_i, \sigma'_i \in r(\sigma_{-i}) }$, then $\displaystyle{ \lambda \sigma_i + (1-\lambda) \sigma'_i \in r(\sigma_{-i}) }$. i.e. if two strategies maximize payoffs, then a mix between the two strategies will yield the same payoff.

Condition 4. is satisfied as a result of mixed strategies. Suppose $\displaystyle{ \sigma_i, \sigma'_i \in r(\sigma_{-i}) }$, then $\displaystyle{ \lambda \sigma_i + (1-\lambda) \sigma'_i \in r(\sigma_{-i}) }$. i.e. if two strategies maximize payoffs, then a mix between the two strategies will yield the same payoff.

Therefore, there exists a fixed point in $\displaystyle{ r }$ and a Nash equilibrium.

Therefore, there exists a fixed point in $\displaystyle{ r }$ and a Nash equilibrium.

When Nash made this point to John von Neumann in 1949, von Neumann famously dismissed it with the words, "That's trivial, you know. That's just a fixed-point theorem." (See Nasar, 1998, p. 94.)

1949年，当 Nash 向约翰·冯·诺伊曼提出这个观点时，von Neumann 用这样的话驳斥了这个观点: “你知道，这是微不足道的。这只是一个不动点定理。”(见 Nasar，1998，p. 94。)

When Nash made this point to John von Neumann in 1949, von Neumann famously dismissed it with the words, "That's trivial, you know. That's just a fixed-point theorem." (See Nasar, 1998, p. 94.)

### Alternate proof using the Brouwer fixed-point theorem 用Brouwer不动点定理的交替证明

We have a game $\displaystyle{ G=(N,A,u) }$ where $\displaystyle{ N }$ is the number of players and $\displaystyle{ A = A_1 \times \cdots \times A_N }$ is the action set for the players. All of the action sets $\displaystyle{ A_i }$ are finite. Let $\displaystyle{ \Delta = \Delta_1 \times \cdots \times \Delta_N }$ denote the set of mixed strategies for the players. The finiteness of the $\displaystyle{ A_i }$s ensures the compactness of $\displaystyle{ \Delta }$.

We have a game $\displaystyle{ G=(N,A,u) }$ where $\displaystyle{ N }$ is the number of players and $\displaystyle{ A = A_1 \times \cdots \times A_N }$ is the action set for the players. All of the action sets $\displaystyle{ A_i }$ are finite. Let $\displaystyle{ \Delta = \Delta_1 \times \cdots \times \Delta_N }$ denote the set of mixed strategies for the players. The finiteness of the $\displaystyle{ A_i }$s ensures the compactness of $\displaystyle{ \Delta }$.

We can now define the gain functions. For a mixed strategy $\displaystyle{ \sigma \in \Delta }$, we let the gain for player $\displaystyle{ i }$ on action $\displaystyle{ a \in A_i }$ be

We can now define the gain functions. For a mixed strategy $\displaystyle{ \sigma \in \Delta }$, we let the gain for player $\displaystyle{ i }$ on action $\displaystyle{ a \in A_i }$ be

$\displaystyle{ \text{Gain}_i(\sigma,a) = \max \{0, u_i(a, \sigma_{-i}) - u_i(\sigma_{i}, \sigma_{-i})\}. }$

< math > 文本{ Gain } _ i (sigma，a) = max {0，u _ i (a，sigma _ i })-u _ i (sigma _ { i } ，sigma _ i })}。 </math >

$\displaystyle{ \text{Gain}_i(\sigma,a) = \max \{0, u_i(a, \sigma_{-i}) - u_i(\sigma_{i}, \sigma_{-i})\}. }$

The gain function represents the benefit a player gets by unilaterally changing their strategy. We now define $\displaystyle{ g = (g_1,\dotsc,g_N) }$ where

The gain function represents the benefit a player gets by unilaterally changing their strategy. We now define $\displaystyle{ g = (g_1,\dotsc,g_N) }$ where

$\displaystyle{ g_i(\sigma)(a) = \sigma_i(a) + \text{Gain}_i(\sigma,a) }$

(a) = sigma _ i (a) + text { Gain } _ i (sigma，a) </math >

$\displaystyle{ g_i(\sigma)(a) = \sigma_i(a) + \text{Gain}_i(\sigma,a) }$

for $\displaystyle{ \sigma \in \Delta, a \in A_i }$. We see that

$\displaystyle{ \sigma \in \Delta, a \in A_i }$，我们看到

for $\displaystyle{ \sigma \in \Delta, a \in A_i }$. We see that

$\displaystyle{ \sum_{a \in A_i} g_i(\sigma)(a) = \sum_{a \in A_i} \sigma_i(a) + \text{Gain}_i(\sigma,a) = 1 + \sum_{a \in A_i} \text{Gain}_i(\sigma,a) \gt 0. }$

(a) = sum _ { a in a _ i } g _ i (sigma)(a) = sum _ a in a _ i } sigma _ i (a) + text { Gain } _ i (sigma，a) = 1 + sum _ { a in a _ i } text { Gain } _ i (sigma，a) > 0

$\displaystyle{ \sum_{a \in A_i} g_i(\sigma)(a) = \sum_{a \in A_i} \sigma_i(a) + \text{Gain}_i(\sigma,a) = 1 + \sum_{a \in A_i} \text{Gain}_i(\sigma,a) \gt 0. }$

Next we define:

Next we define:

$\displaystyle{ \begin{cases} f = (f_1, \cdots, f_N) : \Delta \to \Delta \\ f_i(\sigma)(a) = \frac{g_i(\sigma)(a)}{\sum_{b \in A_i} g_i(\sigma)(b)} & a \in A_i \end{cases} }$

F = (f _ 1，cdots，f _ n) : Delta to Delta f _ i (sigma)(a) = frac { g _ i (sigma)(a)}{ sum _ b in a _ i (sigma)(b)} & a in a _ i end { cases } </math >

$\displaystyle{ \begin{cases} f = (f_1, \cdots, f_N) : \Delta \to \Delta \\ f_i(\sigma)(a) = \frac{g_i(\sigma)(a)}{\sum_{b \in A_i} g_i(\sigma)(b)} & a \in A_i \end{cases} }$

It is easy to see that each $\displaystyle{ f_i }$ is a valid mixed strategy in $\displaystyle{ \Delta_i }$. It is also easy to check that each $\displaystyle{ f_i }$ is a continuous function of $\displaystyle{ \sigma }$, and hence $\displaystyle{ f }$ is a continuous function. As the cross product of a finite number of compact convex sets, $\displaystyle{ \Delta }$ is also compact and convex. Applying the Brouwer fixed point theorem to $\displaystyle{ f }$ and $\displaystyle{ \Delta }$ we conclude that $\displaystyle{ f }$ has a fixed point in $\displaystyle{ \Delta }$, call it $\displaystyle{ \sigma^* }$. We claim that $\displaystyle{ \sigma^* }$ is a Nash equilibrium in $\displaystyle{ G }$. For this purpose, it suffices to show that

It is easy to see that each $\displaystyle{ f_i }$ is a valid mixed strategy in $\displaystyle{ \Delta_i }$. It is also easy to check that each $\displaystyle{ f_i }$ is a continuous function of $\displaystyle{ \sigma }$, and hence $\displaystyle{ f }$ is a continuous function. As the cross product of a finite number of compact convex sets, $\displaystyle{ \Delta }$ is also compact and convex. Applying the Brouwer fixed point theorem to $\displaystyle{ f }$ and $\displaystyle{ \Delta }$ we conclude that $\displaystyle{ f }$ has a fixed point in $\displaystyle{ \Delta }$, call it $\displaystyle{ \sigma^* }$. We claim that $\displaystyle{ \sigma^* }$ is a Nash equilibrium in $\displaystyle{ G }$. For this purpose, it suffices to show that

$\displaystyle{ \forall i \in \{1, \cdots, N\}, \forall a \in A_i: \quad \text{Gain}_i(\sigma^*,a) = 0. }$

{1，cdots，n } ，for all a in a _ i: quad text { Gain } _ i (sigma ^ * ，a) = 0

$\displaystyle{ \forall i \in \{1, \cdots, N\}, \forall a \in A_i: \quad \text{Gain}_i(\sigma^*,a) = 0. }$

This simply states that each player gains no benefit by unilaterally changing their strategy, which is exactly the necessary condition for a Nash equilibrium.

This simply states that each player gains no benefit by unilaterally changing their strategy, which is exactly the necessary condition for a Nash equilibrium.

Now assume that the gains are not all zero. Therefore, $\displaystyle{ \exists i \in \{1, \cdots, N\}, }$ and $\displaystyle{ a \in A_i }$ such that $\displaystyle{ \text{Gain}_i(\sigma^*, a) \gt 0 }$. Note then that

Now assume that the gains are not all zero. Therefore, $\displaystyle{ \exists i \in \{1, \cdots, N\}, }$ and $\displaystyle{ a \in A_i }$ such that $\displaystyle{ \text{Gain}_i(\sigma^*, a) \gt 0 }$. Note then that

$\displaystyle{ \sum_{a \in A_i} g_i(\sigma^*, a) = 1 + \sum_{a \in A_i} \text{Gain}_i(\sigma^*,a) \gt 1. }$

< math > sum _ { a in a _ i } g _ i (sigma ^ * ，a) = 1 + sum _ { a in a _ i } text { Gain } _ i (sigma ^ * ，a) > 1. </math >

$\displaystyle{ \sum_{a \in A_i} g_i(\sigma^*, a) = 1 + \sum_{a \in A_i} \text{Gain}_i(\sigma^*,a) \gt 1. }$

So let

So let

$\displaystyle{ C = \sum_{a \in A_i} g_i(\sigma^*, a). }$

C = sum _ { a in a _ i } g _ i (sigma ^ * ，a)

$\displaystyle{ C = \sum_{a \in A_i} g_i(\sigma^*, a). }$

Also we shall denote $\displaystyle{ \text{Gain}(i,\cdot) }$ as the gain vector indexed by actions in $\displaystyle{ A_i }$. Since $\displaystyle{ \sigma^* }$ is the fixed point we have:

Also we shall denote $\displaystyle{ \text{Gain}(i,\cdot) }$ as the gain vector indexed by actions in $\displaystyle{ A_i }$. Since $\displaystyle{ \sigma^* }$ is the fixed point we have:

\displaystyle{ \begin{align} 1.1.1.2.2.2.2.2.2.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3 :\lt math\gt \begin{align} \sigma^* = f(\sigma^*) &\Rightarrow \sigma^*_i = f_i(\sigma^*) \\ Sigma ^ * = f (sigma ^ *) & right tarrow sigma ^ * i = f _ i (sigma ^ *) \sigma^* = f(\sigma^*) &\Rightarrow \sigma^*_i = f_i(\sigma^*) \\ &\Rightarrow \sigma^*_i = \frac{g_i(\sigma^*)}{\sum_{a \in A_i} g_i(\sigma^*)(a)} \\ [6pt] & right tarrow sigma ^ * i = frac { g _ i (sigma ^ *)}{ sum _ { a in a _ i } g _ i (sigma ^ *)(a)}[6 pt ] &\Rightarrow \sigma^*_i = \frac{g_i(\sigma^*)}{\sum_{a \in A_i} g_i(\sigma^*)(a)} \\ [6pt] &\Rightarrow \sigma^*_i = \frac{1}{C} \left (\sigma^*_i + \text{Gain}_i(\sigma^*,\cdot) \right ) \\ [6pt] & right tarrow sigma ^ * i = frac {1}{ c } left (sigma ^ * i + text { Gain } i (sigma ^ * ，cdot) right)[6 pt ] &\Rightarrow \sigma^*_i = \frac{1}{C} \left (\sigma^*_i + \text{Gain}_i(\sigma^*,\cdot) \right ) \\ [6pt] &\Rightarrow C\sigma^*_i = \sigma^*_i + \text{Gain}_i(\sigma^*,\cdot) \\ & right tarrow c sigma ^ * _ i = sigma ^ * _ i + text { Gain } _ i (sigma ^ * ，cdot) &\Rightarrow C\sigma^*_i = \sigma^*_i + \text{Gain}_i(\sigma^*,\cdot) \\ &\Rightarrow \left(C-1\right)\sigma^*_i = \text{Gain}_i(\sigma^*,\cdot) \\ & right tarrow left (C-1 right) sigma ^ * _ i = text { Gain } _ i (sigma ^ * ，cdot) &\Rightarrow \left(C-1\right)\sigma^*_i = \text{Gain}_i(\sigma^*,\cdot) \\ &\Rightarrow \sigma^*_i = \left(\frac{1}{C-1}\right)\text{Gain}_i(\sigma^*,\cdot). & right tarrow sigma ^ * _ i = left (frac {1}{ C-1} right) text { Gain } _ i (sigma ^ * ，cdot). &\Rightarrow \sigma^*_i = \left(\frac{1}{C-1}\right)\text{Gain}_i(\sigma^*,\cdot). \end{align} }

\end{align}[/itex]

Since $\displaystyle{ C \gt 1 }$ we have that $\displaystyle{ \sigma^*_i }$ is some positive scaling of the vector $\displaystyle{ \text{Gain}_i(\sigma^*,\cdot) }$. Now we claim that

Since $\displaystyle{ C \gt 1 }$ we have that $\displaystyle{ \sigma^*_i }$ is some positive scaling of the vector $\displaystyle{ \text{Gain}_i(\sigma^*,\cdot) }$. Now we claim that

$\displaystyle{ \forall a \in A_i: \quad \sigma^*_i(a)(u_i(a_i, \sigma^*_{-i}) - u_i(\sigma^*_i, \sigma^*_{-i})) = \sigma^*_i(a)\text{Gain}_i(\sigma^*, a) }$

$\displaystyle{ \forall a \in A_i: \quad \sigma^*_i(a)(u_i(a_i, \sigma^*_{-i}) - u_i(\sigma^*_i, \sigma^*_{-i})) = \sigma^*_i(a)\text{Gain}_i(\sigma^*, a) }$

To see this, we first note that if $\displaystyle{ \text{Gain}_i(\sigma^*, a) \gt 0 }$ then this is true by definition of the gain function. Now assume that $\displaystyle{ \text{Gain}_i(\sigma^*, a) = 0 }$. By our previous statements we have that

To see this, we first note that if $\displaystyle{ \text{Gain}_i(\sigma^*, a) \gt 0 }$ then this is true by definition of the gain function. Now assume that $\displaystyle{ \text{Gain}_i(\sigma^*, a) = 0 }$. By our previous statements we have that

$\displaystyle{ \sigma^*_i(a) = \left(\frac{1}{C-1}\right)\text{Gain}_i(\sigma^*, a) = 0 }$

< math > sigma ^ * i (a) = left (frac {1}{ C-1} right) text { Gain } _ i (sigma ^ * ，a) = 0 </math >

$\displaystyle{ \sigma^*_i(a) = \left(\frac{1}{C-1}\right)\text{Gain}_i(\sigma^*, a) = 0 }$

and so the left term is zero, giving us that the entire expression is $\displaystyle{ 0 }$ as needed.

and so the left term is zero, giving us that the entire expression is $\displaystyle{ 0 }$ as needed.

So we finally have that

So we finally have that

\displaystyle{ \begin{align} 1.1.1.2.2.2.2.2.2.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3.3 :\lt math\gt \begin{align} 0 &= u_i(\sigma^*_i, \sigma^*_{-i}) - u_i(\sigma^*_i, \sigma^*_{-i}) \\ 0 & = u _ i (sigma ^ * _ i，sigma ^ * _ {-i })-u _ i (sigma ^ * _ i，sigma ^ * _ {-i }) 0 &= u_i(\sigma^*_i, \sigma^*_{-i}) - u_i(\sigma^*_i, \sigma^*_{-i}) \\ &= \left(\sum_{a \in A_i} \sigma^*_i(a)u_i(a_i, \sigma^*_{-i})\right) - u_i(\sigma^*_i, \sigma^*_{-i}) \\ & = left (sum _ { a in a _ i } sigma ^ * _ i (a _ i，sigma ^ * _ {-i }) right)-u _ i (sigma ^ * _ i，sigma ^ * _ {-i }) &= \left(\sum_{a \in A_i} \sigma^*_i(a)u_i(a_i, \sigma^*_{-i})\right) - u_i(\sigma^*_i, \sigma^*_{-i}) \\ & = \sum_{a \in A_i} \sigma^*_i(a) (u_i(a_i, \sigma^*_{-i}) - u_i(\sigma^*_i, \sigma^*_{-i})) \\ & = sum _ { a in a _ i } sigma ^ * _ i (u _ i (a _ i，sigma ^ * _ {-i })-u _ i (sigma ^ * _ i，sigma ^ * _ {-i })) & = \sum_{a \in A_i} \sigma^*_i(a) (u_i(a_i, \sigma^*_{-i}) - u_i(\sigma^*_i, \sigma^*_{-i})) \\ & = \sum_{a \in A_i} \sigma^*_i(a) \text{Gain}_i(\sigma^*, a) && \text{ by the previous statements } \\ 和 = sum _ { a in a _ i } sigma ^ * _ i (a) text { Gain } _ i (sigma ^ * ，a) & text { by the previous statements } & = \sum_{a \in A_i} \sigma^*_i(a) \text{Gain}_i(\sigma^*, a) && \text{ by the previous statements } \\ &= \sum_{a \in A_i} \left( C -1 \right) \sigma^*_i(a)^2 \gt 0 & = sum _ { a in a _ i } left (c-1 right) sigma ^ * _ i (a) ^ 2 \gt 0 &= \sum_{a \in A_i} \left( C -1 \right) \sigma^*_i(a)^2 \gt 0 \end{align} }

\end{align}[/itex]

where the last inequality follows since $\displaystyle{ \sigma^*_i }$ is a non-zero vector. But this is a clear contradiction, so all the gains must indeed be zero. Therefore, $\displaystyle{ \sigma^* }$ is a Nash equilibrium for $\displaystyle{ G }$ as needed.

where the last inequality follows since $\displaystyle{ \sigma^*_i }$ is a non-zero vector. But this is a clear contradiction, so all the gains must indeed be zero. Therefore, $\displaystyle{ \sigma^* }$ is a Nash equilibrium for $\displaystyle{ G }$ as needed.

## Computing Nash equilibria 计算纳什均衡

If a player A has a dominant strategy $\displaystyle{ s_A }$ then there exists a Nash equilibrium in which A plays $\displaystyle{ s_A }$. In the case of two players A and B, there exists a Nash equilibrium in which A plays $\displaystyle{ s_A }$ and B plays a best response to $\displaystyle{ s_A }$. If $\displaystyle{ s_A }$ is a strictly dominant strategy, A plays $\displaystyle{ s_A }$ in all Nash equilibria. If both A and B have strictly dominant strategies, there exists a unique Nash equilibrium in which each plays their strictly dominant strategy.

If a player A has a dominant strategy $\displaystyle{ s_A }$ then there exists a Nash equilibrium in which A plays $\displaystyle{ s_A }$. In the case of two players A and B, there exists a Nash equilibrium in which A plays $\displaystyle{ s_A }$ and B plays a best response to $\displaystyle{ s_A }$. If $\displaystyle{ s_A }$ is a strictly dominant strategy, A plays $\displaystyle{ s_A }$ in all Nash equilibria. If both A and B have strictly dominant strategies, there exists a unique Nash equilibrium in which each plays their strictly dominant strategy.

In games with mixed-strategy Nash equilibria, the probability of a player choosing any particular (so pure) strategy can be computed by assigning a variable to each strategy that represents a fixed probability for choosing that strategy. In order for a player to be willing to randomize, their expected payoff for each (pure) strategy should be the same. In addition, the sum of the probabilities for each strategy of a particular player should be 1. This creates a system of equations from which the probabilities of choosing each strategy can be derived.

In games with mixed-strategy Nash equilibria, the probability of a player choosing any particular (so pure) strategy can be computed by assigning a variable to each strategy that represents a fixed probability for choosing that strategy. In order for a player to be willing to randomize, their expected payoff for each (pure) strategy should be the same. In addition, the sum of the probabilities for each strategy of a particular player should be 1. This creates a system of equations from which the probabilities of choosing each strategy can be derived.

### Examples 样例

{ | border = “1” cellpadding = “4” cellspacing = “0” style = “ margin: 1em 1em 1em 1em; background: # f9f9f9; border: 1px # aaa solid; border-collapse: collapse; font-size: 95% ; class = “ wikitable floatright”
 模板:Diagonal split header Player B plays H 玩家 b 扮演 h Player B plays H Matching penny −1, +1 −1, +1 −1, +1 +1, −1 +1, −1 +1, −1 +1, −1 +1, −1 +1, −1 −1, +1 −1, +1 −1, +1

In the matching pennies game, player A loses a point to B if A and B play the same strategy and wins a point from B if they play different strategies. To compute the mixed-strategy Nash equilibrium, assign A the probability p of playing H and (1−p) of playing T, and assign B the probability q of playing H and (1−q) of playing T.

In the matching pennies game, player A loses a point to B if A and B play the same strategy and wins a point from B if they play different strategies. To compute the mixed-strategy Nash equilibrium, assign A the probability p of playing H and (1−p) of playing T, and assign B the probability q of playing H and (1−q) of playing T.

E[payoff for A playing H] = (−1)q + (+1)(1−q) = 1−2q

E [ a 演奏 h 的收益] = (- 1) q + (+ 1)(1-q) = 1-2q

E[payoff for A playing H] = (−1)q + (+1)(1−q) = 1−2q

E[payoff for A playing T] = (+1)q + (−1)(1−q) = 2q−1

E [ a 演奏 t 的收益] = (+ 1) q + (- 1)(1-q) = 2q-1

E[payoff for A playing T] = (+1)q + (−1)(1−q) = 2q−1

E[payoff for A playing H] = E[payoff for A playing T] ⇒ 1−2q = 2q−1 ⇒ q = 1/2

E [ a 玩 h 的收益] = e [ a 玩 t 的收益] something 1-2q = 2q-1 something q = 1/2

E[payoff for A playing H] = E[payoff for A playing T] ⇒ 1−2q = 2q−1 ⇒ q = 1/2

E[payoff for B playing H] = (+1)p + (−1)(1−p) = 2p−1

E [ b 参与 h 的收益] = (+ 1) p + (- 1)(1-p) = 2p-1

E[payoff for B playing H] = (+1)p + (−1)(1−p) = 2p−1

E[payoff for B playing T] = (−1)p + (+1)(1−p) = 1−2p

E [ b 参与 t 的收益] = (- 1) p + (+ 1)(1-p) = 1-2p

E[payoff for B playing T] = (−1)p + (+1)(1−p) = 1−2p

E[payoff for B playing H] = E[payoff for B playing T] ⇒ 2p−1 = 1−2p ⇒ p = 1/2

E [ b 参与 h 的收益] = e [ b 参与 t 的收益]2p-1 = 1-2p something p = 1/2

E[payoff for B playing H] = E[payoff for B playing T] ⇒ 2p−1 = 1−2pp = 1/2

Thus a mixed-strategy Nash equilibrium, in this game, is for each player to randomly choose H or T with p = 1/2 and q = 1/2.

Thus a mixed-strategy Nash equilibrium, in this game, is for each player to randomly choose H or T with p = 1/2 and q = 1/2.

## Notes 备注

1. Osborne, Martin J.; Rubinstein, Ariel (12 Jul 1994). A Course in Game Theory. Cambridge, MA: MIT. p. 14. ISBN 9780262150415.
2. Schelling, Thomas, The Strategy of Conflict, copyright 1960, 1980, Harvard University Press, .
3. De Fraja, G.; Oliveira, T.; Zanchi, L. (2010). "Must Try Harder: Evaluating the Role of Effort in Educational Attainment". Review of Economics and Statistics. 92 (3): 577. doi:10.1162/REST_a_00013.
4. Ward, H. (1996). "Game Theory and the Politics of Global Warming: The State of Play and Beyond". Political Studies. 44 (5): 850–871. doi:10.1111/j.1467-9248.1996.tb00338.x.,
5. Thorpe, Robert B.; Jennings, Simon; Dolder, Paul J. (2017). "Risks and benefits of catching pretty good yield in multispecies mixed fisheries". ICES Journal of Marine Science. 74 (8): 2097–2106. doi:10.1093/icesjms/fsx062.,
6. "Marketing Lessons from Dr. Nash - Andrew Frank". 2015-05-25. Retrieved 2015-08-30.
7. Chiappori, P. -A.; Levitt, S.; Groseclose, T. (2002). "Testing Mixed-Strategy Equilibria when Players Are Heterogeneous: The Case of Penalty Kicks in Soccer" (PDF). American Economic Review. 92 (4): 1138. CiteSeerX 10.1.1.178.1646. doi:10.1257/00028280260344678.
8. Djehiche, B.; Tcheukam, A.; Tembine, H. (2017). "A Mean-Field Game of Evacuation in Multilevel Building". IEEE Transactions on Automatic Control. 62 (10): 5154–5169. doi:10.1109/TAC.2017.2679487. ISSN 0018-9286.
9. Djehiche, Boualem; Tcheukam, Alain; Tembine, Hamidou (2017-09-27). "Mean-Field-Type Games in Engineering". AIMS Electronics and Electrical Engineering (in English). 1: 18–73. arXiv:1605.03281. doi:10.3934/ElectrEng.2017.1.18.
10. Cournot A. (1838) Researches on the Mathematical Principles of the Theory of Wealth
11. J. Von Neumann, O. Morgenstern, Theory of Games and Economic Behavior, copyright 1944, 1953, Princeton University Press
12. Carmona, Guilherme; Podczeck, Konrad (2009). "On the Existence of Pure Strategy Nash Equilibria in Large Games" (PDF). Journal of Economic Theory. 144 (3): 1300–1319. doi:10.1016/j.jet.2008.11.009. SSRN 882466.
13. von Ahn, Luis. "Preliminaries of Game Theory" (PDF). Archived from the original (PDF) on 2011-10-18. Retrieved 2008-11-07.
14. "Nash Equilibria". hoylab.cornell.edu. Retrieved 2019-12-08.
15. MIT OpenCourseWare. 6.254: Game Theory with Engineering Applications, Spring 2010. Lecture 6: Continuous and Discontinuous Games.
16. B. D. Bernheim; B. Peleg; M. D. Whinston (1987), "Coalition-Proof Equilibria I. Concepts", Journal of Economic Theory, 42 (1): 1–12, doi:10.1016/0022-0531(87)90099-8.
17. Aumann, R. (1959). "Acceptable points in general cooperative n-person games". Contributions to the Theory of Games. IV. Princeton, N.J.: Princeton University Press. ISBN 978-1-4008-8216-8.
18. D. Moreno; J. Wooders (1996), "Coalition-Proof Equilibrium" (PDF), Games and Economic Behavior, 17 (1): 80–112, doi:10.1006/game.1996.0095, hdl:10016/4408.
19. T. L. Turocy, B. Von Stengel, Game Theory, copyright 2001, Texas A&M University, London School of Economics, pages 141-144. 模板:Citation needed span – it can be represented as a strategy complying with his original conditions for a game with a NE. Such games may not have unique NE, but at least one of the many equilibrium strategies would be played by hypothetical players having perfect knowledge of all 模板:Citation needed span.
20. J. C. Cox, M. Walker, Learning to Play Cournot Duoploy Strategies -{zh-cn:互联网档案馆; zh-tw:網際網路檔案館; zh-hk:互聯網檔案館;}-存檔，存档日期2013-12-11., copyright 1997, Texas A&M University, University of Arizona, pages 141-144
21. Fudenburg, Drew; Tirole, Jean (1991). Game Theory. MIT Press. ISBN 978-0-262-06141-4.

## References 索引

### Other references其他文献

Category:Game theory equilibrium concepts

Category:Fixed points (mathematics)

Category:1951 in economics

This page was moved from wikipedia:en:Nash equilibrium. Its edit history can be viewed at 纳什均衡/edithistory 此页摘自维基百科：英语：纳什均衡。它的编辑历史可以在纳什均衡/编辑历史记录查阅