Game Theory is the general name given to a collection of mathematical proofs to assess possible strategies for players in competitive and cooperative scenarios
Normal Form Games
All normal form games:
- There is a finite set of $N$ players
- Each player must select a single strategy, from which all players may choose
- An action profile $A$ is used to represent the collection of single decisions of all players, for a single instance of a game
- There is a known payoff $u$ (also called utility) for each possible action profile
- Payoffs are fully known, complete information is known
Cooperative Games vs. Competitive Games
-
Cooperative Games are those where selected strategies result in a global improved gain or global reduced loss, and where individual results are less important than the combined result
-
Competitive Games are those where it is advantageous to select strategies which result in one or more players receiving improved gains or reduced losses, to the detriment of other players
-
Most games include a combination of competitive and cooperative games
Common Pay-off vs. Zero Sum Pay-off
-
Common Payoff games result in all players receiving the same payoff. Ie. Communal outcomes
-
Zero Sum (or more correctly constant sum) games result in the same payoff being shared amongst all players regardless of the strategies each select. I.e. individual gains are at the expense of other players losses
Strategies
Strategies for Normal Form Games
-
Pure Strategy is when a single player selects and plays a single strategy
-
Mixed Strategy is when a single player selects and plays a single strategy, selected randomly (based on a probability distribution) from possible strategies
Other terms
-
We sometimes call a player an agent in the context of Game Theory, (partly to dehumanize the parties which may make choices). In this case Multi Agent games typically involve $3 \text{ (or more)}$ players or agents, not to be confused with multi-agent systems which describes a collection of autonomous independent systems which interact within an environment
-
Support means the set of pure strategies which, when chosen randomly by probability distribution, where each strategy in the set has a positive probability of being selected.
-
A Strategy Profile is the Cartesian Product of individual strategies
-
Cartesian Product (general mathematical term)
A simple example of a Cartesian Product follows, where
\[\text{if } A = \{x,y,z\} \text{ and }B = \{1,2,3\}\]then formally
\[A \times B = \{(a,b) | a \in A \text{ and } b \in B\}\]or more simply,
\[A \times B = \{ (x,1),(x,2),(x,3),(y,1),(y,2),(y,3),(z,1),(z,2),(z,3)\}\]We can also represent this as a matrix, (so long as we remember that the Cartesian Product is the full matrix, treated as a set, which we read in order)
1 2 3 x (x,1) (x,2) (x,3) y (y,1) (y,2) (y,3) z (z,1) (z,2) (z,3)
Examples
Two vehicles on a Road (Chicken)
Type: Normal Form Common Payoff Cooperative Game
If two vehicles, approaching one another on the same section of road, choose which side of the road to use, they will either collide or pass one another.
Left | Right | |
Left | (1,1) | (0,0) |
Right | (0,0) | (1,1) |
Prisoners Dilemma
Type: Normal Form Cooperative Game
Silence | Confess | |
Silence | (-1,-1) | (-4,0) |
Confess | (0,-4) | (-3,-3) |
Generally, if $c > a > d > b$ means always a Prisoners Dilemma game
Option A | Option B | |
Option A | (a,a) | (b,c) |
Option B | (c,b) | (d,d) |
Rock Paper Scissors
Rock | Paper | Scissors | |
Rock | (0,0) | (-1,1) | (1,-1) |
Paper | (1,-1) | (0,0) | (-1,1) |
Scissors | (-1,1) | (1,-1) | (0,0) |
Optimal Strategies
- A concept of optimal strategy exists outcomes whenever a strategy may be selected to derive an advantage for any player
- Obviously whenever an optimal strategy is played, there also exists a non-optimal strategy and failure to chose the optimal strategy results in the loss of advantage
Pareto Domination
Whenever a player may select an optimal Strategy Profile $s$ without negatively impacting other players, then the Strategy Profile $s$ Pareto Dominates other Strategy Profiles $s’$
Using Pareto Domination we can sort some of our choices, to select better choices
Pareto Optimization
Whenever a Strategy Profile Pareto Dominates all other Strategy Profiles, it is considered Pareto Optimized
This is not just about Action Profile optimization but Strategy Profile optimization of the game.
Nash Equilibrium
If a competitive game is considered from a single player’s perspective, then we can see that the player, who can reason, will attempt to use opportunities to gain advantage.
If a player has perfect information and knows what other players strategies will be, then the player may select the best strategy to maximize payoff.
If all players select the best strategy given they know the strategies of other players, then an equilibrium is achieved (i.e. no player is playing a non-optimal strategy).
This is known as a Nash Equilibrium, and it is accepted that players would always select a best strategy determined by Nash Equilibrium, as it will result in the best outcome for the individual.
Sometimes, there isn’t a single strategy which a player may select to ensure a best response to a game. Whenever this occurs, the strategy profile is called a Weak Nash. A Weak Nash equilibrium exists whenever several optimal strategies exist and the payoff is equal to other optimal strategies
A Strict Nash equilibrium exists whenever only a single best strategy can be selected by all players.
Note: A Nash Equilibrium exists for any game with a finite set of players and finite set of actions
MaxMin Strategy
When a player is playing defensively, a player may protect themselves from aggressive opponents (whose goal it is to maximize loss to opponents), by selecting the maximum payoff in a worst case scenario, or the best minimum guaranteed payoff
Note: Whenever a player moves first, the MaxMin strategy is always the best strategy for that player, given other players can respond to the strategy
MinMax Strategy
Whenever a player is playing in a dominant position, they can ensure their opponents receive the least payoff possible, before considering the payoff to themselves.
Note: Opponent players may then select to play a strategy to minimize their loss in a worst case scenario.
Minimax Strategy
In a Zero Sum game, MinMax and MaxMin strategies are the same subset of possible strategies, and for this reason, they’re called Minimax Strategies
Regret
Regret is the formal term given to the loss that a player has incurred as a result of playing the incorrect strategy given the strategy played by other players.
It is pretty easy to calculate for player $i$, after a game, knowing the strategy of other players as follows:
\[\text{Regret }_i = \text{Best Payoff }_i - \text{ Actual Payoff}_i\]Precalculating regret helps select the strategy to minimize regret, ahead of time.
Minimax Regret Action
Minimax Regret Action is the action which results in the smallest maximum regret
Dominant and Dominated Strategies
Strategies which are better than others are dominant. Strategies which are worse than others, are dominated.
- Strictly dominant strategies are better than all other strategies.
- Weak dominant strategies are better than some strategies, but not the best strategy
- Very Weak dominant strategies are better than at least one other strategy.
Iterative removal of strictly dominated strategies
During analysis of a game, if players take turns to eliminate strategies over and over again, which are dominated for themselves, they will arrive at a smaller set of options which are better.
Iterated removal of strictly dominated strategies preserves Nash equilibria. However the result of reducing the strategies may not full remove all options (Ie. consider the Prisoner’s Dilemma, which cannot be further reduced).
Here’s an example (where players are maximizing their own gains, before minimizing their opponents gains), considered where $Player_{col}$ actions are assessed first.
-
Step 0 : Original game strategy profile
Left Centre Right Sum Upper (3,2) (1,0) (2,6) (6,8) Middle (4,2) (2,0) (3,1) (9,3) Lower (1,5) (4,0) (2,1) (7,6) Sum (8,9) (7,0) (7,8) -
Step 1 : $Player_{col}$ removes dominated strategy Centre to maximize gain
Left Right Sum Upper (3,2) (2,6) (5,8) Middle (4,2) (3,1) (7,3) Lower (1,5) (2,1) (3,6) Sum (8,9) (7,8) -
Step 2 : $Player_{row}$ removes dominated strategy Lower to maximize gain
Left Right Sum Upper (3,2) (2,6) (5,8) Middle (4,2) (3,1) (7,3) Sum (7,4) (5,7) -
Step 3 : $Player_{col}$ removes dominated strategy Left to maximize gain
Right Sum Upper (2,6) (2,6) Middle (3,1) (3,1) Sum (5,7) -
Step 4 : $Player_{row}$ removes dominated strategy Upper to maximize gain
Right Sum Middle (3,1) (3,1) Sum (3,1) -
Nash Equilibrium : (Middle, Right)
Let’s repeat where $Player_{row}$ is considered first.
-
Step 1 : $Player_{row}$ removes dominated strategy Upper to maximize gain
Left Centre Right Sum Middle (4,2) (2,0) (3,1) (9,3) Lower (1,5) (4,0) (2,1) (7,6) Sum (5,7) (6,0) (5,2) -
Step 2 : $Player_{col}$ removes dominated strategy Centre to maximize gain
Left Right Sum Middle (4,2) (3,1) (7,3) Lower (1,5) (2,1) (3,6) Sum (5,7) (5,2) -
Step 3 : $Player_{row}$ removes dominated strategy Lower to maximize gain
Left Right Sum Middle (4,2) (3,1) (7,3) Sum (4,2) (3,1) -
Step 4 : $Player_{row}$ removes dominated strategy Lower to maximize gain
Left Sum Middle (4,2) (4,2) Sum (4,2) -
Nash Equilibrium : (Middle, Left)
In Summary:
For $Player_{row}$ playing Middle is strictly dominant, as there are no choices which are result in a better outcome
For $Player_{col}$ playing Left is strictly dominant, but playing Right is weakly dominant
Interestingly in this game, when comparing the results (3,1) and (4,2), the relative difference is (+2, +0) for either, so anytime this game is played following the optimal Nash strategy approach, $Player_{row}$ will gain more than $Player_{col}$.
In a competitive market, where ultimately one player remains, $Player_{col}$ will lose out while the payoffs remain constant, and therefore must they adapt or modify the market environment, changing payoffs such that advantage is modified back in their favour to remain competitive.
Rational players
Its important to note that the above approach is only suitable when a game is considered between players who can consider the importance of gains and losses and therefore rationalize which is better. Adopting some strategies against opponent players who are playing randomly or consistently using poor strategies may result in reduced gains or excessive and increased losses.