Game Theory

14 Jan 2021

Game Theory is the general name given to a collection of mathematical proofs to assess possible strategies for players in competitive and cooperative scenarios

Normal Form Games

All normal form games:

\[(N, A, u)\]

Cooperative Games vs. Competitive Games

Common Pay-off vs. Zero Sum Pay-off

Strategies

Strategies for Normal Form Games

Other terms

Examples

Two vehicles on a Road (Chicken)

Type: Normal Form Common Payoff Cooperative Game

If two vehicles, approaching one another on the same section of road, choose which side of the road to use, they will either collide or pass one another.

LeftRight
Left(1,1)(0,0)
Right(0,0)(1,1)

Prisoners Dilemma

Type: Normal Form Cooperative Game

SilenceConfess
Silence(-1,-1)(-4,0)
Confess(0,-4)(-3,-3)

Generally, if $c > a > d > b$ means always a Prisoners Dilemma game

Option AOption B
Option A(a,a)(b,c)
Option B(c,b)(d,d)

Rock Paper Scissors

RockPaperScissors
Rock(0,0)(-1,1)(1,-1)
Paper(1,-1)(0,0)(-1,1)
Scissors(-1,1)(1,-1)(0,0)

Optimal Strategies

Pareto Domination

Whenever a player may select an optimal Strategy Profile $s$ without negatively impacting other players, then the Strategy Profile $s$ Pareto Dominates other Strategy Profiles $s’$

Using Pareto Domination we can sort some of our choices, to select better choices

Pareto Optimization

Whenever a Strategy Profile Pareto Dominates all other Strategy Profiles, it is considered Pareto Optimized

This is not just about Action Profile optimization but Strategy Profile optimization of the game.

Nash Equilibrium

If a competitive game is considered from a single player’s perspective, then we can see that the player, who can reason, will attempt to use opportunities to gain advantage.

If a player has perfect information and knows what other players strategies will be, then the player may select the best strategy to maximize payoff.

If all players select the best strategy given they know the strategies of other players, then an equilibrium is achieved (i.e. no player is playing a non-optimal strategy).

This is known as a Nash Equilibrium, and it is accepted that players would always select a best strategy determined by Nash Equilibrium, as it will result in the best outcome for the individual.

Sometimes, there isn’t a single strategy which a player may select to ensure a best response to a game. Whenever this occurs, the strategy profile is called a Weak Nash. A Weak Nash equilibrium exists whenever several optimal strategies exist and the payoff is equal to other optimal strategies

A Strict Nash equilibrium exists whenever only a single best strategy can be selected by all players.

Note: A Nash Equilibrium exists for any game with a finite set of players and finite set of actions

MaxMin Strategy

When a player is playing defensively, a player may protect themselves from aggressive opponents (whose goal it is to maximize loss to opponents), by selecting the maximum payoff in a worst case scenario, or the best minimum guaranteed payoff

Note: Whenever a player moves first, the MaxMin strategy is always the best strategy for that player, given other players can respond to the strategy

MinMax Strategy

Whenever a player is playing in a dominant position, they can ensure their opponents receive the least payoff possible, before considering the payoff to themselves.

Note: Opponent players may then select to play a strategy to minimize their loss in a worst case scenario.

Minimax Strategy

In a Zero Sum game, MinMax and MaxMin strategies are the same subset of possible strategies, and for this reason, they’re called Minimax Strategies

Regret

Regret is the formal term given to the loss that a player has incurred as a result of playing the incorrect strategy given the strategy played by other players.

It is pretty easy to calculate for player $i$, after a game, knowing the strategy of other players as follows:

\[\text{Regret }_i = \text{Best Payoff }_i - \text{ Actual Payoff}_i\]

Precalculating regret helps select the strategy to minimize regret, ahead of time.

Minimax Regret Action

Minimax Regret Action is the action which results in the smallest maximum regret

Dominant and Dominated Strategies

Strategies which are better than others are dominant. Strategies which are worse than others, are dominated.

Iterative removal of strictly dominated strategies

During analysis of a game, if players take turns to eliminate strategies over and over again, which are dominated for themselves, they will arrive at a smaller set of options which are better.

Iterated removal of strictly dominated strategies preserves Nash equilibria. However the result of reducing the strategies may not full remove all options (Ie. consider the Prisoner’s Dilemma, which cannot be further reduced).

Here’s an example (where players are maximizing their own gains, before minimizing their opponents gains), considered where $Player_{col}$ actions are assessed first.

Let’s repeat where $Player_{row}$ is considered first.

In Summary:

For $Player_{row}$ playing Middle is strictly dominant, as there are no choices which are result in a better outcome

For $Player_{col}$ playing Left is strictly dominant, but playing Right is weakly dominant

Interestingly in this game, when comparing the results (3,1) and (4,2), the relative difference is (+2, +0) for either, so anytime this game is played following the optimal Nash strategy approach, $Player_{row}$ will gain more than $Player_{col}$.

In a competitive market, where ultimately one player remains, $Player_{col}$ will lose out while the payoffs remain constant, and therefore must they adapt or modify the market environment, changing payoffs such that advantage is modified back in their favour to remain competitive.

Rational players

Its important to note that the above approach is only suitable when a game is considered between players who can consider the importance of gains and losses and therefore rationalize which is better. Adopting some strategies against opponent players who are playing randomly or consistently using poor strategies may result in reduced gains or excessive and increased losses.