Expected Payoff Matrices

Expected Payoff

Dr. Goodrich gave a hint on how to avoid running repeated-play simulations every generation: compute the long-run expected payoff matrix once and reuse it. The hint is quoted below:

The most efficient approach is to figure out what strategy A against strategy B would earn given a particular gamma, V(A|B) for all A and B pairs. This becomes the payoff matrix, and you use replicator or imitator dynamics on the V(A|B)'s.
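
To make the quoted idea concrete, here is a minimal sketch of a discrete-time replicator update driven by a precomputed V(A|B) matrix. The function names and the 2x2 matrix are hypothetical and chosen only for illustration. The update assumes all payoffs are positive, and it is one common form of replicator dynamics, not necessarily the variant intended in the hint.

```python
def replicator_step(shares, V):
    """One discrete-time replicator update.

    shares[i] is the population share of strategy i and V[i][j] is the
    expected payoff V(i|j). Assumes every payoff is positive.
    """
    n = len(shares)
    # Fitness of strategy i: expected payoff against the current population mix.
    fitness = [sum(V[i][j] * shares[j] for j in range(n)) for i in range(n)]
    # Population-average fitness, used to renormalize the shares.
    mean_fitness = sum(shares[i] * fitness[i] for i in range(n))
    return [shares[i] * fitness[i] / mean_fitness for i in range(n)]


def run_replicator(shares, V, generations):
    """Apply the replicator update for a fixed number of generations."""
    for _ in range(generations):
        shares = replicator_step(shares, V)
    return shares


# Illustrative 2x2 matrix (hypothetical numbers, not taken from this page).
V_example = [[3.0, 1.0],
             [5.0, 2.0]]
final_shares = run_replicator([0.5, 0.5], V_example, generations=50)
```

Because the second row is larger than the first in every column, the second strategy's share grows each generation in this example. No repeated-play simulation is needed inside the loop, which is the point of precomputing V(A|B).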

General Formula

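The formula for the long-term discounted expected reward is:

V(A|B) = Σ_{t=0}^{∞} γ^t U₁(A_t | B_t)

where γ (0 ≤ γ < 1) is the discount factor and U₁(A_t | B_t) is the first player's payoff in round t, when strategy A plays A_t and strategy B plays B_t.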
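
For the strategies on this page, play settles into either a constant outcome or a repeating cycle, so the infinite sum reduces to a geometric series. The following is a sketch of the three closed forms, assuming 0 ≤ γ < 1. The symbols a, b, c, c_i, and k are introduced here for illustration only.

```latex
% Constant stage payoff c in every round:
\sum_{t=0}^{\infty} \gamma^t c = \frac{c}{1-\gamma}

% Payoff a in the first round, then b in every later round:
a + \sum_{t=1}^{\infty} \gamma^t b = a + \frac{\gamma b}{1-\gamma}

% Payoffs c_0, c_1, \dots, c_{k-1} repeating with period k:
\sum_{m=0}^{\infty} \gamma^{mk} \sum_{i=0}^{k-1} \gamma^i c_i
  = \frac{\sum_{i=0}^{k-1} \gamma^i c_i}{1-\gamma^k}
```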

Games

We can store these pre-computed V(A|B) values here for the various games:

Prisoner's Dilemma

Payoff Matrix

	C	D
C	(R, R)	(S, T)
D	(T, S)	(P, P)

Where:

R = 3
T = 5
S = 1
P = 2

Expected Payoff Matrix

	AC	AD	TfT	NTfT
AC	R/(1-γ)	S/(1-γ)	R/(1-γ)	S/(1-γ)
AD	T/(1-γ)	P/(1-γ)	T + γP/(1-γ)	P + γT/(1-γ)
TfT	R/(1-γ)	S + γP/(1-γ)	R/(1-γ)	(S + Pγ + Tγ² + Rγ³)/(1-γ⁴)
NTfT	T/(1-γ)	P + γS/(1-γ)	(T + Pγ + Sγ² + Rγ³)/(1-γ⁴)	(P + Rγ)/(1-γ²)

Note: Read this matrix with the row as the first player and the column as the second. For example, the entry in row AC and column AD is V(AC | AD).
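
As a sanity check on entries like these, the discounted sum can be approximated by playing two strategies against each other for many rounds. This is a minimal sketch: the names `STAGE`, `STRATEGIES`, and `discounted_value` are hypothetical. NTfT is assumed to open with D and then play the opposite of the opponent's previous move, which is the reading consistent with the TfT and NTfT cycle entries in the table.

```python
# Stage payoffs to the row player, keyed by (own move, opponent move).
R, T, S, P = 3, 5, 1, 2
STAGE = {("C", "C"): R, ("C", "D"): S, ("D", "C"): T, ("D", "D"): P}

# Each strategy maps the opponent's previous move (None in round 0) to a move.
# Assumption: NTfT opens with D, then plays the opposite of the opponent's last move.
STRATEGIES = {
    "AC": lambda last: "C",
    "AD": lambda last: "D",
    "TfT": lambda last: "C" if last is None else last,
    "NTfT": lambda last: "D" if last in (None, "C") else "C",
}


def discounted_value(a, b, gamma, rounds=2000):
    """Approximate V(a|b) by truncating the discounted sum after `rounds` rounds."""
    last_a = last_b = None
    total, weight = 0.0, 1.0
    for _ in range(rounds):
        move_a = STRATEGIES[a](last_b)
        move_b = STRATEGIES[b](last_a)
        total += weight * STAGE[(move_a, move_b)]
        weight *= gamma
        last_a, last_b = move_a, move_b
    return total


gamma = 0.9
simulated = discounted_value("TfT", "NTfT", gamma)
closed_form = (S + P * gamma + T * gamma**2 + R * gamma**3) / (1 - gamma**4)
```

With γ = 0.9 the truncation error after 2000 rounds is negligible, so `simulated` and `closed_form` should agree to many decimal places.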

Stag Hunt

Payoff Matrix

	C	D
C	(5, 5)	(1, 3)
D	(3, 1)	(3, 3)

Note: The expected payoff matrix for this game has the same form as the one for the Prisoner's Dilemma above, using the following values:

R = 5
T = 3
S = 1
P = 3

Reformed Expected Payoff Matrix

This is the same expected payoff matrix as for the Prisoner's Dilemma, but since P = T in this game we've rewritten it with every P replaced by T.

Using:

R = 5
T = 3
S = 1

	AC	AD	TfT	NTfT
AC	R/(1-γ)	S/(1-γ)	R/(1-γ)	S/(1-γ)
AD	T/(1-γ)	T/(1-γ)	T + γT/(1-γ)	T + γT/(1-γ)
TfT	R/(1-γ)	S + γT/(1-γ)	R/(1-γ)	(S + Tγ + Tγ² + Rγ³)/(1-γ⁴)
NTfT	T/(1-γ)	T + γS/(1-γ)	(T + Tγ + Sγ² + Rγ³)/(1-γ⁴)	(T + Rγ)/(1-γ²)
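
With P = T, some entries collapse further. As a short worked check (assuming 0 ≤ γ < 1), the AD versus TfT entry simplifies to the same value as AD versus AC and AD versus AD:

```latex
T + \frac{\gamma T}{1-\gamma}
  = \frac{T(1-\gamma) + \gamma T}{1-\gamma}
  = \frac{T}{1-\gamma}
```

So in the Stag Hunt, AD earns T/(1-γ) = 3/(1-γ) against all three of those opponents.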

Battle of the Sexes

This matrix also lets each player choose a gender, (H) or (W), so each of the four repeated-game strategies appears twice, giving 8 strategies in total.

Where:

R = 3
T = 5
S = 1
P = 2

Battle of the Sexes Expected Payoff Matrix

	(H) AC	(H) AD	(H) TfT	(H) NTfT	(W) AC	(W) AD	(W) TfT	(W) NTfT
(H) AC	P/(1-γ)	S/(1-γ)	P/(1-γ)	S/(1-γ)	R/(1-γ)	S/(1-γ)	R/(1-γ)	S/(1-γ)
(H) AD	P/(1-γ)	R/(1-γ)	P + γR/(1-γ)	R + γP/(1-γ)	P/(1-γ)	T/(1-γ)	P + γT/(1-γ)	T + γP/(1-γ)
(H) TfT	P/(1-γ)	S + γR/(1-γ)	P/(1-γ)	(S + Rγ + Pγ² + Pγ³)/(1-γ⁴)	R/(1-γ)	S + γT/(1-γ)	R/(1-γ)	(S + Tγ + Pγ² + Rγ³)/(1-γ⁴)
(H) NTfT	P/(1-γ)	R + γS/(1-γ)	(P + Rγ + Sγ² + Pγ³)/(1-γ⁴)	(R + Pγ)/(1-γ²)	P/(1-γ)	T + γS/(1-γ)	(P + Tγ + Sγ² + Rγ³)/(1-γ⁴)	(T + Rγ)/(1-γ²)
(W) AC	T/(1-γ)	P/(1-γ)	T/(1-γ)	P/(1-γ)	R/(1-γ)	P/(1-γ)	R/(1-γ)	P/(1-γ)
(W) AD	S/(1-γ)	R/(1-γ)	S + γR/(1-γ)	R + γS/(1-γ)	S/(1-γ)	P/(1-γ)	S + γP/(1-γ)	P + γS/(1-γ)
(W) TfT	T/(1-γ)	P + γR/(1-γ)	T/(1-γ)	(P + Rγ + Sγ² + Tγ³)/(1-γ⁴)	R/(1-γ)	P/(1-γ)	R/(1-γ)	(P + Pγ + Sγ² + Rγ³)/(1-γ⁴)
(W) NTfT	S/(1-γ)	R + γP/(1-γ)	(S + Rγ + Pγ² + Tγ³)/(1-γ⁴)	(R + Tγ)/(1-γ²)	S/(1-γ)	P/(1-γ)	(S + Pγ + Pγ² + Rγ³)/(1-γ⁴)	(P + Rγ)/(1-γ²)
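
The 8x8 layout can also be generated programmatically. This is a rough sketch under stated assumptions: the page does not give the one-shot payoffs for this game, so the `STAGE` values below are inferred from the AC and AD cells of the table (each cell multiplied by 1-γ). NTfT is assumed to open with D and then play the opposite of the opponent's last move. All names (`STAGE`, `next_move`, `STRATEGIES_8`, `bos_value`) are hypothetical.

```python
from itertools import product

R, T, S, P = 3, 5, 1, 2

# Stage payoff to the row player, keyed by (row role, column role) and then by
# (row move, column move). Inferred from the AC/AD cells of the table; this is
# an assumption, not a definition given on this page.
STAGE = {
    ("H", "H"): {("C", "C"): P, ("C", "D"): S, ("D", "C"): P, ("D", "D"): R},
    ("H", "W"): {("C", "C"): R, ("C", "D"): S, ("D", "C"): P, ("D", "D"): T},
    ("W", "H"): {("C", "C"): T, ("C", "D"): P, ("D", "C"): S, ("D", "D"): R},
    ("W", "W"): {("C", "C"): R, ("C", "D"): P, ("D", "C"): S, ("D", "D"): P},
}


def next_move(rule, opponent_last):
    """Move chosen by `rule` given the opponent's previous move (None in round 0)."""
    if rule == "AC":
        return "C"
    if rule == "AD":
        return "D"
    if rule == "TfT":
        return "C" if opponent_last is None else opponent_last
    # NTfT (assumed): open with D, then play the opposite of the opponent's last move.
    return "D" if opponent_last in (None, "C") else "C"


# The 8 composite strategies: a role paired with a repeated-game rule.
STRATEGIES_8 = list(product(("H", "W"), ("AC", "AD", "TfT", "NTfT")))


def bos_value(row, col, gamma, rounds=2000):
    """Approximate V(row|col) by truncating the discounted sum after `rounds` rounds."""
    (row_role, row_rule), (col_role, col_rule) = row, col
    stage = STAGE[(row_role, col_role)]
    last_row = last_col = None
    total, weight = 0.0, 1.0
    for _ in range(rounds):
        move_row = next_move(row_rule, last_col)
        move_col = next_move(col_rule, last_row)
        total += weight * stage[(move_row, move_col)]
        weight *= gamma
        last_row, last_col = move_row, move_col
    return total


gamma = 0.9
matrix = [[bos_value(r, c, gamma) for c in STRATEGIES_8] for r in STRATEGIES_8]
```

Rows and columns of `matrix` follow the same order as the table: the four (H) strategies first, then the four (W) strategies.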
