What is the Prisoner's Dilemma?

A game where two players each choose to cooperate or defect. Defection is individually rational but mutually harmful, creating a tension between self-interest and collective good.

What is Nash equilibrium?

A set of strategies where no player can gain by unilaterally changing their strategy. In the single-round PD, the Nash equilibrium is (Defect, Defect), even though (Cooperate, Cooperate) is better for both.

A strategy that cooperates on the first round, then copies the opponent's previous move. It won Robert Axelrod's famous iterated PD tournaments in 1980.

Why does Tit-for-Tat work so well?

It is 'nice' (never defects first), 'retaliatory' (punishes defection immediately), 'forgiving' (returns to cooperation after the opponent does), and 'clear' (easy for opponents to model).

What is Grim Trigger?

A strategy that cooperates until the opponent defects even once, then defects forever. It is a strong deterrent but unforgiving — one mistake ruins cooperation permanently.

Where does the Prisoner's Dilemma appear in real life?

Arms races (countries cooperate by disarming or defect by arming), price wars (firms cooperate by keeping prices high or defect by cutting prices), and climate agreements (nations cooperate to reduce emissions or defect to maximize economic output).

Prisoner's Dilemma Calculator & Strategy Simulator

Explore the Prisoner's Dilemma — payoff matrices, Nash equilibrium, iterated games with Tit-for-Tat, Grim Trigger, Pavlov, and more strategies.

Prisoner's Dilemma Calculator

Mode

Reward (R) — Both Cooperate

Temptation (T) — I Defect, You Cooperate

Punishment (P) — Both Defect

Sucker (S) — I Cooperate, You Defect

Player 1 Strategy

Player 2 Strategy

Rounds

P1 Total Score

19.0

tit-for-tat: avg 0.95/round over 20 rounds

P2 Total Score

24.0

always-defect: avg 1.20/round over 20 rounds

Nash Equilibrium

Both Defect (D,D)

The stable outcome where neither player gains by switching unilaterally

Valid Dilemma?

✅ T>R>P>S, 2R>T+S

Standard PD requires T > R > P > S and 2R > T + S

P1 Cooperation Rate

5.0%

Cooperated 1 of 20 rounds

P2 Cooperation Rate

0.0%

Cooperated 0 of 20 rounds

Payoff Matrix

	P2 Cooperate	P2 Defect
P1 Cooperate	R, R = 3, 3	S, T = 0, 5
P1 Defect	T, S = 5, 0	P, P = 1, 1

Round-by-Round Results

Round	P1	P2	P1 Pay	P2 Pay	P1 Cum	P2 Cum
1	C	D	0	5	0	5
2	D	D	1	1	1	6
3	D	D	1	1	2	7
4	D	D	1	1	3	8
5	D	D	1	1	4	9
6	D	D	1	1	5	10
7	D	D	1	1	6	11
8	D	D	1	1	7	12
9	D	D	1	1	8	13
10	D	D	1	1	9	14
11	D	D	1	1	10	15
12	D	D	1	1	11	16
13	D	D	1	1	12	17
14	D	D	1	1	13	18
15	D	D	1	1	14	19
16	D	D	1	1	15	20
17	D	D	1	1	16	21
18	D	D	1	1	17	22
19	D	D	1	1	18	23
20	D	D	1	1	19	24

Score Comparison

Strategy Reference

Strategy	Rule	Strengths
Always Cooperate	Always play C	Maximizes mutual gain; exploitable
Always Defect	Always play D	Can't be exploited; misses cooperation
Tit-for-Tat	Start C, copy opponent's last move	Nice, retaliatory, forgiving; won Axelrod's tournament
Grim Trigger	C until opponent defects, then always D	Strong deterrent; unforgiving
Pavlov	Repeat if won; switch if lost	Self-correcting; exploits cooperators
Random	50/50 each round	Unpredictable; suboptimal baseline

Planning notes, formulas, and examples

About the Prisoner's Dilemma Calculator & Strategy Simulator

The Prisoner's Dilemma is the most studied model in game theory. Two players independently choose to cooperate (C) or defect (D). Mutual cooperation yields a moderate reward (R) for both, mutual defection yields a low punishment (P), but if one defects while the other cooperates, the defector gets the highest temptation payoff (T) while the cooperator gets the sucker's payoff (S). The dilemma: individually, defection is always rational (it dominates), yet mutual cooperation yields a better outcome for both.

This calculator lets you explore both single-round and iterated versions of the game. In single-round mode, you pick each player's choice and see the payoff. In iterated mode, you assign strategies — Tit-for-Tat, Always Defect, Grim Trigger, Pavlov, Random — and watch them play over many rounds. The round-by-round table shows every move, payoff, and cumulative score, while the output cards summarize Nash equilibrium, cooperation rates, and total scores.

Game theory's insights apply far beyond academic puzzles: international relations (arms races), business (price wars), biology (reciprocal altruism), and technology (protocol design) all involve variants of the Prisoner's Dilemma. This calculator makes the abstract logic concrete and explorable.

When This Page Helps

Game theory is essential in economics, political science, biology, and computer science, and the Prisoner's Dilemma is its most important building block. However, textbooks present payoff matrices statically. This simulator brings them to life — you can watch strategies interact round by round, see cooperation rates evolve, and discover why "nice" strategies like Tit-for-Tat outperform "nasty" ones in the long run.

It is ideal for students learning game theory, instructors building interactive lectures, and professionals exploring strategic interaction in negotiations, auctions, or protocol design.

How to Use the Inputs

Set the payoff values: Reward (R), Temptation (T), Punishment (P), and Sucker (S).
Choose single-round mode to set each player's choice manually, or iterated mode for strategy simulation.
In iterated mode, select strategies for both players and the number of rounds.
Read the output cards for total scores, Nash equilibrium, and cooperation rates.
Examine the payoff matrix to understand each outcome.
Review the round-by-round table in iterated mode to see move-by-move dynamics.
Use presets to explore classic matchups like TFT vs Always Defect.

Formula used

Payoff matrix: (C,C)→(R,R), (C,D)→(S,T), (D,C)→(T,S), (D,D)→(P,P). Standard PD: T > R > P > S and 2R > T + S. Nash equilibrium of single-round PD: (D,D). Tit-for-Tat: start C, then copy opponent's last move.

Example Calculation

Result: P1: 22, P2: 24

TFT cooperates on round 1 (gets S=0), then defects for the remaining 19 rounds (gets P=1 each). Always Defect gets T=5 once, then P=1 for 19 rounds. TFT loses slightly because of the initial exploitation.

Tips & Best Practices

Verify T > R > P > S and 2R > T + S to ensure a valid dilemma — the tool warns you otherwise.
Run TFT vs TFT to see the power of mutual cooperation: both get R every round.
Compare Always Defect against multiple strategies to see why it loses in tournaments.
Try Grim Trigger vs TFT — one accidental defection (in a noisy variant) makes Grim devastating.
Increase rounds to 50+ to see long-run average payoffs converge.
Use Random as a baseline to compare how much smarter strategies improve over pure chance.

History and Axelrod's Tournament

The Prisoner's Dilemma was formalized by Merrill Flood and Melvin Dresher at RAND in 1950, and Albert Tucker gave it its name. In 1980, political scientist Robert Axelrod invited game theorists to submit strategies for an iterated PD computer tournament. Anatol Rapoport's simple Tit-for-Tat strategy won both the original tournament and a much larger follow-up. The result was surprising: the winning strategy was the simplest submitted, and it never "won" a single round-pair — it succeeded by fostering cooperation.

Evolutionary Game Theory

In evolutionary biology, the Prisoner's Dilemma models reciprocal altruism. If organisms interact repeatedly, strategies like Tit-for-Tat can evolve and sustain cooperation in populations. This was a key insight of Axelrod and Hamilton's 1981 paper "The Evolution of Cooperation." The Prisoner's Dilemma also models the evolution of virulence in parasites, the maintenance of honest signaling, and the stability of mutualistic relationships.

Beyond Two Players

The N-player Prisoner's Dilemma (also called the Tragedy of the Commons) generalizes the model. Each individual benefits from defecting (free-riding), but if everyone defects, the shared resource collapses. Real-world examples include overfishing, pollution, and vaccine hesitancy. Solving these multi-player dilemmas requires institutional mechanisms: regulation, taxation, reputation systems, or repeated interaction — all of which can be understood through the lens of game theory.

Sources & Methodology

Last updated: January 15, 2025

Frequently Asked Questions

A game where two players each choose to cooperate or defect. Defection is individually rational but mutually harmful, creating a tension between self-interest and collective good.