MTT — Strategy Divergence from Cash NLHE

About this book

MTT is the same cards with a different reward signal — here's how the model plays it

MTT is Cash NLHE wrapped in a different reward signal: ICM equity delta plus optional bounty, no rake. The underlying card game is identical. Every divergence from Cash in this book comes from the tournament wrapper, not from the cards. This book catalogues those divergences mechanism by mechanism, using Cash as the baseline for every comparison and our own solver as the source of truth.

Eight mechanisms (M1–M8) emerged from 914 solver queries (497 MTT + 417 Cash) covering preflop VPIP by position × stack × stage, bounty-mode deltas (PKO, flat-KO, mystery), BB defense by opener × sizing × stack × stage, and flop cbet across six canonical textures. Each mechanism is T1 at single-checkpoint: reproduced across multiple positions, stacks, stages, or textures with converging evidence.

The scope covers preflop and flop only. Turn and river are not systematically queried. All postflop cells are heads-up. Satellite / super-steep-ICM / WTA, adaptive sizing, blind escalation, and re-entries are out of scope — see Part 9 for the full scope boundary.

Methodology

Every number in this book was queried against the trained model checkpoint universal-dense-v4-player_20260402_150328.onnx via the preview inference server. The B1 trust gate passed 40/55 on 2026-04-16 with identical fail set to Cash, and the Cash → MTT E4 transfer test passed — confirming that MTT inherits Cash's data-reliability profile exactly.

T-status is recorded per mechanism. All eight mechanisms sit at T1 (single-checkpoint reproduction across multiple positions, stacks, or textures). External literature (60 MTT-LIT claims across 8 clusters) and external solvers (45 MTT-SOL entries across 9 sections) provide theoretical grounding and comparison benchmarks. Our numbers come from our solver; the external corpus is never cited as authoritative for our claims.

Scope caveat: All data is preflop + flop. Four live data-reliability issues are filed — KI-12 (mystery-bounty collapse to normal), MC-5 (bounty directional anomaly), MC-6 (SB deep-stack wider-than-Cash behavior), MC-7 (BB defense widening at zero-ICM stages). Do not extrapolate any claim beyond its tested position × stack × stage × texture cell.

The 8 MTT mechanisms

How MTT diverges from Cash NLHE across stage, bounty, position, and stack

Preflop · Deep non-SB

M1 — Cash-transfer at 100bb UTG/MP/CO/BTN

The deepest and earliest non-SB cells match Cash within noise (|Δ VPIP| < 2 pts at 100bb UTG/MP/BTN). Tournament structure does not reshape play at these positions and depths; Cash strategy transfers directly.

Preflop · SB regime

M2 — SB opens much wider in MTT

SB opens +14–20 pts wider than Cash at deep stacks, driven by limping. Not solver-backed at deep stacks — treat as model behavior (filed as MC-6). SB is effectively a separate regime from the other four positions.

Preflop · Bounty

M3 — Bounty pressure tightens, in the wrong direction

Flat-KO tightens ranges more than PKO (mean Δ VPIP −3.7 vs −1.4 pts). Textbook theory and external solvers (MTT-SOL-30, MTT-SOL-32) predict the opposite. Direction contradicts the corpus; filed as MC-5.

Preflop · Mystery bounty

M4 — Mystery bounty ≡ Normal in the model

Mystery-bounty cells collapse to normal (20/20 cells with |Δ| < 0.1). The mystery-specific CUDA training mechanics do not surface in preflop action frequencies. Filed as KI-12.

Stage encoding

M5 — Stage binning is binary, not graduated

The model collapses early/mid/late into one regime and bubble/itm/ft into another. Within-group delta is under 0.7 pts. Switch happens between 25% and 15% of the field alive; literature (MTT-SOL-26) says 37–50%. Filed as MC-6.

Preflop · BB defense

M6 — BB defense widens across every cell

BB defense widens +10–17 pts across every queried opener × raise × stack × stage, driven by Call% up and 3-bet% down. Holds even at early MTT where ICM pressure should be absent. Direction contradicts MTT-LIT-56 / MTT-SOL-27; filed as MC-7.

Flop · Cbet

M7 — Cbet elevates on dry and coordinated boards

Flop cbet frequency rises +11 to +29 pts vs Cash on 5 of 6 canonical textures. Paired-low 772p is the exception (−7 to −11 pts). Sizing also creeps up 0.1–0.2 pot, contradicting MTT-LIT-58 which predicts smaller sizing under the MTT wrapper.

Preflop · Short stack

M8 — SB short-stack crossover at 8–10bb

Non-SB short-stack push-fold (5–15bb at UTG/MP/CO/BTN) is Cash-like. SB short-stack has a distinctive pattern: 5–7bb wider than Cash, 10–15bb tighter than Cash. Crossover happens around 8–10bb.

Part 1

The Cash-like Regime (100bb UTG/MP/CO/BTN)

Most of your Cash NLHE playbook already works in MTT. That is the single most useful finding in this book.

At 17.6bb effective stacks, early tournament stage, no bounty active, the model's preflop opening frequencies from UTG, MP, CO, and BTN are within +2.8pp of their Cash equivalents. The positional hierarchy — UTG tightest, BTN widest — carries over exactly. So do the core theories: range advantage is positional, nuts advantage is distinct from equity advantage, blockers affect sizing, and in-position always realizes more equity than out-of-position.

If you coach tournament players who default to Cash ranges during the first few blind levels, the data says they're not wrong. They're approximately optimal at deep non-SB positions.

How MTT differs from Cash under the hood

This is not a Cash game with a different name. MTT replaces chip-EV with ICM equity as the optimization target, adds an optional bounty layer, and removes rake. The model sees one hand at a time — no blind escalation, no table balancing — but it receives the tournament's remaining-field ratio as an input signal. These structural differences create large strategic shifts at SB, in BB defense, and in bounty modes. They just happen to be invisible from UTG/MP/CO/BTN at deep stacks.

What transfers cleanly

The following Cash theories apply unchanged in MTT at deep non-SB positions:

Range advantage is positional and persistent. The preflop range hierarchy carries forward to the flop.
Nuts advantage is distinct from equity advantage. Holding more nut-eligible hands drives betting frequency and sizing.
Range composition determines outlier realization. Rare strong hands in weak-dominated ranges over-realize.
Checks condense ranges. When you check a spot where you'd bet your strongest hands, the remaining range narrows — standard Bayesian mechanics, format-neutral.
Protection betting is real but narrow. The two-condition rule (extract value + fold out live equity) transfers; MTT doesn't add a new protection mechanism.
Position improves equity realization universally. The in-position advantage persists under ICM.
Positional VPIP gradient. UTG < MP < CO < BTN. Same ordering, same magnitudes.
OOP equity realization is strictly lower. Acting last is format-neutral.
Range composition by position shapes postflop action. Opener position determines cbet dynamics on texture-matched boards.

The 100bb VPIP parity table

Measurement conditions: 6-max NL, MTT-early (1000/1000 alive, no bounty), Cash baseline = same model with game_mode_code: normal and no tournament wrapper.

Opening VPIP at 100bb — Cash vs MTT-early. MTT-early stage · 1000/1000 alive · normal bounty · 6-max NL · 100bb effective

Position	Cash VPIP	MTT-early VPIP	Δ (pp)
UTG	17.2	17.6	+0.4
MP	22.9	23.0	+0.1
CO	28.1	30.9	+2.8
BTN	43.3	44.1	+0.8

Source: mtt-deltas.md Table 1, 100bb rows.

UTG (+0.4pp) and MP (+0.1pp) are nearly identical to Cash. BTN (+0.8pp) sits inside the noise floor. CO (+2.8pp) is the widest deviation and still well within coaching-irrelevant territory. Published MTT theory predicts ranges "within 1–2pp of Cash at deep stack early stage" — three of four positions sit inside that envelope; CO lies just above it.

What this means in practice: If you are coaching deep-stack early-tournament preflop opens from UTG, MP, CO, or BTN, your Cash charts apply. Do not re-calibrate them for MTT.

Stack-depth sweep — does the parity hold as you shorten?

Δ VPIP (MTT-early minus Cash) across stack depths. MTT-early stage · 1000/1000 alive · normal bounty · 6-max NL · stack depth sweeps columns

Position	100	50	25	20	15	12	10
UTG	+0.4	−0.7	+0.2	+0.6	+0.5	−0.5	−1.0
MP	+0.1	−0.4	−0.6	−0.4	+0.2	−0.1	−1.2
CO	+2.8	+1.7	+0.8	+0.6	−0.4	−1.5	−2.1
BTN	+0.8	+2.2	+3.6	+3.2	+0.5	−1.1	−1.1

Source: mtt-deltas.md Table 1.

Same data, visualized. The stack-depth curve shows how Δ VPIP evolves from 100bb to 10bb for each position.

Source: mtt-deltas.md Table 1.

UTG stays inside ±1pp at every depth. MP stays inside ±1.2pp. CO's full range spans about 5pp but never exceeds the practical adjustment threshold. BTN is the outlier: at 25bb (+3.6) and 20bb (+3.2) it opens noticeably wider than Cash. This is plausibly a small steal-adjustment effect — not large enough to prescribe from one round of data alone, but worth flagging.

AI% parity

All-in frequency at selected depths. MTT-early stage · 1000/1000 alive · normal bounty · 6-max NL · stack depth sweeps columns

Position	15bb Cash	15bb MTT	10bb Cash	10bb MTT
UTG	0.3	0.2	7.6	6.2
MP	0.0	0.0	4.1	3.6
CO	0.1	0.1	7.5	8.9
BTN	5.3	6.0	23.5	21.5

Source: mtt-deltas.md Table 1.

At every stack depth from 15bb up, non-SB all-in frequencies are within 2pp of Cash. Even at 10bb the largest deviation is CO (8.9% MTT vs 7.5% Cash). The Harrington M-ratio zones describe the same qualitative shift that happens in Cash at the same depth — push-fold is not fundamentally different for non-SB positions in MTT.

What this means in practice: Non-SB short-stack push-fold in this model is effectively the Cash short-stack strategy. For ICM-aware push-fold, use dedicated push-fold tools (HRC, ICMizer) rather than this model. See Part 4 for the full short-stack treatment.

What changes as you shorten

The deep-stack equivalence holds cleanly across 100/50/25bb for UTG, MP, and CO. BTN shows a soft widening at 25–20bb, consistent with a mild steal-frequency adjustment but too small to coach on in isolation.

The real divergences — and the rest of this book — live elsewhere. SB is a fundamentally different position in MTT, and that story starts in Part 2.

What this part does not cover

Part 2 — SB regime. SB plays dramatically wider in MTT than Cash. Do not extend the "Cash charts apply" conclusion to SB.
Part 3 — Bounty pricing. PKO and flat-KO produce direction anomalies the model has not resolved.
Part 5 — BB defense. BB defends wider in MTT across every cell we tested. The mechanism is structural, not ICM-driven.
Part 6 — Flop c-bets. Cbet frequency jumps on most board textures, with one paired-low exception.
Part 7 — Stage binning. The model treats tournament stage as binary, not graduated — a known limitation.

4 practical takeaways from the cash-like regime

At 100bb MTT-early, keep your Cash opening ranges for UTG, MP, CO, and BTN. The largest deviation is CO at +2.8pp — well within noise.
At 25–20bb BTN, consider widening your Cash steal range by roughly 3pp. Treat this as a soft adjustment, not a hard prescription — the signal is directional, not confirmed across multiple data rounds.
Do NOT extend the "Cash charts apply" conclusion to SB. SB diverges by +19.5pp at 100bb. That is a different regime entirely — see Part 2.
Non-SB all-in frequencies at 10–15bb track Cash within 2pp. The Harrington zone shift you know from Cash applies here. For ICM-precise push-fold, use dedicated push-fold solvers — Part 4 has the details.

Research notes

Details for readers interested in the methodology behind the findings above. Skip this section if you just want the practical takeaways.

The cross-game transfer test is the load-bearing evidence. The B1 property-test suite runs 55 structural checks on the model. MTT passes 40/55 — the identical 40 checks Cash passes, with the identical 15 failures. The specific check B1 E4 (Cash → MTT) passes, confirming the model produces the same action distribution at matched cells when only the tournament wrapper differs. Without this gate, the VPIP-parity numbers could reflect implementation artifacts (cache leakage, format-ID mis-routing). The B1 pass set guarantees MTT is not introducing new systematic bugs relative to Cash. It does NOT guarantee that MTT-specific behaviors (bounty pricing, stage binning) are solver-correct — those are flagged individually in Parts 3 and 7.
"MTT-early" is a specific measurement condition, not a label for "early in the tournament." In the dataset, MTT-early means total_alive_players = 1000 / total_entries = 1000 (full field, nobody eliminated), bounty_type = normal (no bounty), prize pool present but ICM pressure near-zero. The training model sees only left_ratio = alive / entries as a stage signal — no explicit stage category. At left_ratio = 1.0, ICM distortion is minimal because no player is near elimination. A coach applying these numbers to a hand played one level in, full field remaining, no bounty active, is inside the measurement condition. A coach applying them to a hand at the bubble (16% alive) is outside it — see Part 7 for the bubble-onward regime.
BTN at 25bb and 20bb is the only non-SB, non-shallow cell that exceeds the published ±2pp envelope. At +3.6 (25bb) and +3.2 (20bb), BTN opens wider than the literature's "within 1–2pp of Cash at deep stack early stage" benchmark. We report this as a directional signal, not a coaching prescription, because (a) it is a single data round, (b) the neighboring cells (50bb at +2.2, 15bb at +0.5) collapse back inside ±2pp, and (c) no external solver baseline confirms the direction at this specific depth. If Round 2 replicates, it would support a soft ~3pp BTN steal-widen at 20–25bb MTT; until then, hold your Cash chart.

Part 2

SB as a separate regime

If you coach MTT players using Cash SB charts at deep stacks, you are coaching them to fold too much. The gap is not subtle.

At +19.5 percentage points wider than Cash at 100bb, the MTT small blind is the single largest positional divergence between the two formats in our data. Every other position — UTG, MP, CO, BTN — stays within a few points of Cash at the same stack depth. SB doesn't just drift wider. It plays a fundamentally different game.

Measurement conditions: SB at MTT-early stage (1000/1000 alive, no bounty), 6-max NL, Cash baseline same model with game_mode_code: normal. The SB regime ends at the push-fold crossover (12–10bb) — see Part 4.

The SB VPIP gap by stack depth

SB opening frequency, Cash vs MTT-early. MTT-early stage · 1000/1000 alive · normal bounty · 6-max NL · SB · stack depth sweeps rows

Stack	Cash SB VPIP	MTT SB VPIP	Δ (pp)
100bb	57.9%	77.4%	+19.5
50bb	59.9%	73.7%	+13.8
25bb	55.7%	71.8%	+16.1
20bb	54.2%	68.3%	+14.1
15bb	55.3%	64.1%	+8.8
12bb	59.8%	62.1%	+2.3
10bb	63.6%	63.0%	−0.6

Source: mtt-deltas.md Table 1, SB rows.

Same data, visualized. The SB VPIP gap compresses steadily from 100bb to 12bb and flips at 10bb.

Source: mtt-deltas.md Table 1, SB rows.

At 100bb the MTT SB enters the pot 77.4% of the time — nearly four hands out of five. The widening holds through 50bb (+13.8pp), 25bb (+16.1pp), and 20bb (+14.1pp). It compresses at 15bb (+8.8pp) and almost vanishes at 12bb (+2.3pp). By 10bb the SB is actually -0.6pp — the two formats converge and then the push-fold regime takes over.

Critically, the all-in frequency at 100bb is 0.0% in both Cash and MTT. The extra VPIP is not shoves. It is limps and raises.

What this means in practice: If you are coaching an MTT player's SB at 50–100bb and your reference chart comes from Cash, you are leaving a significant amount of VPIP on the table. The model says the SB should be entering the pot far more often than Cash — primarily through completing, not through raising or jamming.

Most of the extra VPIP is limps

At 100bb MTT-early, the SB enters with a limp roughly 68% of hands, computed from the open-action breakdown in the raw data. That is the majority of the 77.4% VPIP figure — the SB is completing into the pot, not raising wider.

One caveat: that 68% limp anchor is a single measurement at 100bb. The full limp/raise/fold breakdown across all seven stack depths lives in the raw batch data and has not been published as a sweep. We have one anchor, not a curve.

Why this happens

Three forces compound in the SB's favor in MTT.

Based on general poker theory

The ante adds dead money to the pot before action begins, which improves the SB's immediate pot odds for completing. MTT raise sizes trend smaller than Cash raises, which shifts the EV comparison toward limp-then-defend. And BB defends much wider in MTT (covered in Part 5), which means the SB's fold equity when raising is lower — making completing relatively more attractive than raising.

Falsifier: if the ante were removed and BB defense returned to Cash width, the SB widening should collapse. Our Round 1 data does not include an ante-isolated control, so this prediction is untested.

Published tournament theory attributes deep-stack SB widening primarily to antes. The model's direction aligns with that literature. Its magnitude does not — the model overshoots what published charts report.

A scope-and-magnitude warning

The direction of this finding — SB plays wider in MTT than in Cash at deep stacks — has partial support from external theory on ante-driven opening. The magnitude is a different story.

The +19.5pp gap at 100bb has no direct solver corroboration. External solver baselines for SB at deep-stack MTT-early are sparse; what does exist covers short-stack or bounty scenarios, not the matched 100bb normal-bounty cell. Even at shallow stacks where external baselines exist, the model SB is lighter than some Nash references — at 10bb the model shows 63.0% VPIP, while one external baseline reports SB at the same depth around 75%.

Coaching guidance: the MTT SB regime is real and directional. The 68% limp rate at 100bb is model behavior, not a verified prescription. Use a published MTT SB chart as your prior and widen from there — but don't adopt the model's exact frequencies without a solver cross-check.

Where the regime ends

At 12bb the SB gap collapses to +2.3pp. At 10bb it flips to -0.6pp — the MTT SB is now fractionally tighter than Cash. Below this threshold, push-fold takes over.

Part 4 covers the SB push-fold crossover in detail. The preview: at 5bb the MTT SB jams +14.7 percentage points wider than Cash. At 10–15bb the sign flips and the MTT SB jams -8.4 to -11.0 percentage points tighter. The crossover sits between 7bb and 10bb.

What we didn't test in Part 2

Limp-vs-raise breakdown beyond 100bb. The 68% limp anchor is a single measurement. The full sweep across seven stack depths needs publication from the raw batch data before coaches can calibrate limp frequencies at 50/25/20bb.
SB 3-bet response when facing an opener. Part 2 covers SB opening frequencies, not SB's response to someone else's raise. BB defense and 3-bet dynamics are in Part 5.
Ante-isolated re-test. A Round 2 batch would toggle the ante on and off while holding all other variables fixed, isolating how much of the SB widening is purely ante-driven. Without it, the three-force mechanism above is a plausible interpretation, not a confirmed decomposition.

Three practical takeaways for SB play

At 50–100bb MTT-early, your Cash SB chart is much too tight. Widen significantly toward completing. The model shows the SB entering the pot at 74–77% at deep stacks, with most of the extra frequency coming from limps.
Treat the 68% limp rate at 100bb as direction, not magnitude. Use a published MTT SB chart as your prior. The model's direction is supported by ante-driven widening theory; the exact frequency exceeds what published references corroborate.
The crossover happens between 12bb and 10bb. Below that threshold, the deep-stack widening disappears and push-fold dynamics take over. See Part 4 for SB shove-or-fold prescriptions.

The rule of thumb The MTT SB is a different position from the Cash SB at every stack above 12bb. The deeper the stack, the bigger the difference — and the difference is mostly limps, not raises.

Research notes

Details for readers interested in the methodology behind the findings above. Skip this section if you just want the practical takeaways.

The 68% limp rate is a single anchor measurement. This figure is computed from the SB 100bb MTT-early open-action breakdown in the raw Round 1 batch data, cited in the strategy manual's Pillar A section (line 195). The published delta tables (mtt-deltas.md) report aggregate VPIP and AI%, not the limp/raise/fold decomposition. A future Round 2 batch (M_m5_sb_limp_decomp in the research queue) would decompose SB VPIP into limp%, raise%, and fold% across all seven stack depths. Until that batch runs, the 68% number is a single-cell anchor — directionally informative but not a verified curve.
Magnitude classification: model-only. The +19.5pp gap at 100bb has no direct external-solver anchor at the matched cell. Cross-referencing against external baselines at shorter stacks: one reference (HRC, 10bb SB) reports roughly 75% VPIP at 10bb, while our model reports 63.0% — approximately 12 points lighter. Another reference (HRC, 15bb SB) reports roughly 79% VPIP; our model reports 64.1%. Even where the model SB is much wider than Cash, it is still tighter than Nash push-fold baselines at the same depths. Direction confirmed; magnitude ungrounded at the deep-stack extremum. The mechanism entries (MTT-TH-A5, C4, M5 in mtt-theory-foundation.md) carry the corresponding magnitude qualifier; the C4 verdict-evidence worksheet in mtt-baselines.md documents the external-solver cross-reference in full.
Causal account is multi-factor, single-checkpoint. The causal analysis in causal-explanations.md §M5 attributes the deep-stack SB widening to a combination of three factors: wider BB defense raising the cost of the SB's fold option; ante-sweetened pot odds favoring completion; and smaller MTT raise sizes tilting the raise-vs-limp EV comparison. No single factor has been isolated. All mechanism evidence comes from one model checkpoint; retraining-robustness has not been established. The strategy manual's mechanism crosswalk flags M5 as single-checkpoint throughout.

Part 3

Bounty pricing (PKO, flat-KO, mystery)

Two things jumped out the moment we ran the bounty sweep.

First, mystery bounty has two distinct regimes. Pre-ITM, the model treats mystery exactly like normal — identical output on every cell. Post-ITM, mystery tightens VPIP by up to -8.7pp versus normal. The split traces to a specific rule in the training code, and it is both real and useful.

Second, the model's PKO and flat-KO direction is sign-flipped versus published MTT theory. Where the literature predicts bounty modes should widen ranges, our model tightens — and flat-KO tightens more than PKO, which is the opposite ordering the textbooks predict. This is a model behavior flag, not a coaching prescription.

Here is the data.

3.1 How the four bounty modes work

The MTT training code supports four bounty types on top of the base ICM reward.

Normal (no bounty). Pure ICM equity delta. No head bounty, no knockout bonus. This is the control condition in every comparison below.

Flat-KO (flat knockout). Every player carries a fixed head bounty. On elimination, the killer collects the victim's entire head bounty instantly — no compounding, no carry.

PKO (progressive knockout). Buy-in splits into a prize pool and a bounty pool. On elimination, the killer collects half the victim's head bounty immediately. The other half adds to the killer's own head bounty, which they carry forward. That compounding over a multi-elimination tournament lifetime is what makes PKO strategically unique.

Mystery bounty. A shared pool of hidden-value envelopes. On elimination during the mystery phase (post-ITM only), the killer draws one random envelope. Pre-ITM, the rules implementation downgrades mystery to normal.

Based on general poker theory PKO carry compounding. In a full tournament, each knockout adds to the killer's head bounty, making them a higher-value target — and giving them progressively more incentive to accumulate. Published canonical MTT references derive the "call wider for bounties" prescription from this lifetime compounding, not from the single-hand instant payout alone.

3.2 The bounty pricing table

Measurement conditions: 6-max NL, MTT-mid stage (500/1000 alive), symmetric-stack fixtures (covering-vs-covered asymmetry NOT exercised). Bounty modes per GAME-RULES.md §8.

Model VPIP by position × stack across four bounty modes. MTT-mid stage · 500/1000 alive · 6-max NL · bounty mode sweeps columns

Pos	Stack	normal	PKO	flat-KO	mystery	PKO Δ	flat-KO Δ	mystery Δ
UTG	60bb	17.2	18.6	18.0	17.2	+1.4	+0.8	+0.0
UTG	40bb	17.2	18.4	18.4	17.2	+1.2	+1.2	+0.0
UTG	25bb	17.1	18.5	19.3	17.1	+1.4	+2.2	+0.0
UTG	15bb	13.8	13.9	14.6	13.8	+0.1	+0.8	+0.0
MP	60bb	23.6	22.8	21.1	23.6	−0.8	−2.5	+0.0
MP	40bb	23.4	22.0	20.6	23.4	−1.4	−2.8	+0.0
MP	25bb	22.8	23.0	20.3	22.8	+0.2	−2.5	+0.0
MP	15bb	20.5	22.8	22.5	20.5	+2.3	+2.0	+0.0
CO	60bb	30.0	29.1	26.6	30.0	−0.9	−3.4	+0.0
CO	40bb	28.3	27.5	27.0	28.3	−0.8	−1.3	+0.0
CO	25bb	26.3	25.2	25.3	26.3	−1.1	−1.0	+0.0
CO	15bb	23.2	23.0	22.4	23.2	−0.2	−0.8	+0.0
BTN	60bb	42.8	41.0	34.4	42.8	−1.8	−8.4	+0.0
BTN	40bb	39.9	36.4	31.3	39.9	−3.5	−8.6	+0.0
BTN	25bb	36.2	31.2	26.5	36.2	−5.0	−9.7	+0.0
BTN	15bb	29.7	26.4	23.8	29.7	−3.3	−5.9	+0.0
SB	60bb	74.1	70.3	64.6	74.1	−3.8	−9.5	+0.0
SB	40bb	73.7	68.2	61.2	73.7	−5.5	−12.5	+0.0
SB	25bb	71.8	68.3	63.0	71.8	−3.5	−8.8	+0.0
SB	15bb	64.1	61.0	60.7	64.1	−3.1	−3.4	+0.0

Source: mtt-deltas.md Table 3 lines 76–101

BTN and SB carry the largest deltas. SB at 40bb flat-KO shows the single deepest tightening: -12.5pp. UTG cells cluster near zero for both bounty types.

3.3 The PKO vs flat-KO sign anomaly

This is the model's most counterintuitive bounty finding, and it contradicts published theory on two dimensions.

Direction. Across all cells, PKO averages -5.0pppp VPIP shift at BTN 25bb. The aggregate mean across the full grid is −1.40pp for PKO and −3.70pp for flat-KO. Both are negative — the model tightens under bounty. Published canonical MTT references unanimously predict that bounty modes should widen ranges, because eliminating an opponent collects their head bounty and adds positive EV to every call or shove.

Ordering. Flat-KO tightens more than PKO at every BTN and SB cell. At BTN 25bb the flat-KO Δ is -9.7pp versus PKO -5.0pppp — a ratio of roughly 1.94×. At BTN 60bb the ratio widens to 4.67×. Published theory predicts the reverse ordering: PKO's progressive carry should create stronger widening pressure than flat-KO's one-shot payout.

Both the direction and the ordering are model behavior, not solver-corroborated. Do not use these numbers to coach absolute PKO or flat-KO range adjustments. The likely explanation — though unresolved — is that the trainer scores only the instant payout on each hand and does not simulate the multi-hand bounty carry that makes PKO "stronger" in tournament-lifetime EV. A model that never sees the carry compounding may correctly respond to a smaller per-hand PKO incentive than a full flat-KO collection.

The rule of thumb Do not teach PKO or flat-KO ranges from this model's output. The direction and ordering are both sign-flipped versus the textbook. For PKO coaching, rely on canonical published push-fold charts and covering-vs-covered theory until a future training run includes carry-compound PKO reward.

3.4 Mystery bounty — two regimes, not one

The mystery column in the table above is a wall of +0.0. That looks like the model ignores mystery entirely — but it does not. The story has two chapters.

Pre-ITM (mid-stage, the data above). The training rules downgrade mystery_bounty to normal whenever the field has not yet reached the money. All queries in Table 3 fire at 500/1000 alive (mid-stage), which triggers that downgrade. So 20 of 20 cells matching normal is rules-correct behavior, not a model limitation.

Post-ITM (ITM and final table). A follow-up batch queried mystery at post-ITM stages where the downgrade rule does not fire. Result: 36 of 40 cells showed mystery different from normal, with the maximum shift reaching 8.2pp VPIP tightening at SB 10bb on the final table. Mystery tightens VPIP post-ITM, consistent with the idea that the remaining mystery prize pool increases the survival EV of staying in the tournament.

This resolves the earlier open question about whether the model reads the mystery parameter at all. It does — but only when the rules allow it. Pre-ITM, mystery IS normal. Post-ITM, mystery is its own regime.

The rule of thumb For post-ITM mystery bounty spots, the model is usable. Expect tightening versus normal — ranges compress by roughly 3–8pp depending on position and stack depth, with the largest effect at short-stack final-table spots. For pre-ITM mystery, the model's output is identical to normal, which is the correct equilibrium given the rules implementation.

3.5 The real late-game lever: covering versus covered

The table above measures bounty mode effects with symmetric stacks. It does not capture the largest documented bounty-driven adjustment in published PKO theory: whether you cover your opponent or your opponent covers you.

Published canonical theory documents a 32pp spread in RFI ranges at BTN 25bb on the bubble between a covered player and a covering player. That is larger than any delta in this section — and we did not test it. Round 1 used symmetric stacks throughout, so the covering-vs-covered asymmetry was never exercised.

Until that asymmetry test runs, every PKO number above reflects "symmetric-stack wrapper effect," not the realized EV of bounty hunting in practice. If you are coaching PKO and need covering-vs-covered guidance, use canonical push-fold references and published solver charts — they are the authoritative source until our model's asymmetric-stack behavior is verified.

What we didn't test in Part 3

Covering-vs-covered stack asymmetry — all Round 1 bounty fixtures use symmetric stacks. The primary PKO driver (32pp RFI spread per published theory) is not exercised.
PKO push-fold below 15bb — the bounty sweep tested 15/25/40/60bb. Shallow-stack PKO push-fold at 5–12bb, where bounty incentives should be strongest, is not covered.
Satellite and super-KO dynamics — not in the training code or query set.
PKO carry-compound reward — the trainer scores only the instant 50% payout per hand, not the 50% lifetime carry that compounds. This is likely the root cause of the PKO sign anomaly.
Mystery bounty at ultra-deep stacks (>60bb) — post-ITM mystery verified at short/mid stacks; deep-stack mystery untested.

5 practical takeaways for bounty play

Mystery bounty coaching is usable post-ITM. The model tightens VPIP 3–8pp versus normal at ITM and final-table stages. Pre-ITM, mystery is rules-equivalent to normal — coach accordingly.
Do NOT use this model for absolute PKO or flat-KO range adjustments. Both direction (tightens instead of widens) and ordering (flat-KO > PKO instead of PKO > flat-KO) contradict published theory. Treat the numbers as model characterization, not coaching prescription.
DO use the directional mix-shift signal. The 3-bet percentage drops under all bounty modes — that direction is consistent with published ICM theory. See Part 5 (BB defense) for the full call-up / 3-bet-down decomposition.
For covering-vs-covered, use canonical PKO theory. The model's symmetric-stack output does not exercise the asymmetry that makes bounty play strategic. Published references remain the authority until Round 2 fills this gap.
For short-stack push-fold under bounty, see Part 4 plus a published Nash calculator. The model's SB push-fold crossover (Part 4) is verified only under normal bounty. PKO-specific push-fold at 5–12bb is untested.

Research notes

Details for readers interested in the methodology behind the findings above. Skip this section if you just want the practical takeaways.

PKO single-hand reward truncation. The training code encodes only the instant 50% PKO reward per hand, not the 50% carry that compounds across a multi-elimination tournament. This is structurally consistent with single-hand training — each episode sees one hand, and the carry portion would require simulating future knockout opportunities. But canonical PKO theory derives its "+VPIP widening" prediction precisely from that compounding: knocking someone out adds to your own head bounty, which increases your value as a target in subsequent hands. A model that never sees future-hand bounty accumulation is correctly responding to a weaker per-hand signal, which is consistent with the observed tightening direction. The training team has flagged this as a candidate methodology entry for a future training-run inclusion.
Mystery two-regime behavior (resolved). The training rules implementation downgrades mystery_bounty to normal when left_ratio > itm_ratio. In Round 1's mid-stage queries (500/1000 alive), the downgrade fires on every cell — so 20/20 cells matching normal is rules-correct, not a model bug. A follow-up batch at ITM and final-table stages (run 2026-04-17) confirmed the model reads the mystery parameter when the downgrade rule does not fire: 36/40 cells show mystery different from normal, max Δ 8.2pp at SB 10bb FT. The two-regime framing — mystery ≡ normal pre-ITM, mystery differentiates post-ITM — reconciles the rules implementation with published theory. This item is now resolved.
Covering-vs-covered asymmetry (untested). The asymmetric-stack batch is the top priority in the Round 2 queue. Published canonical theory documents a 32pp RFI spread between covering and covered players at BTN 25bb bubble — the single largest bounty-driven adjustment in the literature. All Round 1 queries used symmetric stacks, so the model's behavior under asymmetric covering conditions is completely unknown. Until that batch lands, every PKO claim in this section is scoped to symmetric-stack model behavior, not realized PKO EV. The test is designed as a matched pair: identical position and stack depth, with only the hero-covers vs hero-is-covered flag toggled.

Part 4

Short-stack push-fold (5–15bb)

The SB regime that made you limp at deep stacks flips below 8bb. At 5–7bb the SB shoves much wider in MTT than Cash. At 10–15bb it shoves tighter. And for every other position at the table? The MTT wrapper barely registers.

That two-sentence summary is the entire short-stack story from Round 1. This part unpacks it with the data.

The push-fold framework

In the 5–15bb band, the open action collapses toward shove-or-fold. Dan Harrington's M-ratio zones — green (M ≥ 20, full toolbox), yellow (10–20, cautious), orange (6–10, any raise commits), red (1–6, push-or-fold) — are the standard framework. MTT adds two pressures on top of Cash push-fold math: ICM (each chip is worth less as your stack grows) and bounty payoff (Part 3). For non-SB seats, neither force adds a measurable signal in our data. Cash push-fold ranges hold.

Measurement conditions: 6-max NL, push-fold stacks (5–15bb), normal bounty mode (PKO/flat-KO push-fold not separately tested).

Non-SB push-fold: Cash ranges hold

Every UTG, MP, CO, and BTN cell in the table below lands within a few percentage points of Cash. The model does not tighten at the bubble the way textbook ICM predicts — and it does not widen either. It simply plays Cash.

All-in frequency by position and stack depth — non-SB seats. Cash baseline vs MTT early/bubble/ft · 6-max NL · stack depth sweep rows · normal bounty

Pos	Stack	Cash AI%	MTT-early AI%	MTT-bubble AI%	bubble − Cash	ft − Cash
UTG	5bb	16.1%	17.5%	17.0%	+0.9	+0.9
UTG	7bb	14.0%	13.7%	13.3%	−0.7	−0.7
UTG	10bb	7.6%	6.2%	5.7%	−1.9	−1.9
UTG	12bb	4.1%	2.4%	2.0%	−2.1	−2.1
UTG	15bb	0.3%	0.2%	0.2%	−0.1	−0.1
MP	5bb	22.3%	22.9%	22.4%	+0.1	+0.1
MP	7bb	18.0%	16.9%	16.4%	−1.6	−1.6
MP	10bb	4.1%	3.6%	3.0%	−1.1	−1.1
MP	12bb	0.4%	0.4%	0.3%	−0.1	−0.1
MP	15bb	0.0%	0.0%	0.0%	+0.0	+0.0
CO	5bb	27.6%	27.9%	27.5%	−0.1	−0.1
CO	7bb	23.2%	21.8%	21.3%	−1.9	−1.9
CO	10bb	7.5%	8.9%	8.2%	+0.7	+0.7
CO	12bb	1.8%	2.1%	1.7%	−0.1	−0.1
CO	15bb	0.1%	0.1%	0.1%	+0.0	+0.0
BTN	5bb	36.1%	35.5%	35.0%	−1.1	−1.1
BTN	7bb	32.8%	30.6%	30.2%	−2.6	−2.6
BTN	10bb	23.5%	21.5%	20.8%	−2.7	−2.7
BTN	12bb	15.1%	15.5%	14.7%	−0.4	−0.4
BTN	15bb	5.3%	6.0%	5.1%	−0.2	−0.2

Source: mtt-deltas.md Table 4 lines 109–139

The largest non-SB delta in the entire table is BTN 10bb at -2.7 percentage points. That is noise, not signal. For ICM-aware non-SB push-fold, use published Nash push-fold solvers — this model does not add an MTT-specific adjustment.

What this means in practice If you play UTG, MP, CO, or BTN between 5bb and 15bb, your Cash short-stack chart is your MTT short-stack chart. The tournament wrapper doesn't change the shove range by a meaningful amount at any of these positions.

SB push-fold: the non-monotonic crossover

SB is the one position where the MTT wrapper rewrites push-fold strategy. The direction depends on stack depth — and it flips sign.

SB all-in frequency by stack depth. Cash baseline vs MTT-early · 6-max NL · SB · stack depth sweep rows · normal bounty

Stack	Cash AI%	MTT-early AI%	MTT-bubble AI%	bubble − Cash
5bb	52.9%	68.3%	67.6%	+14.7
7bb	47.6%	57.1%	56.8%	+9.2
10bb	44.0%	36.0%	35.6%	−8.4
12bb	34.2%	23.4%	23.2%	−11.0
15bb	22.5%	12.8%	12.7%	−9.8

Source: mtt-deltas.md Table 4 lines 134–139

Same data, visualized. The non-monotonic crossover — wider than Cash at 5–7bb, tighter at 10–15bb — is visible in the diverging lines.

Source: mtt-deltas.md Table 4 lines 134–139

At 5bb: +14.7 percentage points wider. At 12bb: -11.0 percentage points tighter. The sign flips somewhere between 7bb and 10bb — we did not test 8bb or 9bb in Round 1, so the exact crossover is not localized.

What this means in practice Below 7bb in the SB, the MTT model jams significantly wider than Cash. Above 10bb in the SB, it jams significantly tighter. If you are copying a Cash SB shove chart into MTT, you are over-folding at 5–7bb and over-shoving at 10–15bb.

What changes between 7bb and 10bb at SB

The deep-stack SB regime (limp-heavy, covered in Part 2) needs the option of completing and then defending postflop. Below roughly 8bb, completing is dominated by jamming — BB does not have enough stack to defend wide against a shove.

Based on general poker theory ICM chip-utility concavity. At 5bb, SB is near the bottom of the payout ladder, so the ICM tax on busting is small — shoving is relatively cheap in dollar-equity terms, and the wider jam captures fold equity from the antes. At 10–15bb, SB holds real survival equity, and every bust-out costs more in dollar-equity than the equivalent chip loss in Cash. That makes marginal shoves more expensive in MTT, and the model tightens. Falsifier: if the crossover were driven by rake rather than ICM chip-utility, Cash at the same rake would show the same flip — it does not.

One caveat. Published external Nash push-fold benchmarks for SB at 10bb report much higher VPIP than our model. Our MTT SB at 10bb shows 63.0% VPIP; published benchmarks report roughly 75%. Direction matches (SB wide), magnitude does not. At 15bb the gap is even larger. If you are coaching SB push-fold at 10bb or deeper in MTT, the external Nash reference is a safer anchor than this model.

A push-fold prescription at five stack depths

For UTG, MP, CO, and BTN at 5/7/10/12/15bb: use your Cash push-fold chart. The MTT model shows no actionable deviation at any of these positions — every delta is within a few percentage points of Cash.

For SB, the model outputs at each stack depth:

SB push-fold prescription by stack depth. MTT-early · 6-max NL · SB · normal bounty

Stack	Model MTT-early AI%	Guidance
5bb	68.3%	Wider than Cash by roughly 15pp — push aggressively
7bb	57.1%	Wider than Cash by roughly 9pp — still push wide
10bb	36.0%	Tighter than Cash — and well below external Nash benchmarks. Use external charts.
12bb	23.4%	Tighter than Cash — model under-shoves relative to external Nash here. Use external charts.
15bb	12.8%	Tighter than Cash — the shove option is giving way to min-raise at this depth

Source: mtt-deltas.md Table 4 lines 134–139

The 5–7bb SB widening is the most coaching-relevant finding in this table. The 10–15bb SB under-shoving is where the model diverges from established solver outputs and should not be taken as a prescription.

What we didn't test in Part 4

PKO/flat-KO push-fold. Round 1 used normal bounty for the push-fold sweep. Covering-vs-covered asymmetry — the dominant PKO driver — was not exercised at short stacks.
8bb and 9bb SB rows. The crossover between wider-than-Cash and tighter-than-Cash at SB is localized to somewhere between 7bb and 10bb, but we cannot pin it to a single stack depth.
Blind-vs-blind dynamics at 5bb when facing a BB shove. SB's calling range vs a BB jam is not in the Round 1 batch.
Satellite bubble dynamics. Steep-payout ICM (where min-cashing is worth more than a large stack) is absent from the training distribution.

4 practical takeaways for short-stack play

UTG/MP/CO/BTN at 5–15bb: use your Cash short-stack chart. The model adds no MTT-specific signal for these positions — every delta is within a few percentage points of Cash.
SB at 5–7bb: push wider than Cash by roughly 10–15 percentage points. The model shoves 68.3% at 5bb and 57.1% at 7bb. This is the one short-stack spot where MTT meaningfully diverges from Cash at a non-SB seat.
SB at 10–15bb: the model is tighter than published Nash benchmarks. Use published external push-fold charts (HRC, ICMizer) over the model for SB shove-or-fold decisions at 10bb and above.
For ICM-aware bubble adjustments at BTN/CO, use a published bubble factor chart. The model does not respond to bubble pressure for non-SB seats (see Part 7 on stage binning).

Research notes

Details for readers interested in the methodology behind the findings above. Skip this section if you just want the practical takeaways.

External Nash benchmarks exceed the model at SB 10–15bb. Published push-fold benchmarks for SB at 10bb report 75.2% VPIP (from external Nash solvers queried at 8-max with equal stacks); our model shows 63.0% MTT VPIP at the same cell. At 15bb the external benchmark is 78.7%; our model shows 64.1%. Direction matches (SB wide at short stacks), magnitude does not — the model under-shoves by roughly 12–15pp at SB 10bb and 15bb. This is consistent with the broader observation in mtt-theory.md §Pillar A that the model's SB deep-stack widening exceeds solver literature while the SB short-stack behavior undershoots it.
The crossover mechanism is single-checkpoint. The SB push-fold crossover between 7bb and 10bb (mechanism M8 in the research layer) is observed on one model checkpoint. A second-checkpoint replication would confirm whether the crossover band is specifically 8–10bb or is an artifact of training-distribution noise. The Round 2 batch that fills the 8bb/9bb SB rows would resolve this directly.
Bounty effects on push-fold are untested. Round 1 push-fold queries all used normal bounty mode. In PKO, the covering-vs-covered asymmetry documented in Part 3 should produce the largest deviations at short stacks (where a single knockout is decisive). The Round 2 asymmetric-stack bounty sweep is the highest-priority batch for resolving this gap.

Part 5

BB defense widening

If you use your Cash BB defense ranges in MTT, you are folding too much. Not slightly — the gap is +12.7 to +16.9 percentage points wider on every single cell we tested. That is 62 theories catalogued in this book, and this one finding — BB defends wider in MTT than Cash — showed up in every opener, every raise size, every stack depth, and every tournament stage we queried.

The kicker: this is not a bubble-pressure effect. It shows up at MTT-early (full field, no ICM) just as strongly as on the bubble.

Measurement conditions: 6-max NL, BB facing opener, MTT-early baseline (1000/1000 alive). Stage-insensitivity holds at the same cells across early/bubble/itm/ft within ±1.5pp.

The headline delta

Out of 96 tested cells, 96 show positive defense deltas. The minimum widening is +6.1pp (bubble, UTG 3.0bb open, 100bb). The maximum is +16.9pp (early, CO 2.5bb open, 20bb). The pattern: the gap is largest where Cash defense was tightest — where the MTT wrapper has the most room to fill.

Defense by opener at 100bb

BB defense rate by opener position. MTT-early stage · 1000/1000 alive · 2.5bb open · BB vs opener · 6-max NL · 100bb effective

Opener	Cash Def%	MTT Def%	Δ (pp)
UTG	36.5	49.2	+12.7
CO	51.8	63.2	+11.4
BTN	60.4	70.7	+10.3
SB	79.4	86.2	+6.8

Source: mtt-deltas.md Table 5, early stage, 2.5bb open, 100bb rows.

Same data, visualized. BB defense widens versus every opener, with the largest delta versus UTG where Cash defense was lowest.

Source: mtt-deltas.md Table 5, early stage, 2.5bb open, 100bb rows.

The relative ordering is preserved — you still defend more against the SB than against UTG. What changes is the floor. Every position lifts by roughly the same band, and the widening is largest vs UTG because that is where Cash defense was the lowest to begin with.

What this means in practice: Your Cash defense chart still determines the shape of your defending range. MTT shifts the entire chart up. Against a UTG open at 100bb, you are defending roughly half your range in MTT versus a third in Cash.

Defense by stack depth — vs CO 2.5bb open

BB defense rate by stack depth, CO opener. MTT-early stage · 1000/1000 alive · 2.5bb open · BB vs CO · 6-max NL · stack depth sweeps rows

Stack	Cash Def%	MTT Def%	Δ (pp)
100bb	51.8	63.2	+11.4
40bb	43.9	58.7	+14.8
20bb	32.6	49.5	+16.9

Source: mtt-deltas.md Table 5, early stage, CO 2.5bb rows.

Same data, visualized. The delta grows as stacks get shallower — from +11.4pp at 100bb to +16.9pp at 20bb.

Source: mtt-deltas.md Table 5, early stage, CO 2.5bb rows.

The gap grows as stacks get shallower. At 20bb, MTT BB defends nearly half its range against CO — up from a third in Cash. The practical read: short-stack MTT defense adjustments are even larger than deep-stack ones.

The mix shift — call up, 3-bet down

The extra defense is almost entirely calls. 3-bet frequency drops in every comparable cell.

Action breakdown for BB defense, selected cells. MTT-early stage · 1000/1000 alive · 6-max NL · opener×raise×stack labeled in rows

Spot	Cash Call%	MTT Call%	Δ Call	Cash 3bet%	MTT 3bet%	Δ 3bet
UTG 2.5bb 100bb	29.7	45.6	+15.9	6.8	3.6	−3.2
CO 2.5bb 20bb	8.7	36.6	+27.9	23.9	12.9	−11.0
BTN 2.5bb 20bb	9.8	37.6	+27.8	29.0	18.0	−11.0
SB 2.5bb 20bb	35.2	65.5	+30.3	26.1	11.4	−14.7

Source: mtt-deltas.md Table 5, early stage rows.

At 20bb facing SB, MTT BB calls 65.5% (+30.3pp) and 3-bets only 11.4% (+30.3 more calls absorb the defense widening, while 3-bets drop by 11.4 of the range). The direction is universal: call more, 3-bet less.

What this means in practice: If you are widening your MTT BB defense by 3-betting more, you are doing it backwards. The model shifts defense into the calling lane. 3-bets actually drop — consistent with ICM-era advice to reduce the frequency of bloating pots, even though the overall defense rate goes up.

Why this happens

Based on general poker theory Three structural forces in MTT raise BB's continuing EV versus a Cash baseline. First, the ante adds dead money to the pot before the hand begins, which mechanically raises the minimum defense frequency (MDF — the fold rate that makes bluffs break even). An external solver baseline reports the ante effect alone accounts for roughly +35 percentage points of BB VPIP at 100bb. Second, MTT raise sizes run smaller on average than Cash raises, which gives BB better immediate odds on a call. Third, MTT openers are wider — especially from the SB (see Part 2) — which lowers the average strength of the opening range BB is defending against. All three push in the same direction: BB should continue more often.

Stage-insensitivity — this is NOT ICM

BB defense at early vs bubble, representative cells. 2.5–3.0bb open · 6-max NL · opener×raise×stack labeled in rows · early vs bubble compared

Spot	Early Def%	Bubble Def%	Δ (early → bubble)
UTG 2.5bb 100bb	49.2	47.8	−1.4
CO 2.5bb 100bb	63.2	61.9	−1.3
SB 3.0bb 100bb	79.6	79.0	−0.6
SB 2.5bb 20bb	76.9	76.0	−0.9

Source: mtt-deltas.md Table 5, early vs bubble stage rows.

Within-stage delta never exceeds ±1.5pp. The BB defense widening is already fully present at MTT-early — before any ICM engagement. This is the structural MTT wrapper (ante + small opens + no rake), not a response to pay-jump proximity.

The rule of thumb Treat wider BB defense as the default MTT assumption from hand one. Do not wait for the bubble to adjust.

What we didn't test in Part 5

Only 2.5bb and 3.0bb open sizes tested. A 2.0bb sweep (common in late-stage MTT) and a 4.0bb sweep would test whether the widening scales or compresses at extreme sizes.
BB 4-bet response not analyzed — we measured fold/call/3-bet only. Whether the 3-bet drop also shifts 4-bet dynamics is unknown.
PKO and mystery bounty interactions with BB defense were not separately isolated. The headline data uses normal bounty mode.
Ante-isolated queries (removing the ante from the MTT wrapper to directly measure its contribution) are queued for a follow-up research round but not yet run.

Four practical takeaways for BB defense

If your Cash BB defense felt right, you are folding too much in MTT. Widen by roughly +10pp at 100bb, +15pp at 20bb. The gap is universal across openers, sizes, and stages.
Widen via calls, not 3-bets. 3-bet frequency drops in every tested cell. The extra defense goes into flatting, not re-raising.
The widening is a wrapper effect, not bubble pressure. Apply it from MTT-early. Do not wait until ICM kicks in — the data shows early and bubble defense within ±1.5pp.
The relative defense by opener position is preserved. Defend more versus SB than UTG, same as Cash. Only the absolute floor lifts.

Research notes

Details for readers interested in the methodology behind the findings above. Skip this section if you just want the practical takeaways.

The ante-isolation test is the key open question for this finding. An external solver baseline (100bb, ante-matched MTT) reports BB VPIP approximately 35 percentage points wider than Cash from ante alone — per the research pack's modern-solver synthesis. Our model's full MTT widening sits at +6 to +17pp, well inside that envelope. This raises the possibility that the model under-responds to the ante's MDF impact. A follow-up research batch would hold the MTT wrapper constant, toggle the ante off, and measure how much of the +6–17pp widening disappears. That batch is flagged as the highest priority for the next round of queries.
Stage-insensitivity is the load-bearing evidence that the widening is structural. If early-stage and bubble-stage BB defense diverged by more than a few percentage points, the widening could be misread as Bubble Factor over-defense — a model bug where BB calls too much under ICM pressure. The fact that early and bubble cells track within ±1.5pp rules out that interpretation and grounds the finding as ante-driven, not ICM-driven. Without this evidence, the coaching advice would need a caveat it does not currently need. Source: mtt-deltas.md Table 5, all early vs bubble row pairs.
The 96/96 positive-delta pattern is derived from a single model checkpoint. The 100% hit rate across cells is too strong to be checkpoint noise — no random perturbation produces a unidirectional shift on 96 cells. But exact per-cell magnitudes (e.g., +12.7pp vs UTG at 100bb) could shift by a few points on a second checkpoint. The directional finding ("MTT BB always defends wider than Cash") carries moderate-to-strong evidence after partial corroboration by the external solver anchor. The specific per-cell deltas carry moderate evidence pending a second-checkpoint replication. Internal reference: causal-explanations.md M6 discussion; mtt-baselines.md MTT-TH-B2 verdict worksheet.

Part 6

Flop cbets

MTT cbets are dramatically wider than Cash. On 5 of the 6 textures we tested, the model bets the flop +29.0 to +29.4 percentage points more often than it does in Cash. The lone exception — paired-low 772p — flips the other way, with MTT cbetting less than Cash.

That exception is what makes this section interesting. The direction of the cbet shift isn't random. It traces directly back to the BB defense widening from Part 5: when BB's added hands miss the board, the IP cbettor prints fold equity. When those added hands hit the board, fold equity collapses and the cbet incentive inverts.

Measurement conditions: 6-max NL, single-raised pots, MTT-early stage (1000/1000 alive), normal bounty. Cbet behavior is stage-insensitive within the MTT regime — see Part 7 for binary-stage binning.

The cbet table by texture

CO and BTN flop cbet frequency and average bet size, Cash vs MTT, across 6 board textures. MTT-early stage · 1000/1000 alive · normal bounty · 6-max NL · CO/BTN vs BB SRP · 100bb effective · texture sweeps rows

Opener	Board	Cash bet%	MTT bet%	Δ (pp)	Cash avgBet	MTT avgBet
CO	`A72r`	64.4	93.4	+29.0	2.3	2.5
CO	`K84r`	80.0	96.2	+16.2	2.5	2.7
CO	`Q72tt`	69.3	98.7	+29.4	2.4	2.6
CO	`987mn`	51.9	80.2	+28.3	1.9	1.9
CO	`772p`	68.2	57.5	−10.7	2.0	1.9
CO	`J83ft`	70.3	97.8	+27.5	2.6	2.7
BTN	`A72r`	69.5	93.0	+23.5	2.3	2.5
BTN	`K84r`	81.7	93.3	+11.6	2.6	2.7
BTN	`Q72tt`	77.1	97.6	+20.5	2.5	2.7
BTN	`987mn`	51.2	77.5	+26.3	2.0	2.0
BTN	`772p`	73.1	64.4	−8.7	1.9	1.9
BTN	`J83ft`	71.4	96.5	+25.1	2.6	2.7

Source: mtt-deltas.md Table 6, early-stage rows (lines 242–269)

Same data, visualized. The cbet elevation is visible on every texture except paired-low 772p, which reverses.

Source: mtt-deltas.md Table 6, early-stage rows (lines 242–269)

Ten of the 12 rows show double-digit positive deltas. CO Q72tt is the largest at +29.4pp; CO A72r at +29.0pp is close behind. The two 772p rows are the only negative cells: CO -10.7pp and BTN -8.7pp.

What this means in practice: If you have been applying Cash cbet frequencies in early-stage MTT single-raised pots, you are likely under-betting dry and two-tone boards by a very large margin. On A72r, Q72tt, and J83ft, the model bets nearly every time.

Why dry and connected boards elevate

Based on general poker theory fold equity. The MTT BB defense range is wider than Cash by +12.7 to +16.9 percentage points (Part 5). The added BB hands are predominantly offsuit broadways and low-card junk — combos that miss A-high, K-high, Q-high, and two-tone boards almost entirely. When those hands whiff the flop, the IP cbettor gets more folds for the same sizing. On 987mn (mid-connected monotone), the wrapper widening still dominates because the monotone constraint narrows BB's connecting combos to a single suit. Falsifier: if the elevation were rake-driven rather than fold-equity-driven, we would expect uniform elevation across all textures. Instead the magnitude tracks how badly BB's added hands miss each board class.

Why 772p reverses

Based on general poker theory range-vs-texture matching. On 772p, BB's wider defense range contains more small-to-medium pocket pairs — hands like 22–TT that give BB at least middle pair, often a set or better on a low paired board. Unlike A72r where the added hands are pure air, on 772p they connect. BB's effective continuing range is stronger than in Cash, so IP fold equity collapses and the cbet incentive flips. Falsifier: if this were a generic "paired-board" effect, we would expect paired-high boards (KK5, QQ3r) to show the same inversion. Paired-high boards are untested in MTT Round 1 — see "What we didn't test" below.

Average bet size is also larger

On every non-paired texture, MTT average bet size is larger than Cash — by roughly +0.1 to +0.2 pot. This contradicts published MTT theory that predicts smaller postflop sizing under ICM pressure. The contradiction is consistent with the stage-insensitivity finding below: the model's cbet behavior appears driven by the structural MTT wrapper (antes, no rake, smaller opens), not by ICM.

Stage-insensitivity

Within the MTT regime, flop cbet frequency barely moves across tournament stages. Early vs bubble deltas from Table 6 are all within ±1.7 percentage points:

Early vs bubble flop cbet frequency on selected opener × board cells. MTT-early vs MTT-bubble · normal bounty · 6-max NL · CO/BTN vs BB SRP · 100bb effective

Opener × Board	Early bet%	Bubble bet%	Δ (pp)
CO `A72r`	93.4	91.7	−1.7
CO `Q72tt`	98.7	98.2	−0.5
CO `987mn`	80.2	79.4	−0.8
CO `772p`	57.5	59.2	+1.7
BTN `J83ft`	96.5	95.5	−1.0
BTN `772p`	64.4	65.6	+1.2

Source: mtt-deltas.md Table 6, early vs bubble rows (lines 242–281)

Same data, visualized. Early and bubble cbet frequencies are nearly identical across all six cells.

Source: mtt-deltas.md Table 6, early vs bubble rows (lines 242–281)

The cbet elevation is established at MTT-early and stays put through the bubble. Part 7 covers the binary-stage binning mechanism in detail.

Confidence pending solver cross-check

The direction of the cbet elevation is corroborated by external research attributing MTT postflop aggression to ante-driven range widening. The exact magnitudes — some exceeding +29 percentage points — are model behavior that goes beyond what published Cash-vs-MTT solver comparisons report. Until a second-checkpoint replication confirms the numbers, treat the magnitudes as directional, not prescriptive. The direction itself (more cbetting in MTT on dry boards, less on paired-low) has strong structural grounding.

What we didn't test in Part 6

Only 6 textures tested. No paired-mid (442), paired-high (KK5, QQ3r, JJ7r), or double-paired boards. The 772p reversal may or may not generalize to all paired textures.
Only CO and BTN as opener. UTG, MP, and HJ as preflop aggressor were not tested in the flop-cbet sweep.
Only normal bounty. PKO and flat-KO cbet behavior was not queried. If bounty modes shift BB defense width differently, cbet elevation could change in PKO/flat-KO.
Only single-raised pots. 3-bet pots are entirely untested in MTT (see Part 9).
No turn or river data. The entire Pillar F (multi-street c-betting) is untested. Do not extrapolate these flop findings to later streets.

5 practical takeaways for flop cbets

On A-high, K-high, Q-high dry boards, cbet close to always in early-stage MTT. Frequencies in the 90–99% band are equilibrium behavior, not over-aggression.
On 987mn (mid-connected monotone), cbet around 77–80% in MTT vs roughly 51% in Cash. The wrapper widening dominates even on a connected texture.
On paired-low 772p, cbet at or slightly below Cash frequency. MTT cbet drops by 7–11 percentage points. Do not generalize this to paired-high boards without additional data.
Average bet sizes are slightly larger in MTT than Cash on non-paired textures. Published theory predicting smaller sizing under ICM does not hold in this model.
These cbet rules are stage-insensitive. Apply the same frequencies from MTT-early through final-table. Stage does not shift flop cbet behavior materially.

Research notes

Details for readers interested in the methodology behind the findings above. Skip this section if you just want the practical takeaways.

Cbet elevation is structurally tied to BB defense widening. The flop-cbet mechanism identified in the research is a downstream consequence of the BB defense-widening mechanism (Part 5). The wider BB defends preflop, the weaker BB's average flop holding — and the higher IP's fold equity on boards that miss that junk. If a second-checkpoint replication shifts the BB defense widening magnitude, the cbet elevation magnitudes move proportionally. The direction — elevation on dry, reversal on paired-low — is robust because it depends on the structural shape of the range mismatch, not on the exact point-estimate of BB defense width.
The 772p reversal is the only texture-split in the entire MTT theory classification. Across all 62 MTT-vs-Cash theory entries in the foundation, only one carries a REVERSE or TEXTURE-SPLIT verdict: 772p. Without a paired-board panel testing KK5, QQ3r, JJ7r, and 442, it is unclear whether the reversal generalizes to all paired textures or is specific to paired-low (where BB's small pocket pairs match the board). The paired-board panel batch is prioritized as P4 in the Round 2 queue. Our best guess: paired-high boards will retain the Cash direction (raiser-favorable) because BB's wider range does not contain many broadway pairs — but this is an untested prediction.
Two published MTT theory claims are contradicted by this data. First, the claim that MTT postflop cbet should track Cash at early deep-stack stages is contradicted: our model cbets +11 to +29 percentage points above Cash at 100bb MTT-early. Second, the claim that postflop sizing should be smaller under ICM is contradicted: MTT avgBet is +0.1 to +0.2 pot larger. Both published claims rest on ICM-pressure logic, but the model's MTT-vs-Cash deltas are not ICM-driven — the stage-insensitivity result (early ≡ bubble to within ±1.7 percentage points) shows the wrapper-level forces (ante, no rake, smaller opens) dominate the model's behavior, not ICM. The published predictions may still apply to solvers that encode ICM differently.

Part 7

Stage-of-tournament playbook

The canonical theory says ICM pressure tightens your range gradually from mid-tournament through the bubble. The model says something different: it sees exactly 2 stages, not 6. Early, mid, and late produce identical output. Bubble, ITM, and final table produce identical output. The only difference is between those two groups — and the gap is tiny.

If you try to interpolate "mid vs late" or "ITM vs FT" from the model's output, you are reading checkpoint noise. Use the binary output and stop there.

The canonical stage framework

Standard MTT theory gives you two interlocking tools for stage-aware play. First, Harrington's M-ratio zones (M = stack divided by the cost of one orbit) carve the tournament into five postures based on your chip pressure — from Green zone (full toolbox, cash-like play) down through Red zone (push-or-fold). Second, Bubble Factor — the ratio of equity risk on loss to equity gain on win — captures how pay-jump proximity changes the EV of every pot. When Bubble Factor exceeds 1, losing a chip hurts more than winning one helps.

Modern empirical work pushes ICM awareness earlier than older advice suggested. A 2025 analysis found that ICM-aware play outperforms chip-EV play starting when just 17.1–62% of the field remains — substantially before the bubble. That contradicts the Harrington-era treatment that says deep-stack early MTT is "basically cash."

What the model actually sees — the binary binning

Measurement conditions: 6-max NL, normal bounty. Stages per GAME-RULES.md §5: early/mid/late/bubble/itm/ft. The 6 columns collapse into 2 groups in the model's output.

BTN VPIP across 6 tournament stages and 7 stack depths. 6-max NL · BTN open · normal bounty · stack depth sweeps rows · stage sweeps columns

Stack	early	mid	late	bubble	itm	ft	max Δ
100bb	44.1	44.1	44.1	44.4	44.4	44.4	0.3
50bb	41.6	41.6	41.6	41.7	41.7	41.7	0.1
25bb	36.2	36.2	36.2	36.1	36.1	36.1	0.1
20bb	33.8	33.8	33.8	33.5	33.5	33.5	0.3
15bb	29.7	29.7	29.7	29.3	29.3	29.3	0.4
12bb	29.3	29.3	29.3	28.6	28.6	28.6	0.7
10bb	30.6	30.6	30.6	30.1	30.1	30.1	0.5

Source: mtt-deltas.md Table 2 lines 50–62

Look at the first three columns. They are byte-identical at every stack depth. Now look at the last three columns. Also byte-identical. The only variation is between the two groups, and the maximum gap across all 7 rows is 0.7pp.

UTG opens — same pattern

UTG VPIP across 6 tournament stages and 7 stack depths. 6-max NL · UTG open · normal bounty · stack depth sweeps rows · stage sweeps columns

Stack	early	mid	late	bubble	itm	ft	max Δ
100bb	17.6	17.6	17.6	17.0	17.0	17.0	0.6
50bb	17.1	17.1	17.1	16.6	16.6	16.6	0.5
25bb	17.1	17.1	17.1	16.7	16.7	16.7	0.4
20bb	16.2	16.2	16.2	15.8	15.8	15.8	0.4
15bb	13.8	13.8	13.8	13.4	13.4	13.4	0.4
12bb	12.5	12.5	12.5	12.2	12.2	12.2	0.3
10bb	12.8	12.8	12.8	12.5	12.5	12.5	0.3

Source: mtt-deltas.md Table 2 lines 64–74

Same story from the tightest opening position. Two groups, nothing more. Within each group the columns are identical. The between-group shift at UTG is 0.6pp or less.

Where the switch happens

The model flips from Group A (early/mid/late) to Group B (bubble/itm/ft) somewhere between roughly 25% of the field remaining and roughly 16% remaining. Round 1 did not localize the boundary more precisely — the 6-stage discrete grid does not have enough resolution to pin it. A Round 2 query that sweeps the stage parameter in finer steps would resolve this.

Why this matters for coaching

Real MTT theory gives you a graduated pressure curve. Harrington's M-zones tell you when your chip stack forces action. Bubble Factor tells you when pay-jump proximity warps every call. Published MTT literature predicts a smooth tightening through mid → late → bubble stages.

The model gives you none of that granularity. It returns one number for the entire pre-bubble window and a second, slightly tighter number for everything post-bubble. The gap between the two is 0.7pp at most.

A worked example — BTN at 20bb across stages

BTN opens 33.8% VPIP at 20bb for early, mid, and late — all identical. At bubble, ITM, and FT the number shifts to 33.5%. That 0.3pp shift is the model's entire ICM signal at this cell. Published MTT theory at a matched anchor (BB vs BTN at 30bb, bubble vs chip-EV) reports tightening of roughly 6 percentage points — the model disagrees on both magnitude and direction.

Based on general poker theory What this means in practice: The model knows something changes at the bubble. It tightens by a fraction of a percentage point. But it does not distinguish mid from late, or ITM from final table. If your student is asking "should I tighten at the final table vs ITM?" — the model has no answer. Use Harrington M-zones and a published bubble-factor chart for that question.

What we didn't test in Part 7

Binary-binning threshold not localized. The switch between Group A and Group B happens somewhere between 25% and 16% of the field remaining. A finer-grained stage sweep is the highest-priority follow-up query.
Bubble Factor sensitivity untested. We did not vary the alive-to-paid ratio to measure how the model responds to different Bubble Factor magnitudes.
Satellite bubble dynamics absent. No satellite-format fixtures were included in Round 1.
Pay-jump-aware EV computation not probed. The Malmuth-Harville pay-jump math is baked into the training reward, but we did not directly test whether the model responds to steep vs flat payout structures.
Deep-ICM scenarios (super-steep satellite, winner-take-all) not queried. Round 1 used standard payout presets.

3 practical takeaways for stage-aware play

Trust the model's pre-bubble vs post-bubble distinction. The direction is correct — the second group is tighter than the first on most cells. The magnitude is small, but it is real and consistent across positions and stack depths.
Do NOT interpolate finer stage granularity from the model. Early, mid, and late are identical. Bubble, ITM, and final table are identical. Any "difference" you see between mid and late is measurement noise at the sub-0.1pp level. For mid-vs-late or ITM-vs-FT guidance, use Harrington M-zones and a published bubble-factor chart.
The BB defense widening and cbet elevation are stage-insensitive within each group. If you applied the BB defense numbers from Part 5 or the flop cbet numbers from Part 6 at MTT-early, those same numbers hold through late-stage — and the bubble/ITM/FT numbers differ by less than 2 percentage points from the early-stage values.

Research notes

Details for readers interested in the methodology behind the findings above. Skip this section if you just want the practical takeaways.

Binary stage binning is documented as a known model limitation. The stage-gradient scan tested 35 cells (7 stack depths × 5 positions) across 6 tournament stages. All 35 cells produced exactly 2 distinct VPIP groups. Zero cells showed 3 or more groups. Within each group, the maximum observed delta was 0.7pp. This is a model-level finding, verified on one checkpoint. The training architecture exposes only left_ratio (players remaining divided by total entries) as the stage signal — no explicit stage category, no blind-level tracker, no pay-jump proximity feature. The binary binning likely reflects either a training-coverage thin-spot in intermediate left_ratio values or an over-regularized response to the single feature. Enriching the training signal with alive-to-paid ratio, pay-jump proximity, or an estimated Bubble Factor would be the natural next step to produce the graduated ICM curve that theory predicts.
The rules-side explanation. The training environment simulates a single hand per episode. Tournament progression — blind escalation, table balancing, multi-hand momentum — is not part of the training loop. The stage signal is fully determined by left_ratio, which the trainer samples uniformly from 0.01 to 1.0. Without an explicit stage boundary or a richer feature set, the model has evidently learned a step function rather than a smooth gradient. Whether this is a genuine model insight (a sharp cliff in optimal-play VPIP at some left_ratio threshold) or a limitation of the training curriculum is unresolved. The Round 2 fine-grained left_ratio sweep would distinguish between the two.
Published MTT theory disagrees with the model on stage granularity. Several literature and solver claims are directly contradicted by the binary output. Published MTT theory predicts mild tightening of roughly 1–3pp from early to mid-tournament, and a further 3–5pp tightening late pre-bubble. External solver baselines report ICM becoming statistically significant at 37–50% of the field remaining. The model shows zero tightening within the pre-bubble group and places its switch between roughly 25% and 16% remaining — later than any published threshold. Additionally, a published BB-vs-BTN bubble anchor at 30bb predicts defense tightening by roughly 6pp; the model widens defense by roughly 17pp at the same cell. These discrepancies are not subtle — they are coaching-significant and should be flagged when citing model output for stage-dependent decisions.

Part 8

Actionables summary

Twenty-six coaching takeaways from the MTT research, organized by pillar. Some you can coach on today. Others are explicit "do NOT coach from this model" warnings — flagged inline.

Each actionable below is consolidated from the Actionables master list in the source research. Per-pillar evidence and confidence rationale lives in the corresponding per-pillar sections of mtt-theory.md.

A — Equity & Ranges

At 100bb MTT-early at UTG/MP/CO/BTN, your Cash opening range applies. Strong evidence; confirmed by model data and literature. (See Part 1.)
At 25bb/20bb BTN, consider a steal-widening of roughly +3.6 percentage points vs your Cash 25bb range. Treat as a soft adjustment, not a hard rule — low confidence from one data round alone. (See Part 1.)
Treat the SB +19.5pp deep-stack widening as model behavior, not prescription. The magnitude exceeds solver literature. Hold back the limp rate unless an external solver cross-check confirms it; use your preferred MTT SB chart as the prior. Model behavior, not solver-corroborated; low confidence. (See Part 2.)

B — Frequencies & Balance

BB defense is wider in MTT than Cash by +6.1pp to +16.9pp at every opener, raise size, stack, and stage we queried. Driven by calls up with 3-bets down. This contradicts the standard literature direction. Model behavior, not solver-corroborated; low confidence. (See Part 5.)
Trust the solver over the model for BB defense totals; use only the mix-shift direction (call more, 3-bet less) as a weak signal. If your Cash defense rate felt right, do not assume you are folding too much in MTT just because the model wants you to defend wider. Model behavior, not solver-corroborated; low confidence. (See Part 5.)
The BB defense widening is NOT ICM-driven (early ≡ bubble). Treat it as a structural wrapper effect that pre-dates bubble pressure. Moderate evidence. (See Part 5.)

C — Position & Information

The MTT SB plays a fundamentally different regime from Cash at deep stacks — roughly a 68% limp rate at 100bb. The hypothesis is that antes, small raises, and wide BB defense all favor completing over raising. Verify against a solver before coaching. Model behavior at this magnitude; low confidence. (See Part 2.)
Flop c-bet elevation tracks range advantage monotonically on 5 of 6 tested textures; inverts on 772p paired-low. Moderate evidence. (See Part 6.)

D — Sizing

SB 5–7bb is much wider all-in in MTT than Cash (+9.2 to +14.7 percentage points wider). Plausibly consistent with Nash push-fold at this depth; verify quantitatively against external solver anchors. Low confidence. (See Part 4.)
SB 10–15bb all-in frequency drops in MTT (-8.4 to -11.0 percentage points vs Cash) but VPIP stays near Cash. Interpret as "model prefers raise/call over jam at this depth," not "fold more." Low confidence. (See Part 4.)
Non-SB short-stack push-fold (UTG/MP/CO/BTN, 5–15bb) is Cash-like in MTT. The model does NOT tighten at short stacks for these positions the way ICM theory predicts. For ICM-aware non-SB push-fold, use external solver charts, not this model. Model behavior, not solver-corroborated; low confidence. (See Part 4.)
MTT average c-bet size is slightly larger than Cash on non-paired textures. This contradicts the literature prediction of smaller sizing under ICM. Low confidence. (See Part 6.)

E — Board Texture

On dry high-card boards (A72r, K84r, Q72tt, J83ft), c-bet +11.6pp to +29.0pp wider in MTT than Cash at 100bb MTT-early. Strong evidence. (See Part 6.)
On 987mn (mid-connected monotone), c-bet +26.3pp to +28.3pp wider in MTT. Moderate evidence. (See Part 6.)
On paired-low 772p, c-bet direction INVERTS — MTT c-bets -8.7pp to -10.7pp LESS than Cash. Do not generalize to paired-high boards without further data. Moderate evidence. (See Part 6.)
CO gets a stronger c-bet lift from the MTT wrapper than BTN on A/K/Q-high dry textures. Moderate evidence. (See Part 6.)

F — C-Betting / Multi-Street

Flop c-bet is stage-insensitive — once inside the MTT regime, c-bet stays elevated at every stage. All sampled early-to-bubble deltas are within ±-1.7pp. Moderate evidence. (See Part 6.)
All turn and river theories are UNTESTED in MTT Round 1. If you are coaching turn barrel, river value, donk, or delayed c-bet — do not use this model as a direct reference. Use Cash theory plus solver corroboration until further data resolves. Pending Round 2 verification. (See Part 6.)

G — Advanced

Tournament stage is binary in this model, not graduated. Early/mid/late are treated as one group; bubble/ITM/FT as another. Within-group delta is at most 0.7pp. Real MTT theory has more granularity; external solver benchmarks expect graduated ICM onset. Model behavior, not solver-corroborated; low confidence. (See Part 7.)
The stage switch happens somewhere between late (~25% of field remaining) and bubble (~16%). Round 1 did not localize the exact threshold. Pending Round 2 verification. Moderate evidence on direction. (See Part 7.)
Do not rely on Cash Bubble Factor prescriptions — the model does not tighten under Bubble Factor > 1. At bubble stage, BB still defends wider than Cash. For coaching Bubble Factor decisions, use an external solver, not this model. Model behavior, not solver-corroborated; low confidence. (See Part 7.)

H — 3-Bet Pots

Do not use this model for 3-bet pot MTT coaching. The entire 3-bet pot pillar is UNTESTED in Round 1. Pending Round 2 verification. (See Part 7.)

M — MTT-Native

PKO and flat-KO both tighten ranges on average in the model; flat-KO tightens MORE than PKO. The direction and magnitude ordering BOTH contradict textbook bounty theory. Do NOT cite the model as a PKO/flat-KO prescription — use external solver numbers. Model behavior, not solver-corroborated; low confidence. (See Part 3.)
BTN has the largest flat-KO tightening (-8.4 to -9.7 at 25/40/60bb) and SB the largest at 40bb (-12.5). If the flat-KO direction is an artifact, these position-specific magnitudes are also unreliable. Model behavior, not solver-corroborated; low confidence. (See Part 3.)
Mystery bounty behaves exactly like normal in the model at mid-stage (20 of 20 cells identical). Do NOT use this model for mystery bounty strategy at mid-stage. Post-ITM data shows differentiation. Model behavior; low confidence at mid-stage. (See Part 3.)
Covering-vs-covered asymmetry is the primary PKO driver and it is UNTESTED in Round 1. The symmetric-stack fixtures under-exercise the documented 32pp RFI spread. Pending Round 2 verification. Moderate evidence from literature. (See Part 3.)

Where the model is coachable — and where it is not

The safest coaching applications from this research are: deep-stack non-SB opening ranges at 100bb (Part 1), the direction of BB defense widening (Part 5 — call more, 3-bet less), and dry-board c-bet elevation (Part 6 — strong evidence on five of six tested textures). The model is NOT coachable for PKO/flat-KO strategy or mystery bounty mid-stage (Part 3), stage-granular ICM (Part 7), turn and river play (Pillar F), 3-bet pots (Pillar H), or shallow-stack non-SB push-fold under ICM (Part 4). A full scope-limits enumeration lives in Part 9.

Research notes

Details for readers interested in the methodology behind the findings above. Skip this section if you just want the practical takeaways.

Source and scope of the 26-actionable list. The list is consolidated from mtt-theory.md §Actionables master list. Per-pillar actionables also appear inline at the end of each pillar section in mtt-theory.md. The Part 8 list here matches the master list; readers wanting per-pillar provenance evidence should consult the per-pillar sections directly. The actionable numbering (1–26) is the master-list ordering and is stable across revisions.
The model-only coaching warnings are the highest-priority items. Five actionables carry this flag: #3 (SB deep-stack magnitude), #21 (Bubble Factor non-response), #23 and #24 (PKO/flat-KO sign anomaly), and #25 (mystery ≡ normal at mid-stage). Actionable #5 partially qualifies. In each case, the model's behavior is internally confirmed but contradicts published solver outputs. The practical coaching recommendation is identical across all five: use externally published charts rather than the model's frequency. The mtt-theory.md §Actionables master list carries the underlying evidence tags for each.
The Round 2 batch queue resolves seven pending actionables. Actionables #18, #22, and #26 are explicitly pending; #20 references a Round 2 localization sweep; Pillar F (all eight theories) and Pillar H (all three theories) are blanket-untested. The highest-priority Round 2 batches per mtt-theory-foundation.md §Round 2 priorities are three P0 batches (covering-vs-covered asymmetry, stage-threshold sweep, ante-isolated BB defense) and four P1 batches (mystery post-ITM, SB limp decomposition, low-connected boards, and the first 3-bet pot data). Completing the P0 set alone would resolve or materially upgrade actionables #20, #26, and the magnitude question behind #4–#5.

Part 9

Scope limits & open questions

A model that tells you where it's confident is useful. A model that also tells you where it's silent is trustworthy. This part is the second kind.

This part enumerates what Round 1 did NOT cover. Numeric findings throughout the book are scoped per their per-table measurement labels (Parts 1–7).

Round 1 query scope

The research ran 62 theory entries across 23 Cash-transferred, 8 amplified, 15 untested, and 10 MTT-native classifications. The query plan covered preflop opens across all positions and stack depths, BB defense across stages and openers, flop c-bet on 6 board textures, and the stage gradient.

It did NOT cover turn or river play, 3-bet pots, multiway postflop, asymmetric-stack bounty fixtures, satellite formats, or 4-bet ranges.

What is untested (Round 1 scope choices)

These entries have no model data. They carry a specific Round 2 query that would promote them toward a verdict.

Untested entries by pillar after Round 1.

Pillar	Entries	What's missing
D — Sizing	D3	Turn polarity at shallow stacks — no turn barrel data in MTT
E — Board Texture	E6, E7, E8	Turn-card rank effects, low-board BB donk frequency, turn rank cross-board inversion
F — C-Betting / Multi-Street	F1–F8 (all 8)	Entire turn/river and multi-street pillar — Round 1 queried flop only
G — Advanced	G4e (sub-entry)	ICM engagement at 50–62% field — model's binary binning masks any early-ICM signal
H — 3-Bet Pots	H8, H9, H10 (all 3)	Entire 3-bet pot pillar — no 3-bet pot queries in MTT mode
M — MTT-Native	M4	Covering-vs-covered asymmetric stacks — all Round 1 queries used symmetric stacks

Source: mtt-theory-foundation.md v2.0.0 §Coverage map; mtt-baselines.md v1.0.0 §2 Verdict summary table

That is 15 entries with no model evidence. Every one of them has a specific Round 2 batch identified.

Round 2 queue

The next research cycle targets these batches in priority order.

P0 (highest priority — resolves the largest open questions):

Covering-vs-covered asymmetry — tests M4 with hero-covers vs hero-is-covered queries at BTN bubble. External solver baselines predict a range spread that dwarfs position and stage effects.
Stage-gradient fine scan — scans the stage signal in small steps to find where the binary step occurs. Resolves the binary-binning concern and four related entries (G4c, G4e, M6, M10).
Ante-isolated BB defense — tests BB defense at matched stacks with ante explicitly toggled. Anchors whether the widening pattern is ante-driven or a broader model behavior.

P1: post-ITM mystery bounty queries (resolves KI-12 scope); SB limp/raise decomposition at deep stacks; low-connected flop c-bet (canonical Cash board class missing); OOP 3-bet pot c-bet at shallow stacks.

P2–P4: cross-opener range composition in MTT; BB overcall connectivity in 3-way; BTN 3-bet pot c-bet bimodality; turn rank sweeps; river value-bet threshold at bubble; per-hand sizing and overbet frequency; overpair hierarchy on wet boards; bluff-availability sizing constraint; protection-bet frequency; mixed-strategy fractions.

Known issues

KI-12 — Mystery bounty collapses to normal. The model treats mystery bounty identically to normal bounty in every tested cell. The cause is a rules-level guard: when the fraction of the field remaining exceeds the in-the-money ratio, the training code downgrades mystery to normal. All Round 1 mystery queries triggered this guard. Until post-ITM mystery queries land (P1 batch), do not use this model for mystery-bounty strategy.

MC-6 — Binary stage binning. The model partitions tournament stages into exactly two groups — {early, mid, late} vs {bubble, ITM, final table} — with within-group variation no larger than 0.7pp. External solver baselines expect a graduated response across stages. This is the single largest gap between our model and the wider literature on stage-dependent ICM. Do not use the model for stage-granular ICM adjustments until the P0 fine scan resolves whether the step function is a training-coverage artifact or a genuine model limitation.

MC-5 — Deep-stack meta-signal. SB's preflop widening at deep stacks in MTT-early (+19.5pp at 100bb) is anomalously large. This is flagged as a candidate method caveat because the magnitude is not corroborated by external solver baselines at the same cell. The direction is partially supported, but the exact magnitude may reflect a training-distribution thin spot at SB-MTT-100bb.

MC-7 — BB defense widening magnitude. BB defends wider than Cash across every tested cell — the direction is solid (all 96 cells positive). The open question is magnitude: our data shows deltas of +6 to +17 percentage points, while an external ante-isolated benchmark predicts a much larger gap. The P0 ante-isolated batch will disambiguate.

Training-signal gaps

Three structural gaps in the training signal emerged from Round 1. These are not analytics bugs — they require coordination with the training team.

(1) PKO single-hand reward truncation. The training reward for PKO includes only the instant portion of a knockout bounty. The carry portion — the half that adds to the winner's own head bounty and compounds across future hands — is invisible to the single-hand training window. This explains the M2 sign anomaly where the model tightens under bounty modes instead of widening. Decision needed: retrain with carry-included reward, or document that the model's PKO behavior represents per-hand strategy only.

(2) Mystery bounty pre-ITM guard. The training code downgrades mystery bounty to normal before ITM. External solver baselines treat pre-mystery as strategically distinct. Decision needed: align training rules with the literature framing, or document the model's custom pre-mystery interpretation as intentional.

(3) Stage exposure. The only tournament-phase signal the trainer sees is left_ratio (fraction of field remaining). There is no explicit stage category, no pay-jump proximity signal, and no Bubble Factor estimate in the observation. The binary-binning finding (MC-6) is likely downstream of this sparse feature set. Decision needed: enrich stage features for the next training cycle.

Three takeaways for coaches working with this model

Trust the model on preflop opens (Part 1), flop c-bet (Part 5), and BB defense (Part 6). These are within Round 1 scope, tested across positions, stages, and stack depths, and confirmed on the overwhelming majority of cells.
Do not use the model for turn/river (Pillar F), 3-bet pots (Pillar H), shallow-stack sizing beyond flop (D3), mystery bounty (KI-12), or stage-granular ICM (MC-6). These are either untested or flagged.
Check Round 2 release notes before re-coaching against the model. The P0 batches (covering asymmetry, stage fine scan, ante-isolated defense) will materially update Parts 3, 6, and 7. When those land, the affected takeaways in this book will be revised.

Research notes

Details for readers interested in the methodology behind the findings above. Skip this section if you just want the practical takeaways.

The 15 UNTESTED count is a scope choice, not a measurement failure. All 15 entries (D3; E6, E7, E8; F1–F8; H8, H9, H10) were omitted from Round 1 because the query infrastructure focused on preflop and flop-level behavior first. Round 2 P0/P1 batches address the highest-priority entries among these: F1 and F5 (multi-street), H8 (3-bet pots), E2 and E3 (board-texture panel expansion), and E6 (turn rank sweep).
Known-issue taxonomy. The KI/MC distinction follows the shared issues convention. KI = Known Issue — a specific reproducible measurement gap (KI-12: mystery ≡ normal in 20/20 cells). MC = Method Caveat — a broader methodological concern that cannot be resolved by a single query (MC-6: binary stage binning across 35/35 cells; MC-5: SB deep-stack anomaly; MC-7: BB defense magnitude question). KI entries have a single resolution query; MC entries require either a multi-batch investigation or a training-team decision.
Training-signal gaps require cross-team coordination. Each of the three gaps has a "decision needed" framing because the analytics team cannot fix them unilaterally. The PKO carry-truncation gap is the highest-leverage of the three — it directly explains the M2 sign anomaly documented in Part 3. If the training team adds carry-included PKO reward, the M2 and M4 findings would likely reverse direction and align with external solver baselines.

Part 10

Verdicts against MTT-LIT and MTT-SOL

If you know the canonical MTT literature — Harrington's M-zones, the ICM bubble-factor math, the PKO covering-stack widening — you need to know where the model agrees, where it gets the direction right but the magnitude wrong, and where it flatly contradicts the published theory. This part is that map.

Measurement conditions: Round 1 dispositions only (497 MTT + 417 Cash queries, single checkpoint). Round 2 dispositions will update on the next book version.

MTT-LIT cluster-level verdicts

Round 1 cluster-level verdicts across 60 literature claims. Round 1 dispositions · 497 MTT + 417 Cash queries · single checkpoint · classifications: CONFIRMED / PARTIAL / DISCREPANT / UNTESTED

Cluster	Topic	Disposition	Notes
1 — Push-fold & Nash (MTT-LIT-1..8)	Short-stack shove thresholds, Harrington Red Zone, M-ratio zones	PARTIAL	UTG/MP/CO/BTN 5–15bb all-in aligns within a couple of points. SB 5–7bb is wider than Cash/Nash; SB 10–15bb is tighter. Definitional entries (MTT-LIT-1, 7, 8) confirmed by construction.
2 — ICM pressure & bubble (MTT-LIT-9..18)	Bubble Factor, graduated ICM tightening, ERP	DISCREPANT	Binary stage binning: no graduated tightening between pre-bubble stages. Continuous Bubble Factor claims fail. Binary pre-bubble vs post-bubble shift confirmed at small magnitude.
3 — Bounty: PKO (MTT-LIT-19..26)	PKO widens ranges, covering-stack advantage	PARTIAL	Direction inverted: PKO tightens vs normal in our data. Magnitude smaller than published solver benchmarks. PKO-increasing-rate fields populated but not strongly differentiated from flat-KO.
4 — Bounty: Mystery (MTT-LIT-27..30)	Mystery bounty distinct from flat-KO	DISCREPANT	Mystery indistinguishable from normal in 20/20 mid-stage cells. Every mystery-specific claim discrepant.
5 — Bounty: Flat KO (MTT-LIT-31..33)	Flat-KO widens less than PKO	DISCREPANT	Model shows flat-KO tightens MORE than PKO — opposite of published ordering.
6 — Stage adjustments (MTT-LIT-34..42)	Early ≈ Cash, graduated mid/late/bubble tightening	DISCREPANT	Same as cluster 2. Early deep-stack non-SB confirmed; graduated mid → late → bubble predictions all fail under binary binning.
7 — Chip utility / risk-of-ruin (MTT-LIT-43..47)	Concave chip utility, FGS, Malmuth theorem	UNTESTED	Not testable from single-hand preflop/cbet queries. Definitional entries accepted by construction.
8 — NLHE transfers vs breaks (MTT-LIT-48..60)	Cash-like early, wider BB, smaller postflop sizing	PARTIAL	Cash-like non-SB at 100bb confirmed. BB defense wider confirmed (direction). Postflop sizing claim discrepant — model uses same or slightly larger sizing, not smaller. SB widening magnitude is model-only.

Source: mtt-literature-extracts.md §Round 1 dispositions

Summary

Roughly 8 of 60 literature claims are confirmed, around 30 are partial, about 18 are discrepant, and 4 remain untested. The model agrees cleanly with the deep-stack non-SB cash-like prediction (MTT-LIT-34, 51) and the directional drop in 3-bet bluffing under ICM (MTT-LIT-54). It disagrees with most postflop ICM predictions (MTT-LIT-52, 56, 58) and the entire mystery / flat-KO ordering structure (MTT-LIT-27..33).

MTT-SOL section-level verdicts

Round 1 section-level verdicts across 45 external solver anchors. Round 1 dispositions · solver anchors · single checkpoint · classifications: CONFIRMED / PARTIAL / DISCREPANT / UNTESTED

Solver anchor	Topic	Disposition	Notes
MTT-SOL-1, 2, 4, 5	Nash push-fold solver 10/15bb BTN/UTG	PARTIAL	Direction aligns; magnitudes lighter by a few points at short positions.
MTT-SOL-3, 6	Nash push-fold solver 10/15bb SB	PARTIAL	SB direction correct (wide); model undershoots published SB shove rate.
MTT-SOL-7	ICM-aware solver 12bb shove ranges, 9-handed	PARTIAL	Model under-shoves at 12bb MP relative to published ranges.
MTT-SOL-8, 9, 10	ICM-aware solver 10/13bb ChipEV vs ICM	UNTESTED	Round 1 did not isolate ChipEV vs ICM split at matched conditions.
MTT-SOL-13	UTG RFI by stack depth (non-monotonic shape)	CONFIRMED	Best model-solver match. Non-monotonic UTG shape tracks published curve.
MTT-SOL-19	BB defense vs BTN 30bb bubble vs ChipEV	DISCREPANT	Published anchor predicts fold rises; model widens BB defense even on bubble.
MTT-SOL-26, 27	ICM significance threshold (50–37.5% field)	DISCREPANT	Model's binary switch falls between roughly 25% and 15% alive — much later than published 50–37.5% threshold.
MTT-SOL-30, 31, 32, 33	PKO preflop adjustments	DISCREPANT	Published PKO anchors predict wider; model tightens. Sign error carries from MTT-LIT cluster 3.
MTT-SOL-36, 37	Mystery bounty c-bet and defense shifts	DISCREPANT	Mystery indistinguishable from normal in model output.

Source: mtt-solver-index.md §Sections 1–9

Summary

The largest solver confirmation is MTT-SOL-13 (UTG RFI shape by depth) — the model's non-monotonic VPIP curve matches the published curve closely. SB short-stack direction (MTT-SOL-3, 6) and the 3-bet tightening direction (MTT-SOL-23) are partial matches. Nine solver anchors are discrepant, concentrated in PKO adjustments (MTT-SOL-30..33), mystery bounty (MTT-SOL-36, 37), and the ICM-onset timing (MTT-SOL-26, 27).

Tension 1 — Flat-KO tightens more than PKO (sign error)

Published theory (MTT-LIT-21, 32) predicts that both PKO and flat-KO widen VPIP vs no-bounty normal, with PKO widening more because progressive bounties compound over the tournament lifetime. The model inverts both the direction and the ordering. At BTN across four stack depths, PKO VPIP is -5.0pp to -1.8pp lower than normal; flat-KO is -9.7pp to -8.4pp lower. Both sign and ranking contradict the literature. The most plausible cause: the training rules encode only the instant portion of the PKO reward, not the lifetime carry. Without that carry, the model sees no compounding EV incentive for loose calling under PKO.

Tension 2 — Mystery equals normal (model failure)

Twenty out of 20 mid-stage cells are byte-identical between mystery bounty and normal. The training rules contain a pre-ITM guard that substitutes mystery → normal when the tournament is before the money. Since Round 1 mid-stage queries all triggered this guard, the model never saw a live mystery parameter. MTT-LIT-27 through 30 treat mystery as a distinct format with envelope EV — the rules implementation diverges from that framing. The model IS reading the mystery parameter post-ITM (36/40 post-ITM cells differ), but the pre-ITM gap is a coaching-relevant limitation: do not use the model for mystery bounty strategy in the pre-mystery phase.

Tension 3 — Binary stage training failure

MTT-LIT-35 predicts mid-tournament tightening, MTT-LIT-36 predicts late pre-bubble tightening, and MTT-SOL-26 locates the ICM significance threshold at 50–37.5% of the field remaining. The model collapses all six canonical stages into exactly two VPIP groups — {early, mid, late} vs {bubble, itm, ft} — with a within-group max delta of 0.7pp. The training rules expose only a continuous ratio to the model, not an explicit stage label. The binary cut is a known model limitation. Do not interpolate graduated stage transitions from this model — use external ICM solvers for that.

What we did not verdict

The untested MTT-LIT entries (MTT-LIT-12 magnitude, 13–17, 19–25 covering/PKO asymmetry, 37–40, 57, 60) and the untested MTT-SOL entries (MTT-SOL-8..12, 14–18, 20–22, 24, 25, 28, 29, 34, 35, 38–44) cannot be verdicted because Round 1 did not exercise the relevant cells. Round 2 priority batches (Part 9) address several of these — particularly the PKO covering-asymmetry queries and the fine-grained stage sweep.

2 takeaways for coaches reading published MTT theory alongside this book

Trust the published theory over the model on PKO direction, mystery bounty strategy, graduated stage ICM, and BB defense magnitude. Use external solver anchors for those spots.
Trust the model on cash-like deep-stack non-SB ranges (Part 1), the direction of wider BB defense in MTT (Part 5), and the direction of dry-board c-bet elevation (Part 6).

Research notes

Details for readers interested in the methodology behind the findings above. Skip this section if you just want the practical takeaways.

Cluster-level vs per-ID dispositions. The verdicts in the MTT-LIT table above are at the cluster level — one disposition per topical group spanning multiple literature IDs. Per-ID dispositions (each of MTT-LIT-1 through 60) live in the literature-extracts file. The cluster-level rollup uses the most severe per-ID disposition within the cluster (one DISCREPANT ID makes the cluster DISCREPANT). Similarly, the MTT-SOL table is at the section level; per-anchor dispositions live in the solver-index file.
Disposition definitions. CONFIRMED means direction and magnitude both fall inside the published envelope. PARTIAL means the direction matches but the magnitude does not, or some cells confirm while others remain untested. DISCREPANT means the model and the published reference disagree on direction or magnitude. UNTESTED means Round 1 did not exercise the relevant cells at all. These are not confidence grades — a DISCREPANT verdict may reflect a model limitation, a literature framing error, or both.
Single-checkpoint caveat. All dispositions are based on Round 1 data (497 MTT + 417 Cash queries) against a single model checkpoint. Round 2 will add a second-checkpoint cross-check and the priority batches that unblock the untested clusters. Verdicts may change — particularly for the PKO covering-asymmetry claims (MTT-LIT-21..23, currently untested, expected to gain a partial or discrepant status) and the graduated-stage claims (currently discrepant, may shift to partial if richer stage features are added to training).