NFL 2026 · Research

Backtests & the honest record

Everything we tested on real data — including the ideas we killed. The rule: prove it or don't trust it.
Bottom line: you can't beat the betting market on winners, spreads, or totals — we proved it on 1,359 games. But DFS construction (stacking + leverage) is provably +EV, and it's biggest in soft home leagues: ~22% win rate → +160% ROI. So: don't bet sides; build smart DFS lineups.

1 · DFS construction edge proven

Real DraftKings data (2014-2018), 85 full slates, identical projections — only the build changes. Graded on actual points vs a simulated field. dfs_contest_backtest.py
ConstructionCash%GPP top-20%GPP win (top-1%)Soft 12-man WTAWTA ROI
NAIVE — max projection, no stack54.1%31.8%0.0%22.5%+170%
STACK — QB + 2 + bring-back51.8%27.1%1.2%21.3%+155%
LEVERAGE — stack + off-chalk51.8%25.9%3.5%21.8%+161%
Cash = beat the double-up line. Top-1% ≈ a GPP win. Soft 12-man WTA = your home league (must finish 1st of 12; random baseline 8.3%). Read it: cash → don't stack (naive best); big GPP → leverage (top-1% 3.5% vs naive 0.0%); your home league → ~22% win, +160% ROI.

2 · Markets are efficient on sides real data

Continuous cross-season walk-forward, warm-up 2005-2020, evaluated 2021-2025 (1,359 games). backtest_walkforward.py
5-yr winners (SU)
63.4%
market favorite 66.4%
Against the spread
49.4%
below 52.4% break-even
Over/Under
50.4%
coin-flip
Cold-start tax (Wk 1-4)
+8.4 pts
warm 58.1% vs cold 49.7%
Per season (warm model vs market):
SeasonGamesOur SUMarket SUATSO/UScore MAECold SU
202127262.9%62.1%47.4%48.0%8.0358.5%
202227164.2%65.7%49.8%45.9%7.1760.9%
202327260.3%68.0%50.4%54.8%7.6656.2%
202427265.8%71.3%51.5%50.2%7.4662.1%
202527263.6%65.1%48.0%53.3%7.4358.1%

3 · 2025 re-run, warm model (no cold start) real data

Model trained 2005-2024, walked 2025 week by week predicting cold. replay_2025_warm.py
Regular-season ML (winner)
173-99
63.6%
Regular-season O/U
145-127
53.3% (noise — 5yr is 50%)
Weeks 1-4 (warm)
68.8%
vs 51.6% cold-start
Week by week:
WeekML (SU)O/UMarket SUCum ML%Cum O/U%
W111/167/1613/1668.8%43.8%
W212/168/1611/1671.9%46.9%
W313/1611/1612/1675.0%54.2%
W48/169/1610/1668.8%54.7%
W54/149/145/1461.5%56.4%
W69/158/1511/1561.3%55.9%
W710/1510/1512/1562.0%57.4%
W810/139/1311/1363.6%58.7%
W98/146/148/1463.0%57.0%
W109/146/148/1463.1%55.7%
W119/158/1511/1562.8%55.5%
W1213/148/1410/1465.2%55.6%
W1310/164/169/1664.9%53.1%
W148/1410/147/1464.4%54.3%
W1511/168/1610/1664.7%54.0%
W169/167/1610/1664.2%53.3%
W178/166/169/1663.3%52.3%
W1811/1611/1610/1663.6%53.3%
Wild Card4/65/63/663.7%54.0%
Divisional4/43/44/464.2%54.3%
Conf Champ2/21/22/264.4%54.2%
Super Bowl1/11/11/164.6%54.4%

Ideas we killed (so we don't chase them)

Same rigor, honest results — all refuted on real data:
TheoryVerdict
Beat the spread / totals with a model❌ 49% ATS / 50% O/U over 1,359 games — market is sharp
"Primetime favorites don't cover"❌ dogs 51.7% (n=261), below break-even — noise, not rigged
Chinese zodiac (Pig/Snake, Rat fade)❌ age confound — zero independent signal (flavor only)
Out-project the field on player points❌ no model beats the season-average baseline (MAE ~4.7)
All backtests on real nflverse + DraftKings data. Deterministic & re-runnable. Generated 2026-06-20.
The point of this page: keep us honest. Bet the edge we proved, not the ones we wish we had.