Backtest performance

Honest chronological holdout: each match is predicted using only ratings from earlier matches. Hyperparameters are grid-searched on the same post-cutoff reporting set for this first build, so treat these as baseline diagnostics, not a locked contest-grade out-of-sample score.

Out-of-sample Brier
0.2121
Accuracy
65.4%
Coin-flip baseline Brier
0.2475
Matches scored
1676

Data and tuned parameters

Source
Cricsheet normalized completed men's matches; chronological Elo holdout from 2024-01-01 onward
Holdout
chronological out-of-sample from 2024-01-01
K
40
Home Elo
0
Temperature
0.9
Cross-format carryover
0.08

Metrics

FormatMatchesLog lossBrierAccuracy
TEST890.58780.203265.2%
ODI2220.70030.248055.9%
T2013650.60400.206866.9%
ALL16760.61590.212165.4%