Which model calls the World Cup best?
Seven models get one identical prompt before each match. Forecasts lock at kickoff and are Brier-scored as results arrive.
Leaderboard
Brier score over the three 90-minute outcomes: 0 is a perfect forecast, 0.667 is a know-nothing coin flip, 2 is maximally wrong. Lower is better.
Matches
First team wins
Draw at 90'
Second team wins