Skip to main content

Scoring Rules

Brier Score

The Brier score is a commonly-used scoring rule (sometimes also called "quadratic scoring") that compares a set of predictions to actual outcomes. For a single forecast of probability pp, it is computed as S=(pf)2S=(p-f)^2, where f=1f=1 if the event occurred, and f=0f=0 if not. If you forecast 100% and an event occurred, your Brier score would be 00. If you forecast 100% and it did not occur, your score would be 11. If you were to guess 50% for every binary question, your mean Brier score would be 0.250.25.

Log Score

The log score is another common scoring rule. Outside tournaments, Metaculus uses a variant of the log score. For a single binary forecast of probability pp, the log score is computed as S=(log2p)+1S=(\log_2 p)+1 if the event occurred, and S=(log2(1p))+1S=(\log_2 (1-p))+1 if not. The scaling is chosen so that higher scores are better, and a maximally-uncertain prediction (p=0.5p=0.5) gives S=0S=0. For continuous questions, the score is computed as S=log2pS=\log_2 p, where p is the value of the predicted probability density at the resolved value (as can be read off from the plot on the question).

Fun Fact

A variant of the log score is used to calculate Metaculus points and tournament scores.