Scoring Rules
Brier Score
The Brier score is a commonly-used scoring rule (sometimes also called "quadratic scoring") that compares a set of predictions to actual outcomes. For a single forecast of probability , it is computed as , where if the event occurred, and if not. If you forecast 100% and an event occurred, your Brier score would be . If you forecast 100% and it did not occur, your score would be . If you were to guess 50% for every binary question, your mean Brier score would be .
Log Score
The log score is another common scoring rule. Outside tournaments, Metaculus uses a variant of the log score. For a single binary forecast of probability , the log score is computed as if the event occurred, and if not. The scaling is chosen so that higher scores are better, and a maximally-uncertain prediction () gives . For continuous questions, the score is computed as , where p is the value of the predicted probability density at the resolved value (as can be read off from the plot on the question).
A variant of the log score is used to calculate Metaculus points and tournament scores.