Elo rating, named after the Hungarian-American physicist Arpad Elo, is a method for calculating the relative skill levels of players in zero-sum games such as chess and competitive video games. At its core, the system is designed to be self-correcting; a player's rating should reflect their statistical likelihood of winning against opponents of varying strengths. The fundamental principle is that victory against a highly-rated opponent yields more points than a win against a lower-rated player, while a loss to a weaker opponent incurs a heavier penalty. This dynamic ensures that the rating number is not just a static score, but a living metric of performance.
Understanding the Core Mechanics
The calculation relies on a few key variables: your current rating, your opponent's rating, and the outcome of the match. The system assumes that a player's performance follows a normal distribution curve, meaning most results cluster around the average with fewer extreme outcomes. Because of this, the algorithm uses the difference between two ratings to predict an expected score. This expected score is a decimal representing the probability of winning; for example, an expected score of 0.75 means the player has a 75% chance of winning based purely on the rating gap.
The Expected Score Formula
Before the actual result is applied, you must calculate the expected score (E). This is done using a logistic curve formula that translates the rating difference into a probability. The standard divisor used is 400, which means a player who is 400 points higher is expected to win approximately 91% of the time. The mathematical relationship ensures that the gap required to predict a 50% win rate is zero, and the curve steeply approaches 100% or 0% as the difference increases. This mathematical rigor prevents volatile swings and creates a stable ranking environment.
Applying the K-Factor
Once the expected score is determined, the actual result (win, loss, or draw) is compared against it to calculate the rating adjustment. This adjustment is moderated by the K-factor, which acts as a multiplier for how much a single game can change your rating. A high K-factor makes the system volatile, allowing rapid movement for new players or those on a hot streak, while a low K-factor prioritizes stability for established veterans. Choosing the right K-factor is crucial; many organizations use a baseline of 20 or 30, but this value often decreases as a player reaches a high rating ceiling to prevent inflation.
The Calculation Step-by-Step
To determine the final rating change, you multiply the K-factor by the difference between the actual score (S) and the expected score (E). The actual score is straightforward: 1 point for a win, 0.5 for a draw, and 0 for a loss. If a player with a rating of 1500 expects to win against a 1400 opponent (E = 0.64) but loses (S = 0), the calculation would be 30 * (0 - 0.64), resulting in a loss of approximately 19 points. Conversely, pulling off an upset victory would yield a significant gain, rewarding the risk taken against the odds.
Practical Applications and Variants
While the original Elo system was designed for two-player games, modern implementations have adapted it for team-based scenarios and multiplayer free-for-alls. One common approach is to treat team ratings as the average of the individual players, calculating the team's expected score based on the aggregated value. In games with more than two participants, the points exchanged are often scaled down to ensure that the total sum of rating points in the system remains constant, preventing the overall pool from growing indefinitely.