Chess Win Probability Calculator
Calculate chess win, draw, and loss probabilities from Elo rating difference.
Enter both ratings to see expected outcomes based on the Elo system.
Where Elo ratings come from
Arpad Elo, a Hungarian-American physics professor, invented the rating system in the 1950s to replace the older (and statistically unsound) Harkness system used by the US Chess Federation. The Elo system was adopted by USCF in 1960 and FIDE (international chess body) in 1970. It’s now used in chess, but also Scrabble, Go, esports, online dating apps, and even Facebook’s old “Hot or Not” rankings.
The core idea: each player has a rating that represents their playing strength. When two players meet, the difference in their ratings predicts the expected outcome.
The expected score formula
Expected Score (for the higher-rated player) = 1 ÷ (1 + 10^((opponent − you) ÷ 400))
The “400” is calibrated so that a 200-point gap produces a roughly 76%/24% expected split (treating draws as half-wins for each side).
Expected score ranges from 0 to 1:
- 1.0 = certain win
- 0.5 = certain draw
- 0.0 = certain loss
- 0.76 = expected to win 3 of every 4 games on average
Rating differences and expected outcomes
| Rating diff | Expected score (higher player) | Approximate W/D/L |
|---|---|---|
| 0 | 0.50 | 35% W / 30% D / 35% L |
| 50 | 0.57 | 42% / 30% / 28% |
| 100 | 0.64 | 49% / 30% / 21% |
| 150 | 0.70 | 56% / 28% / 16% |
| 200 | 0.76 | 64% / 24% / 12% |
| 300 | 0.85 | 76% / 18% / 6% |
| 400 | 0.91 | 86% / 11% / 3% |
| 500 | 0.95 | 92% / 7% / 1% |
| 800 | 0.99 | 99% / 1% / 0% |
Note draws shrink dramatically at high rating gaps — when one player is much stronger, decisive games dominate. At equal strength, draws are very common, especially in classical time controls.
Rating ranges in chess
| Rating | Skill level |
|---|---|
| < 600 | Beginner (just learned rules) |
| 600-1000 | Casual / hobby player |
| 1000-1400 | Improving club player |
| 1400-1600 | Strong club player |
| 1600-1800 | Tournament regular |
| 1800-2000 | Class A / Candidate Master (USCF) |
| 2000-2200 | Expert (USCF) |
| 2200-2300 | National Master (NM) |
| 2300-2400 | FIDE Master (FM) |
| 2400-2500 | International Master (IM) |
| 2500+ | Grandmaster (GM) |
| 2700+ | Super-GM (about 50 players in the world) |
| 2800+ | World Championship contenders |
| 2882 | Magnus Carlsen’s peak (highest in history) |
The world average rating is around 1200-1400 for active tournament players. Online platforms (Chess.com, Lichess) use different scales — typically 100-200 points higher than the equivalent USCF/FIDE rating because of the player pool composition.
Rating change formula
When you play a rated game, your rating changes based on actual vs expected outcome:
New rating = Old rating + K × (Actual − Expected)
Where K is the rating change factor:
- K = 40 for new players (under 30 games)
- K = 20 for established players under 2400
- K = 10 for players 2400+
- K = 10 for FIDE titled players
Example: 1600-rated player beats 1800-rated opponent
- Expected score: 0.24
- Actual: 1.0
- Change: K × (1.0 − 0.24) = 20 × 0.76 = +15.2 rating points
- Opponent loses the same amount
Drawing the 1800 gives the 1600 player a 0.50 actual − 0.24 expected = +0.26 × 20 = +5.2 points.
Conversely, the 1800 expecting to win and only drawing loses 5.2 points.
The Glicko system — the modern alternative
Mark Glickman developed Glicko (now Glicko-2) as an improvement on Elo:
- Tracks rating uncertainty (RD — rating deviation) separately
- Players with few recent games have high uncertainty; their rating moves more
- Active players have low uncertainty; their rating moves less per game
- More accurate during rapid improvement periods
Chess.com, Lichess, and most modern online chess use Glicko-2. FIDE still uses traditional Elo with K-factor adjustments. The two systems converge for established players but Glicko handles new and returning players better.
FIDE vs USCF vs online ratings
The biggest source of confusion in chess ratings: each platform has its own scale.
| Platform | Approximate calibration |
|---|---|
| FIDE rating | International standard; tournament play |
| USCF rating (US) | ~100 points higher than FIDE for same player |
| Chess.com rapid | ~150-250 points higher than USCF |
| Chess.com blitz | ~150-250 points higher than USCF |
| Chess.com bullet | similar to blitz |
| Lichess classical | ~50-100 points higher than USCF |
| Lichess rapid | ~100-200 points higher than USCF |
| Lichess blitz | ~100-200 points higher |
| Lichess bullet | ~100-150 points higher |
So a USCF 1500 player would be roughly Chess.com 1700 rapid, Lichess 1600 classical. The differences come from:
- Player pool composition
- Time control effects (faster = more error = higher variance)
- Initial ratings (Lichess starts everyone at 1500; Chess.com starts at 1200; USCF tournaments seed based on initial tournaments)
Time control affects rating
Strong players are not equally strong at all time controls:
| Format | Time per side | Effect on skill |
|---|---|---|
| Classical | 60-180 min | Pure positional understanding |
| Rapid | 10-25 min | Mixed strategy + calculation |
| Blitz | 3-5 min | Tactics-dominant; opening prep critical |
| Bullet | 1-2 min | Move speed + pattern recognition |
| Hyperbullet | 30 sec to 1 min | Almost pure speed |
A player rated 2000 classical might be 1800 rapid, 1700 blitz, 1600 bullet. Or they might be the reverse — some players excel at fast play, others at slow.
The Magnus Carlsen factor
Carlsen’s peak FIDE rating of 2882 (2014) is the highest ever recorded. To put this in context:
- Carlsen’s expected score against a 2800 player: 0.59 (small advantage)
- vs 2700 (most super-GMs): 0.71
- vs 2500 (typical GM): 0.86
- vs 1500 (club player): 0.997
The Elo system breaks down at the top because there aren’t enough super-strong opponents to fully calibrate the highest ratings. But the rough predictions still hold.
Rating inflation/deflation debate
Chess fans regularly debate whether modern ratings are inflated compared to past eras. The arguments:
Pro-inflation: Modern computing makes openings sharper; players use engine prep; ratings of 2700+ are more common than in the 1990s.
Anti-inflation: Modern players have better training tools; skill has genuinely improved. Comparing Kasparov (1990s peak 2851) to Carlsen (peak 2882) shows real improvement.
Most statisticians conclude there’s no significant inflation — the system is mathematically self-correcting. Strong players replace older strong players over time.
Performance rating
A different concept from regular rating: performance rating is the rating a player would need to achieve their tournament result.
Performance = Average opponent rating + 800 × (score / games − 0.5)
If you score 4/5 against an average opponent rated 1800: performance = 1800 + 800 × (0.8 − 0.5) = 1800 + 240 = 2040.
A 200-point performance over your rating in a serious tournament is a strong result. Tournament norms (IM, GM) require multiple tournaments with specific performance levels.
Bottom line
Elo’s formula: expected score = 1 ÷ (1 + 10^(diff/400)). 100-point gap predicts ~64% to higher-rated player. Different chess platforms calibrate ratings differently — Chess.com online ratings are typically 100-250 points higher than equivalent FIDE/USCF tournament ratings. Time control matters: blitz strength differs from classical. Rating change uses K-factor times (actual − expected). Modern systems like Glicko-2 improve on raw Elo by tracking rating uncertainty.