Improving Pythagoras

Research Ratings Home About Us
 

Home

Up
Site Map
Search

Pythagorean Wins

One of Bill James's contributions to sabermetrics is Pythagrean Expectation, which attempts to relate a team's runs scored (RS) and runs allowed (RA) to its wins (W) and losses (L).  James conjectured

Though it is widely accepted, this formulation has two short-comings:

  1. Estimated wins and losses for all teams based on this equation are seldom equal.  Clearly, since every game has a winner and a loser, total wins must equal total losses, league-wide.  So-called Pythagorean standings can show, paradoxically, that the entire league is better (or worse) than itself.
  2. Using only the ratio of runs scored to runs allowed is problematic.  Consider two situations where two teams (A and B) play two games against each other:
    1. Team A scores 2 runs, and Team B scores 1.
    2. Team A scores 20 runs, and Team B scores 10.

    Since ties are prohibited, case (a.) can only occur if A wins a game by a 2-0 final and loses the other game 0-1.  Team A's record must be 1-1.  If we assume runs are equally likely to be scored in either game, Team A wins at least one game but will win both games about 95% of the time, resulting in 1.95 expected wins. The Pythagorean expectation predicts A to win 1.6 games in both cases.


Conditional Inference

Both problems of Pythagorean Expectation can be solved by considering wins and losses conditional on run totals.  While the mathematics can be difficult for real data sets, the premise is straightforward.  We'll illustrate the idea using a round-robin tournament between three teams (A, B and C) with the following results:

    A    3            A    4            B    3
    B    2            C    1            C    2

which can be represented in a table.

  A B C Runs Scored
A (2-0) -- 3 4 7
B (1-1) 2 -- 3 5
C (0-2) 1 2 -- 3
Runs Allowed 3 5 7  

One way to predict win totals based on runs is to consider the permutation distribution1 of cell entries based on holding row and column sums constant.  For this example, the only other possible tables with the same run totals are:

  A B C Runs Scored
A (1-1) -- 2 5 7
B (1-1) 3 -- 2 5
C (1-1) 0 3 -- 3
Runs Allowed 3 5 7  

 

  A B C Runs Scored
A (2-0) -- 4 3 7
B (1-1) 1 -- 4 5
C (0-2) 2 1 -- 3
Runs Allowed 3 5 7  

 

  A B C Runs Scored
A (1-1) -- 5 2 7
B (1-1) 0 -- 5 5
C (1-1) 3 0 -- 3
Runs Allowed 3 5 7  

However, if we assume that runs are equally likely to be scored in any game, some of these tables are more likely than others, meaning win-loss records based on them should be weighted accordingly.  For this example, the four run tables should occur with probabilities2  0.581, 0.116, 0.291 and 0.012, respectively.

This suggests that conditional on run totals, Team A should win 1.872 games3 and lose 0.128  Similarly, the expected win-loss records of B and C are 1-1 and 0.128-1.872, respectively.  Note that unlike the Pythagorean Expectation, these expected records will always satisfy the property that total wins and total losses each equal total games played.


1. A permutation distribution is the basis for so-called "exact tests" which require enumeration of all possible data sets given certain constraints (in this case runs scored and allowed by each team).

2. Table probabilities follow a non-central hypergeometric distribution, and in general can not be calculated in closed form.  However, Markov Chain Monte Carlo (MCMC) techniques are available for simulating tables with the correct frequencies.

3. The expectation is calculated by taking a weighted average of wins (which are directly computed from the table) with weights corresponding to the probability of the particular arrangement of runs.  Here 1.872 = (0.581)(2) + (0.116)(1) + (0.291)(2) + (0.012)(1).


Advantages of the Conditional Approach

In addition to preserving league-wide wins and losses, our conditional procedure provides a complete distribution of wins and losses for each team.  Conversely, the Pythagorean Expectation is merely a single number.  The advantage of having the complete joint distribution of wins is that it allows us to compute many potentially interesting quantities:

  1. Win Percentile.  Given these run totals, how often would a particular team win at least as many games as it did?
  2. Playoff/Division Races.  Given these run totals, how often could a particular team expect to win its division or win the wild-card?

The potential research questions are limitless.


Results

We applied the SportsQuant conditional win procedure to seven years (1998-2004) of Major League Baseball data.  The following table gives win totals with conditional expectations in parentheses.

  1998 1999 2000 2001 2002 2003 2004
Angels 85
(81.7)
70
(66.5)
82
(80.4)
75
(75.8)
99
(106.7)
77
(79.8)
92
(94.0)
Astros 102
(112.6)
97
(99.7)
72
(80.3)
93
(90.6)
84
(87.9)
87
(97.3)
92
(94.3)
Athletics 74
(73.7)
87
(86.8)
91
(96.4)
102
(110.4)
103
(99.5)
96
(97.2)
91
(87.6)
Blue Jays 89
(87.7)
84
(83.7)
83
(75.5)
80
(82.8)
78
(79.1)
86
(89.0)
67
(67.5)
Braves 106
(112.4)
103
(103.6)
95
(93.3)
88
(92.7)
101
(100.2)
101
(101.1)
96
(98.4)
Brewers 74
(67.7)
74
(71.9)
74
(70.7)
68
(72.6)
56
(55.9)
68
(61.2)
67
(64.2)
Cardinals 84
(84.9)
75
(76.8)
95
(95.2)
93
(97.7)
97
(99.0)
85
(90.6)
105
(105.6)
Cubs 90
(86.2)
67
(60.1)
65
(64.0)
88
(90.8)
67
(74.0)
89
(87.1)
89
(97.0)
Devil Rays 63
(63.6)
69
(64.1)
69
(66.9)
62
(54.6)
55
(51.2)
63
(63.9)
70
(64.5)
Diamondbacks 65
(62.1)
100
(109.2)
85
(85.9)
92
(99.1)
98
(99.2)
84
(85.1)
51
(45.6)
Dodgers 83
(79.7)
77
(81.6)
86
(89.9)
86
(82.7)
92
(90.2)
85
(83.5)
93
(90.9)
Expos 65
(62.9)
68
(64.0)
67
(61.0)
68
(62.7)
83
(93.1)
83
(80.3)
67
(63.3)
Giants 89
(94.6)
86
(85.8)
97
(102.5)
90
(87.5)
96
(102.7)
100
(95.8)
91
(90.6)
Indians 89
(89.9)
97
(98.1)
90
(96.9)
91
(90.1)
74
(69.0)
68
(70.7)
80
(81.1)
Mariners 76
(81.2)
79
(75.7)
91
(96.5)
116
(117.1)
93
(95.2)
93
(101.4)
63
(65.1)
Marlins 54
(50.3)
64
(60.6)
79
(72.1)
76
(80.7)
79
(72.4)
91
(88.7)
83
(83.4)
Mets 88
(89.1)
97
(99.0)
94
(89.8)
82
(71.3)
75
(78.6)
66
(65.6)
71
(74.8)
Orioles 79
(85.1)
78
(85.5)
74
(66.8)
64
(63.2)
67
(67.2)
71
(71.7)
78
(82.6)
Padres 98
(96.1)
74
(71.7)
76
(73.1)
79
(78.1)
66
(61.2)
64
(61.4)
87
(89.1)
Phillies 75
(79.7)
77
(80.2)
65
(65.5)
86
(84.5)
80
(78.5)
86
(93.1)
86
(88.3)
Pirates 69
(72.3)
78
(79.5)
69
(69.5)
62
(55.5)
72
(68.5)
75
(75.1)
72
(72.0)
Rangers 88
(89.2)
95
(91.2)
71
(66.4)
73
(72.3)
72
(76.2)
71
(64.3)
89
(89.2)
Red Sox 92
(99.2)
94
(96.0)
85
(87.1)
82
(84.0)
93
(104.8)
95
(98.6)
98
(102.7)
Reds 77
(79.7)
96
(100.5)
82
(89.2)
66
(66.7)
78
(72.5)
69
(57.2)
76
(62.1)
Rockies 77
(77.5)
72
(67.3)
82
(89.2)
73
(82.9)
73
(66.4)
74
(76.2)
68
(70.4)
Royals 72
(58.2)
64
(73.0)
77
(75.1)
65
(65.1)
62
(62.6)
83
(77.1)
58
(58.3)
Tigers 65
(63.7)
69
(64.0)
79
(80.5)
66
(62.3)
55
(44.3)
43
(40.0)
72
(78.9)
Twins 70
(70.6)
64
(60.5)
69
(68.7)
85
(81.7)
94
(87.6)
90
(86.2)
92
(89.5)
White Sox 80
(73.5)
75
(69.7)
95
(93.0)
83
(81.5)
81
(87.9)
86
(90.5)
83
(85.1)
Yankees 114
(117.7)
98
(101.7)
87
(87.5)
95
(92.1)
103
(104.4)
102
(101.4)
101
(91.9)

Results

Over the seven seasons studied, we ranked each team according to wins in excess of those expected.  Teams with actual win totals greater than their conditional expectations overachieved while teams whose win totals lagged their conditional expectations performed poorly.  A few interesting observations are worth making.

  1. The 2001 Seattle Mariners set the record for most wins in a season (116), however our results indicate that they were even better than their gaudy record suggests.
  2. Although the Expos, Devil Rays, Brewers and Tigers were perpetual laughing-stocks, they ranked 1st, 3rd, 4th and 6th, respectively in terms of win differential.  This underscores the lack of talent on those teams.  Their dismal performance still exceeded expectations.
  3. The Red Sox were 30th (dead last) of the teams over the course of this study.  Without exception, the team won fewer games than would be expected for a team with their offensive and defensive abilities.  Their regular-season performance in 2004 (the season in which they eventually won the World Series) was typical -- namely 5 wins fewer than expected.
  4. There is growing sentiment that Joe Torre is a Hall-of-Fame caliber manager.  However, the Yankees rank 14th (of 30 teams) in win differential, indicating that they win about as many games as they should given their perennially talent-laden roster.

Reference

Annis, D. H. (2005) Approximate Conditional Inference for Evaluating Managerial Performance. (unpublished manuscript)

Back to Top

 
Send mail to webmaster@sportsquant.com with questions or comments about this web site.
Copyright © 2005-2008 David H. Annis, Ph.D.
Last modified: 01/05/2008