Pythagorean NHL Standings
About this time last year, I made several attempts at coming up with an equivalent to baseball's "Pythagorean Expectation." For those of you unfamiliar with the term, it was invented by baseball stat guru Bill James, and it describes the relationship between runs scored/runs allowed and wins/losses, then derives "standings" from a simple equation. In this post, I attempt to do the same thing for hockey. For those of you that I haven't lost with the words: Pythagorean, stat, derive, equation and James, a table and full explanation follow after the jump.
For those uninterested in the process behind all this, here are the standings.
For the sake of comparison, here are the standings based on actual points, note that these may not correlate to the NHL's standings because they don't account for tiebreakers.
Finally, since the most interesting part of Pythagorean systems is to see which teams are getting "lucky" and which are "unlucky," here's a table of the teams ranked from unluckiest to luckiest, based on delta between their expected points and their actual points - the higher the number, the luckier the team has been.
From a Caps perspective, this is sort of typical good news/bad news. The Caps have been getting somewhat "lucky," but in actuality, this is likely due to several lopsided defeats that the Caps have suffered (vs. ATL, NJD and NYR). The good news is that Tampa has been getting even luckier, and though they suffered a 6-0 drubbing at the Caps' hands, I'm not sure that they had a spate of awful play similar to the Caps' recent skid. They did, however, have awful goaltending for much of the season to date. The other bad news is that Atlanta and Florida have both been playing better than their records would suggest.
For those of you interested in the process behind all this, read on...
The first problem encountered with creating a Pythagorean system for hockey is that, unlike baseball, hockey results aren't binary. There are wins, losses, and overtime/shootout losses, so the James equation won't work. The NHL further muddies the water with the points system: two for a win, zero for a loss and one for an OT/SO loss. The NHL standings aren't based on Win-Loss records, they're based on the points that teams accumulate. This actually proved to be a saving grace.
Going back over the years since the lockout, I determined that there were several key numbers that were constant enough to be used as the basis for a possible Pythagorean Expectation for the NHL. First, the average number of points earned by teams in the NHL since the lockout is remarkably constant at 91.3. This means that the average team playing 82 games would earn 1.1134 points per game. Moreover, the "average" NHL team also scored exactly as many goals as it let in since, by definition, there are exactly as many goals scored every year as there are allowed. Armed with these two numbers, I calculated that to move one standings point away from the mean (91.3 points), a team would need to either score or not allow a combination of around 2.64 goals.
For this iteration of Pythagorean standings, I decided to subtract empty net goals from the goals for and against totals. The reasoning behind this is simple - empty net goals aren't a great barometer of skill disparity. They're simply a great barometer of which teams happen to enter the late stages of the 3rd period with a one to two goal lead.
The final equation ends up being:
Pythagorean Points = [(Goals For - Empty Net Goals For) - (Goals Against - Empty Net Goals Against)]/2.64 + (Games Played * 1.113414)
Here's the resultant table in alphabetical order.
Thoughts, suggestions and corrections all welcome. Any data entry errors are my own.
If this FanPost is written by someone other than one of the blog's editors, the opinions expressed in it do not necessarily reflect those of this blog or SB Nation.
29 comments
|
10 recs |
Do you like this story?
Comments
i hope atlanta picks up a lot of fans in the next few years. that’s an exciting young team that casual fans of other sports should be able to get behind. 6th in the standings and “unlucky” is a good place for that young core of players to be.
True dat
Kind of makes me wish I lived in Hotlanta… so I could thoroughly enjoy lower-bowl, Center Ice seats to the ThrashHawks games for on $25 a pop.
"Baseball was my first love... hockey is a sultry temptress and stole my heart." - Corey Masisak
Southleast what!?!?!?!?!?!?
I love that the division is doing more than just showing a little life, err well, as long as we still come out on top. I guess it’s somewhat inevitable though. The cap and CBA basically ensure all teams will be at least decent every 4-5 years or so. Unless you pull a Toronto and trade all your picks away and make horrible moves on the open market.
by Bushwood Bushwhacker on Jan 5, 2011 7:39 PM EST reply actions
It’s funny, because everyone thinks Tampa is at the forefront of the SE renaissance, whereas ATL and FLA have improved as much based on goals for and against.
T-R-A-P: TRAP, TRAP, TRAP!!!
Yea, it’s true. The Yzerman effect maybe, and also obviously because they’re on top of the division (aided by a touch of luck it seems.)
by Bushwood Bushwhacker on Jan 6, 2011 10:10 AM EST up reply actions
TBL also had the best core to work around.
Drunk Guy: "Alex Ovechkin is playing more like Magic Johnson than Michael Jordan this year."
Laichitor: "He has AIDS?"
It looks as though ATL effectively imported a new core from Chicago though. We’ll see how they do down the stretch, particularly now that Tampa might get decent goaltending. Nevertheless, I think ATL will be better overall because I think they’re a tougher team to play against on a nightly basis. Tampa has more top-end skill, but the drop-off from those guys to their 3rd and 4th line is pretty steep. ATL is the kind of team that would be a scary playoff matchup for any top seed.
T-R-A-P: TRAP, TRAP, TRAP!!!
ATL is probably deeper, and definitely has the better blue line and goaltending, but I dispute your characterization of the core. They got some nice players from CHI, but they aren’t anywhere near what TBL is working with. Buff and Ladd are great depth players, but MSL and Stamkos are studs, and Lecavalier, as maligned as he is, would still be the best forward on ATL by a country mile.
I do agree that ATL is more difficult to play against, but if TBL gets goaltending they could be just as scary as ATL. Difference is, TBL might actually be one of those top seeds, ATL won’t.
Drunk Guy: "Alex Ovechkin is playing more like Magic Johnson than Michael Jordan this year."
Laichitor: "He has AIDS?"
I agree that TBL has the better core up front, but ATL has much better depth at D and a better #1G (for now). ATL has better depth, and I like the work ethic that Ramsay has instituted on that team. TBL looks a bit like a team that tries to win with skill, and I think we’re all familiar with how that can turn out when the skill gets stymied.
T-R-A-P: TRAP, TRAP, TRAP!!!
If I am understanding this right, is it basically that a negative number means the team should have more standings points based on their goal differential, while a positive means they should have less standing poins based on goal differential?
There is no problem a hammer cannot either fix or make irrelevant.
Yep. Positive means “lucky,” whereas negative means “unlucky.” Florida should have six more points than they do, whereas Tampa should have nine less. I had it the other way around at first, but this seemed more intuitive. Let me know if there is a better way of phrasing it.
T-R-A-P: TRAP, TRAP, TRAP!!!
Let me know if there is a better way of phrasing it.
Tampa Bay should have nine fewer.
The guy is Peter Schumpmaker. Lord knows what a schump is, but you can bet your bippy his ancestors made them. What he's doing is far worse than crafting fine schumps.
by Steckel Me Elmo on Jan 6, 2011 12:45 AM EST up reply actions 7 recs
Don’t worry, you have a lot more people with you who don’t know when to use fewer and less.
Release the Mackan!
by Killer_Carlson on Jan 6, 2011 2:04 PM EST up reply actions
Empty Net Goals
Here’s the above table ranked from highest ENG differential to lowest. The teams at the top scored more empty-netters than they’ve given up, while those at the bottom gave up more than they’ve scored.

Interestingly, there seems to be a strong correlation between the teams that out-score their opponents in ENG, and those that have point totals higher than expected based on goal differential. It’s particularly interesting since my original data excluded ENGs. Here’s a graph showing the correlation:

T-R-A-P: TRAP, TRAP, TRAP!!!
I’ve noticed that I Caps pretty consistently capitalize on ENG opportunities without giving up those goals to their opponent, more than most teams. (part of that, of course, is finding themselves in a lot of 1 and 2 goals games) Nice to see that, for once, my eyes aren’t deceiving me.
nice work on all of this, btw.
Pledge Drive 2010-2011: SO KIDS CAN!! Help build a playground
If this were me, I’d take advantage of goals being distributed approximately randomly, i.e. use a Poisson distribution instead of an empirical linear equation. Take out ENG and OT goals, and run a correlated Poisson distribution on the per-game mean figures. If you give me your data set (because I’m too lazy to do this myself, of course), I have a program set up to do this more or less automatically; I’m mildly curious where our differences would lie after writing this paragraph.
It’s any interesting theory. Teams like PHX whose margin for error is slim due to inconsistent offensive production are always going to be lower in these standings than the actual ones.
The Artist Formerly Known as CP2Devil.
Contributor at Five For Howling.

by 


































