Much verbiage lately on what you can count on in terms of how a team performs this year influencing how it performs next year. The standard approach is to take your variable of interest, run a correlation from one year to the next, and look at the result.
The better the correlation, the more confident you can be that you are likely looking at a repeatable skill as opposed to just random variation. My general rule of thumb: 1/3 (up to 33%) is uncorrelated, 33% to 66% is a real correlation but with a generous number of confounding factors, and 67% and above is highly indicative. Of what, you have to figure out – but it’s definitely indicating something.
One last warning: these are correlations. A classic sciencey mistake is to confuse correlation (two things seen together) and causation (one causes the other). You gotta use some brains and logic to determine if you’re truly seeing a causative relationship.
So, for part 1 of this two (maybe three) part series, let’s take a look at a number of variables from 2013/14 to this season (we’ll make the hopefully safe assumption that with a single game left, the numbers for 2014 are not going to change in any meaningful way), and see which ones “stuck”.
All data you see here is sourced from the terrific folks over at war-on-ice, and the points from NHL.com.
Let’s start with the grandaddy – points. If you’re good (or bad) one year, you should be good or bad next year, right?
As you’d expect, there’s a pretty good relationship one year to the next. Is it unsurprising or surprising to you that it’s only about 55%?
Even Strength SCF%
As expected, it’s pretty sticky. Better than points, even. This is what the stats geek are pointing to when they talk about repeatability and predictability.
Unsurprisingly – the venerable EV Corsi is also representative of a repeatable skill. Again, stickier than points.
This shouldn’t – but may – come as a surprise that PDO (Sh% plus Sv%) does not carry over year to year. It means there is an awful lot of randomness embedded in this number (and since the two of them automatically sum to 1.0, this is where the idea that they will regress comes from).
Corollary: you can’t coach PDO.
By the way, notice the three incredibly horrible PDO teams this year. Those are ARI, CAR, and EDM. Notice that all three were decent (99-100) in PDO last year. Unlucky for us that these things didn’t carry over from last year to this year. Lucky for us these things are unlikely to carry over to next year, yes?
It means we can expect the Oilers to be a bad team next year because of managerial incompetence, not PDO!
All Situations Sh%
On first glance, that correlation is low but higher than expected – 30%? But the chart tells a clearer story, that it’s mostly a random effect. There’s not much of a correlation there. It’s more a small cluster of teams sitting right around the middle. Regressing to the meat, you might say. Team shooting is not nearly as repeatable a skill as people think it is.
All Situations Sv%
Again, you’d think that there would be a higher measure of consistency in goaltending from year to year, but there really isn’t. I suspect that’s because a. goalies are voodoo, with inherently high variability in performance year to year, and b. teams with horrific goaltending one year tend to replace the goalie the next year, and that guy often comes from a team with good goaltending.
All Situations FO%
Ah hah! Sanity restored. Good faceoff teams have a reasonable tendency to stay good faceoff teams.
So there you have it: count on SCF% and CF%.
Definitely some repeatability to points and FO%.
Do not count on PDO, Sh%, or Sv% to stay where they are.
Next edition: a look at which variables correlate well with other, different variables, year to year.
P.S. I have posted the CSV file where I gathered this information in the Data section, in case you want to try any of this cr*p for yourself!