What Should We Expect from NHL Goaltenders?

Analytic minds of the blogosphere are starting to converge on a new frontier: analyzing goaltending. We at Eyes on the Prize have been blessed to have Chris Boyle give us incredibly in depth analysis on the position that is seen nowhere else on the internet, but something came up recently in his piece on Marc-Andre Fleury that got me thinking… what exactly is replacement level goaltending?

Now, I’m not the only blogger out there that is trying to answer this question. Cam Charron earlier this year used it to try and give comparables to Winnipeg Jets starter Ondrej Pavelec, using other metrics other than league save percentage like measuring quality starts and blow-ups. I didn’t go this route with my own analysis, but it’d be worthwhile to build a database of these over time. Scott Reynolds of The Copper & Blue earlier this month decided to calculate a replacement level save percentage using all NHL goalies that had 15 or fewer games in a season. He split up even strength and shorthanded save percentages and found these goalies had a much lower threshold than the league average. It makes a lot more sense than just using the league average: clearly there are different expectation levels for goalies who are called up from the minors than those that have guaranteed NHL jobs, and largely the NHL does a decent job of picking who their best goalie is.

My methodology started with a similar threshold. Reynolds posted that on May 1, while I had been working on the charts in April, so they were come up with independently. I wanted to separate goalies by their usage: starters, platoons, backups, and replacements. I came up with my own cut-offs to determine this: anyone who played more than 60% of the minutes in an 82 game regulation schedule would be considered a starter, platoon goaltenders played between 40 and 60%, backups played in over 10 GP a year and up to 40% of a team’s minutes, and the rest were replacements. I broke out every year since 2003-04, since I was doing this for a project I’m working on about the 2003 Entry Draft that I’ll be posting on throughout June. I looked at save percentage only, but broke it between regular save percentage (all situations) and even strength only.

If you’re wondering why splitting up the situations is important, team effects can greatly contribute to a goalie’s save percentage on the Penalty Kill. While it is often said that your goaltender has to be your best penalty killer, teams with good penalty kills have to be better at preventing shots and prime scoring chances in order to succeed over the whole season. Therefore, a goalie’s save percentage on the penalty kill is less of a factor of his individual skill level than at even strength, although I do agree with Chris Boyle in that teams can design systems that make it a little easier on a goaltender to stop the puck at a higher rate at even strength. However, those fluctuations won’t be to the same level a good PK unit can effect a goalie’s save percentage. Even strength is a better indicator of a goalie’s skill level over time, but we’ll include both measures here.

After the jump are some of the results.

There are 4920 minutes in an 82 game season, excluding overtime. I can’t possibly be bothered to add all the overtime minutes together and figure out a threshold based on the actual minutes, so I kept it simple: all goalies had the potential to play 4920 minutes for their team, so we’ll base our classifications on them. 60% of 4920 minutes is 2952 minutes, which is just over 49 full games. The least amount of appearances a goalie made in this study that classified as a starter was Nicklas Backstrom in 2010-11, with 51 GP. I think that’s an acceptable figure for what we’d consider a starting goaltender, a good sample size. 40% of minutes played is 1968 minutes, or just under 33 GP, and Michael Leighton in 2003-04 had the fewest appearances of any goalie in this range with 34 GP, although a number of goalies had 35 GP and didn’t reach 1968 minutes. These are definitions I’ve made up, if one doesn’t like them, they’re free to adjust them, they just made sense to me. Generally, platoon goalies played in at least 35 games and up to 50. Backups, then, appeared in 11-35 GP. Here’s the sample size for each year:

Year	Starters	Platoon	Backups	Call-ups	Total
2003-04	20	15	29	30	94
2005-06	15	19	34	21	89
2006-07	22	12	30	20	84
2007-08	22	11	26	30	89
2008-09	21	12	31	25	89
2009-10	19	16	28	20	83
2010-11	24	8	32	23	87
2011-12	22	13	26	27	88

My educated guess for 2005-06 having more platoon goalies than starters is this: with the NHL lockout, teams lost a year of up close evaluation of their goaltenders and it took longer for them to figure out who their better goalie exactly was (there were more backups that year than any other as well). Every year since has been more normal, with 2010-11 being a bit of an extreme in favour of teams employing a pure starter/backup system. You’ll also notice a slight CBA effect: 2003-04 had the largest amount of goalies called up, when there were less restrictions on callups than there were post-lockout.

From here, I figured out the average save percentages for each group of goalie, but you’ll notice that it didn’t quite work out as linear as I had hoped:

**Even Strength Save Percentage**
Year	Starters	Platoon	Backups	Call-ups
2003-04	.925	.922	.917	.908
2005-06	.922	.914	.908	.900
2006-07	.922	.916	.907	.916
2007-08	.923	.917	.915	.903
2008-09	.921	.921	.916	.900
2009-10	.923	.917	.914	.916
2010-11	.925	.918	.915	.908
2011-12	.924	.919	.917	.912

**Total Save Percentage**
Year	Starters	Platoon	Backups	Call-ups
2003-04	.914	.910	.907	.899
2005-06	.907	.902	.894	.884
2006-07	.910	.909	.893	.900
2007-08	.912	.909	.904	.893
2008-09	.912	.909	.903	.891
2009-10	.915	.910	.904	.901
2010-11	.917	.909	.907	.903
2011-12	.917	.910	.910	.903

As you can see, for the most part, the divisions work, but due to the random nature of sample sizes, sometimes teams are forced to play really bad backups due to injuries, or a call-up has a particularly strong run and it muddles up the numbers at the bottom end. It also shows that it is now harder to score a goal on the top goalies in the NHL today than it was the year before the lockout, and goalies are making one more save every 100 shots than the first year after the lockout, depressing goal totals significantly.

Due to this finding, I decided to expand the sample size for defining a replacement level goalie. Fortunately, the NHL has their own standard definition of what an acceptable sample size is for goalies. It’s based on Games Played, not minutes, but who am I to argue against Big Hockey? For determining the awards for the William M. Jennings Trophy (Top Team GAA) and the Roger Crozier Saving Grace Award (Top Save Percentage), the NHL believes a goalie must appear in 25 games to be considered. So, according to the NHL, goalies who play less than that are not really statistically relevant. Perfect.

Therefore, I calculated a Replacement Level SV% based on all goaltenders that appeared in less than 25 GP. Here are the results:

**Replacement Level SV%**
Year	Even Strength	All Situations
2003-04	.911	.901
2005-06	.903	.890
2006-07	.905	.892
2007-08	.911	.901
2008-09	.909	.898
2009-10	.915	.906
2010-11	.915	.908
2011-12	.914	.906

I provide all these figures for future comparisons of goalies by save percentage. From here, I developed a RelSV% metric, basically a +/- figure comparing goalies to others in their peer group. Here are the RelSV% figures for the 2011-12 starters, as well as how they performed compared to a replacement level goalie:

**2011-12 Starting Goaltenders**
Rank	Player	Team	Rel EVSV%	Rel SV%	Rel Comb.	+/- RL (EV)	+/- RL (All)
1	Mike Smith	PHX	.012	.013	.025	+22	+24
2	Henrik Lundqvist	NYR	.009	.013	.022	+19	+24
3	Jonathan Quick	LA	.009	.012	.021	+19	+23
4	Pekka Rinne	NAS	.004	.006	.010	+14	+17
5	Jimmy Howard	DET	.005	.003	.008	+15	+14
6	Miikka Kiprusoff	CGY	.004	.004	.008	+14	+15
7	Roberto Luongo	VAN	.005	.002	.007	+15	+13
8	Kari Lehtonen	DAL	.002	.005	.007	+12	+16
9	Tim Thomas	BOS	.003	.003	.006	+13	+14
10	Jose Theodore	FLA	.004	.000	.004	+14	+11
11	Antti Niemi	SJ	.002	-.002	.000	+12	+9
12	Ryan Miller	BUF	-.002	-.001	-.003	+8	+10
13	Semyon Varlamov	COL	-.001	-.004	-.005	+9	+7
14	Craig Anderson	OTT	-.004	-.003	-.007	+6	+8
15	Cam Ward	CAR	-.005	-.002	-.007	+5	+9
16	Carey Price	MTL	-.006	-.001	-.007	+4	+10
17	Ilya Bryzgalov	PHI	-.003	-.008	-.011	+7	+3
18	Marc-Andre Fleury	PIT	-.009	-.004	-.013	+1	+7
19	Jonas Hiller	ANA	-.009	-.007	-.016	+1	+4
20	Ondrej Pavelec	WIN	-.007	-.011	-.018	+3	E
21	Martin Brodeur	NJ	-.013	-.009	-.022	-3	+2
22	Corey Crawford	CHI	-.009	-.014	-.023	+1	-3

As you can see, Cam Charron was right to put Ondrej Pavelec’s season into perspective: he’s in the replacement level range as a starter, while Martin Brodeur and Corey Crawford were definitely at that level this year. Mike Smith didn’t get the love from the General Managers and missed out on a Vezina nomination, but the other three finalists were right up there as well.

For the most part, teams are employing starters that are above replacement level. Even Marc-Andre Fleury is a step above that level despite Chris’ takedown of him last month. Carey Price had a down year, but 10 points above replacement level overall is a decent showing in those circumstances (and average for a starting goalie).

I came up with another metric for evaluating goalies as well that I’ll introduce in a follow-up piece. I’m sure there is plenty of material on statistically evaluating goalies that will be coming this summer.

Talking Points