Stats: Dropping Shots on Goal Out of the Picture
Last week I took a look at correlations between first half and second half rates of various statistics for MLS players to help explore the question of whether the rates were primarily driven by player skill or other factors. The upshot was that shot rate is pretty well correlated, but both accuracy (shots on goal / shots) and conversion (goals / shots on goal) were all over the place, indicating that factors other than player skill (which we'll call 'randomness') dominate those rates.
A number of questions follow from that initial look relating to total team rates, defensive (particularly goalkeeper) rates, and so on. But one I want to look at today is the possibility that using the Shots on Goal statistic as a component in both of the accuracy and conversion rates is accounting for some of the randomness.
First, goals come from shots on goal (mostly. Own goals can be an exception but are relatively rare). So it would seem to be true that Shots on Goal should correlate in some way with quality. But if you keep in mind the handy rule-of-thumb ratios that about 1 in 10 shots becomes a goal and about 4 in 10 shots are on goal, it becomes obvious that a majority of shots on goal (about 75%) are misses. And the implication of our previous analysis was that those misses are superior to misses that are not on goal and therefore correlate in some way to quality. But is that true? A hard shot from 20 yards out that rings off the bar is technically not on goal, but is more goal-dangerous (and indicative of offensive quality) than a slow roller to the keeper that is easily saved but is technically on goal. By more goal-dangerous, I mean that if you gave the same player repeated chances at the same shot, it would result in more goals on average, even though a higher percentage of them would be off target. So maybe introducing Shots on Goal into each of our ratios has simply corrupted any measurement of quality out of them.
Fortunately, there's an easy remedy for that. I can take a look at the Goals / Shots ratio and leave Shots on Goal out of it. Chris Anderson refers to this as the Reep Ratio after pioneering soccer analyst Charles Reep. Others refer to it as Conversion Rate, which we're already using. I'll call it the Strike Rate to add to and/or reduce confusion.
So I'm going to look at the Strike Rate for last season's MLS players who had at least 10 shots per half (to weed out sample size problems) and then correlate the values from one half to the next. Since player skill should hypothetically be fairly consistent over the course of a single season, then a high correlation is an argument that Strike Rate is governed by skill rather than more random factors. What do we get?
Yeah, it's still all over the place. The Pearson Correlation is 0.07, which isn't any better than the ratios that involve Shots on Goal. This bolsters our argument that players are largely distinguished by their ability to get shots off rather than any distinct ability to create more successful shots.
I also want to call out a couple of outliers on those results. Jeff Cunningham and Brian Mullan are extreme examples of players having their Strike Rates change dramatically from half to half. Cunningham had a first half Strike Rate of 3.7%. He scored four goals in the first half, but three were on penalties, so he had only one goal in the run of play out of 26 shots. But his second half rate was 29.2%, when he went on a tear along with the rest of the FC Dallas team. In contrast Mullan started off with a 33% strike rate but failed to score a goal in the second half of the season.
Of course, the notable name is Steve Zakuani. He had a very impressive first half strike rate of 27% and an even more impressive second half strike rate of 33%. That may be heartening to Sounders fans and fans of Zakuani, but remember that the theory we're developing here are that those ratios are largely out of the player's control. And since those ratios are off the charts compared to league average, I think it's likely we'll see them regress and unless Zakuani gets significantly more shots off than he did last season, he won't match last season's goal total of 10. If he maintains those ratios, it's an argument that he's so far off the charts quality-wise compared to other players in MLS that he defies this analysis. An important data point is that his Strike Rate so far this season is 14%, which is right around the league average.
At the other end we have Ryan Johnson, who last year was either uncommonly terrible or just unlucky. I'm going on the theory that he was unlucky and that he should see a significant increase in his Strike Rate (and public perception) this season if he's given the opportunity, which should keep San Jose from relying almost exclusively on Chris Wondolowski for goals.
I'll continue posting updates as the season goes on and next week I'll take a look at team totals to see if they behave the same as individual player values.
13 comments
|
1 recs |
Do you like this story?
Comments
Excellent analysis!
Love the analysis, but could you explain the chart a bit better? I don’t understand what each axis is, what those numbers on the graph mean.
I understand your percentages discussion, but don’t understand how the graph correlates to the discussion.
Thanks!
Right, it's strike rate (Goal / Shots) for each half of the 2010 season
Horizontal axis is the first half, Vertical axis is the second half. So Cunningham had a big improvement and Mullan had a big regression.
Nos Audietis
Readability Criticisms
Great analysis, but two comments for the charts.
1) Use axis labels (touched on in the first comment)
2) Keep the scales consistent. The number of pixels between 0.0 and 0.1 should be the same for both axes, especially when the units are the same. If I visually draw a 45% from the origin, I think Zakuani had a better first half. If I trace it based on the scale, I see it’s the other way around.
One other thought for a future post: the MLS has a lower level of talent than the European top flight leagues. Do the best in La Liga, Premier League, and Serie A have more strongly correlated numbers, or even “better” numbers?
Fan of: Cardinals, Blues, Sounders, Yellow Jackets, Wolverines, Rams, and Blazers.
by ColinMacLeod on Apr 6, 2011 5:31 PM PDT reply actions 1 recs
takeaways...
“(U)nless Zakuani gets significantly more shots off than he did last season, he won’t match last season’s goal total of 10…”
Someone hand Rosales and Friberg this analysis, they can do something about it. Does it “correlate” that the Sounders two big adds in midfield are set-up men. There’s your takeaway…
by P. Scott Cummins on Apr 6, 2011 6:45 PM PDT reply actions
Conclusion and commenting effort
I like the fact that this analysis shows the less useful nature of accuracy and conversion. The only problem is that it highlights the usefulness of shots without going deeper. This is problematic as it would just encourage players to shoot almost indiscriminately. StatDNA did some more work on this here: http://blog.statdna.com/post/2011/03/29/What-determines-shot-quality.aspx
Obviously, that analysis is tough and requires a lot of detailed data. I would just encourage you to mention this fact (don’t just shoot like crazy to optimize your offense). Otherwise, great work!
Also, commenting on your blog is quite difficult if you use the SportsNation method. I already had a SportsNation account, but it still complained that I didn’t allow tracking cookies, and even then I had to agree to “join” Sounders at Heart blog, just to comment.
Commenting
All SBNation blogs require that button push once.
You are all done with that from now until the entire network shifts
I am not a Supporter | I am not a Fan | I am a Sounder
Sounder At Heart
StatDNA
Is worth checking out for those that haven’t taken a look.
"But who would listen to Little old me anyway?"
-by thehemogoblin
by Little old me on Apr 7, 2011 10:49 AM PDT up reply actions
Good points
But I think we’re well away from the point where open soccer analytics are mature enough that players would actually change their game to optimize some analytic measure. No one’s saying that a silly shot from 50 yards away is equal to a shot in the box, but in the context of the domain that the stats are gathered in — that is, MLS games in which the players deserve roster spots and are actually trying to win — the conclusions hold. If some player started taking relentless low-percentage shots to optimize his shot stats, he’d lose his job and that anomaly would be self-correcting.
That said, Kansas City took an enormous number of shots last season, to the point where it was clearly systemic (either their formation, their coaching philosophy, or the postage stamp of a field they played on) and I’m going to look into that sometime soon. It’s interesting to note that their shots per game are only just above average this year, so maybe it was the park.
Nos Audietis
Value of Shots on Goal
Yeah, I agree. Take too many poor quality shots and you’re not going to play much. One thing that may still be up for debate is strike rate among regular starters. I’m more of an EPL fan (sorry, yes, eurosnob, etc etc), and this season is notable for the first American to score in double figures. Clint Dempsey has scored 10 goals, but has taken 93 shots to do so. As much as I’d like to believe he’s one of the best 10 strikers in England, his strike rate (goals/shots) will probably not rate so highly. Among the top 10 goal scorers, I believe he’s 9th in strike rate (and its not particularly close). His lack of quality (compared to the other high scorers) may be reflected in his propensity to take lots of shots (many outside the box, and somewhat poorly hit).
I used data here: http://scores.nbcsports.msnbc.com/epl/player_leaders.asp?category=202
Yeah, it's interesting that he's 5th in the league in shots
If you do that and you’re not terrible, you’re going to get a lot of goals. But don’t discount the skill it takes to get a shot off. A lot of his shots come from climbing out of scrums in the box to get a head on it or finding space at the top of the box, and both of those take skill against EPL defenders. But a 1/9 strike rate is pretty good. Better than Drogba’s.
Nos Audietis

by 


















