Statistical Approach to Formations
As our own malcontentjake and countless others have demonstrated repeatedly, the numerical shorthand for soccer formations which we're all accustomed to (e.g. 4-4-2) is almost completely useless. Whether you have a 'band of 4' defenders or a 'band of 5' is much less important than what those defenders are doing at any given time. And the inherent assumption of symmetry doesn't map well to a world in which one fullback might be more attacking than another, one winger might cut to the middle more than the other, or one forward might play in front of — rather than next to — another.
Fundamentally, a formation is just a shorthand description of where the players tend to be on the field. So what better way to determine the formation than to simply keep track of where the players are? Thanks to the detailed statistics that Opta is collecting for MLS matches, we can do just that. Though the data doesn't include every player's location at every moment, it's granular enough to include the location of every pass, tackle, shot, interception, and other on-ball event. While there is a lot to playing a position that doesn't include any of these events, I think it's fair to say that if a player spends any significant time in an area of the field, he will tend to engage in those activities there as well.
So if we take every tracked event that a player was involved in and map it onto the field, we can see where that player played. This is exactly how the heat maps on the league site are generated. And if we plot the mean location of those events and call that location the player's 'formational position' we can see where the entire team is playing, on average. Here is an example from the final Seattle Sounders regular season game last season — versus Chivas USA:
Here each individual event is marked with a small dot and the mean location for each player is marked with a larger dot. And what we see is a pretty good match to what you'd intuitively expect the Seattle formation to be. The fullbacks are pushed ahead of the central defenders, the wingers largely stay wide, and Osvaldo Alonso is positioned centrally as a defensive midfielder behind the more offensive Brad Evans. It's worth noting that Mauro Rosales didn't last very long in this game after his knee was re-injured, and yet his few events were still enough to locate him accurately in the formation.
Something of a surprise (at least to me) is that forwards Fredy Montero and Mike Fucito are paired side-by-side. We generally think of Montero as a more withdrawn forward (or a trequartista) behind an advanced target man. But Fucito's an unlikely target man given his stature and speed. Here's another example, from the first match of the season against the Los Angeles Galaxy:
Now we see Montero playing behind a more advanced O'Brian White, who closer fits the mold of a target man. This version also shows Steve Zakuani before his season-ending injury. It seems to suggest that he cuts into the middle often enough to make himself a central midfielder as much as a winger. But while it's true that he does cut into the middle frequently as an inverted winger, his central location here is somewhat due to the fact that he frequently switches wings during a game. Here's a view of the isolated Zakuani data from that match:
You can see some events in the middle of the field, but much of the central skew is due to a number of events on the complete opposite flank. That's a weakness of this model, in that it doesn't differentiate between a player moving within the current formation and a the team moving into a different formation by switching the sides of the wing players or otherwise moving a player into a new position. That's something I hope to work on, but I expect we'll be incorporating these quick formation images into our game previews and recaps during the coming season.
35 comments
|
9 recs |
Do you like this story?
Comments
These are great.
Thanks for sharing and I would love to see them as the year goes on.
On a side note, the nomenclature surrounding positions and formations is essentially broken. People use the same words to mean different things. No solution, just a complaint.
On the left sidebar
you can find some posts by authors here about how we use nomenclature in regards to formations and roles. We try to have the staff be consistent about this.
I am not a Supporter | I am not a Fan | I am a Sounder
Sounder At Heart | Follow Dave on Twitter @bedirthan
Maybe I should redeploy the tableau widget for formations?
I am not a Supporter | I am not a Fan | I am a Sounder
Sounder At Heart | Follow Dave on Twitter @bedirthan
by Dave Clark on Feb 9, 2012 5:17 PM PST up reply actions 4 recs
Hey Dave,
Can you post a direct link to that post or a post. I`d love to read it and couldn`t find it on my own.
So does SBnation give you access to Opta?
Sorry about that
we tagged too many posts as “featured”
http://www.sounderatheart.com/2010/2/20/1319217/on-formations
http://www.sounderatheart.com/2010/3/12/1369583/on-roles-within-formations
I am not a Supporter | I am not a Fan | I am a Sounder
Sounder At Heart | Follow Dave on Twitter @bedirthan
I think we're analyzing data collection artifacts...
As you pointed out, these mappings are weighted by touches, not time, or time as a threat. I don’t think it’s very surprising that Fucito’s average postion (by touches!) is back near Montero’s. This chart’s showing us more that our forwards get more touches 30 yds out than in the box, not so much the shape of our formation.
I like the info a lot
…I just don’t think it’s a good representation of our shape.
True enough
But I think it’s a problem specific to forwards, who are probably the only position on the field that do most of their playing in an area where they won’t get as many touches (or tackles, fouls, etc). And it’s worth noting that we calculate White playing further forward than Montero or Fucito, which is what you’d expect. So we can differentiate vertical positioning, even if the ‘true’ position would be further forward.
Nos Audietis
Forwards are also pulled 'back'
due to starts at half and restarts after goals.
I am not a Supporter | I am not a Fan | I am a Sounder
Sounder At Heart | Follow Dave on Twitter @bedirthan
Yeah, I should probably boot out the initial few touches after a restart
but I’m not sure it skews it too much when you compare it to the total number of events a forward has during a game.
Nos Audietis
I think that speaks more to White's lack of overall touches
Normal handling seems like it would really quickly overshadow the low-touch more dangerous positioning. So OBW isn’t showing as a TF so much as he’s showing that he’s not a midfield connector.
Sample size is certainly key.
Hard to gauge OBW with so few games worth of data.
by DaveValleDrinkNight on Feb 9, 2012 5:29 PM PST up reply actions
Your ideas are intriguing to me and I wish to subscribe to your newsletter.
Awesome. Pure, unadulterated awesome.
Boo! Tomorrow AM. I want all things now!
by Perrinbar on Feb 9, 2012 5:18 PM PST reply actions 1 recs
By the way
if you like a post or comment
rec:
like
tweet
All easy to do and great ways to share the love
I am not a Supporter | I am not a Fan | I am a Sounder
Sounder At Heart | Follow Dave on Twitter @bedirthan
If I tweet something, it reaches a guaranteed audience of me. So I tend to stay away from that. I suppose I can make my way all the way to the rec button. This time.
Boo! Tomorrow AM. I want all things now!
by Perrinbar on Feb 9, 2012 5:24 PM PST up reply actions 1 recs
This is pretty interesting
I think seeing the opposition “heat maps” with our team may give more insight. Is our formation dictating the game our is theirs? But still pretty awesome. Thanks.
by Colin Johnson on Feb 9, 2012 5:18 PM PST via Android app reply actions
During the Man U game I noticed their players were wearing what looked like tracking units under their jerseys.
Not sure if thats what those were but it would be interesting to see movement throughout the entire game. Not just touches but the build up runs as well.
Would seem to tell an even more accurate story.
by DaveValleDrinkNight on Feb 9, 2012 5:27 PM PST reply actions
GPS tracking units to be specific
Article on its use in the Telegraph last year.
You will hear us on Brougham, you will hear us on Occidental, you will hear us on King. Our yachts are all around you, there is no escape.
A tech like this is being added for training
I am not a Supporter | I am not a Fan | I am a Sounder
Sounder At Heart | Follow Dave on Twitter @bedirthan
This is great
I like pointing out the problem with using mean on a player like Zakuani, and I like pointing out that the first “4” in 4-4-2 or whatever, doesn’t really mean anything. It’s ALWAYS “4”, but it’s also almost always 2- and something in real life, as the two central defenders hang back and the corner backs move up, sometimes almost into a wing forward position (unless they go to 3 in the back). Your first example is 2-3-3-2, your second is 2-4-3-1, but no one will ever call it that. I also think that a lot of teams play in such a way that the “lines” are diagonal, not straight across. Interesting stuff.
Not always
I’ve seen teams go to a 3 man back line when pushing late – a 3-4-3, for instance. Certainly not a starting formation, though.
by foolsgambit on Feb 9, 2012 6:45 PM PST via Android app up reply actions
I mean, it could be a starting position, but it’s rare enough that I can’t think of a major team that currently uses it. I think Barca used it in the Cruyff days.
by foolsgambit on Feb 9, 2012 6:51 PM PST via Android app up reply actions
That's what I meant
When I said “unless they go to a 3”, which was poorly worded. What I meant was “rarely teams will play three in the back, but it’s almost always four, but almost always four really means two”. What I was getting at is that everyone calls it four, even when it very often isn’t.
I love this
Also the graphics reminds me of mid 90s DOS strategy games, which are also awesome.
by Tohoya on Feb 10, 2012 12:03 AM PST reply actions 1 recs
We have no left?
What I find most interesting about these mean touch maps, if you will, is the fact that our entire formation leans right.
Granted, Rosales is great to get on the ball – but I wonder if we can’t utilize some more build up on the left this year.
Eternal Blue, Forever Green. Sounders 'Til I Die.
by Michael Wiegand on Feb 10, 2012 1:07 AM PST reply actions
More speed on the left
With players like Zak and Neagle relying more on their speed to “blow by” defenders than their right-sided counterparts, who more often progress the ball up the field with quick, short passes, less events are generated on that side of the field. That will tend to overstate how unbalanced our buildup is.
Just a thought.
"But who would listen to Little Old Me anyway?"
-by -Dave Clark
and -thehemogoblin
by Little old me on Feb 10, 2012 9:32 AM PST up reply actions
A dribble is a tracked action
also, while the mean shows Zakuani more to the left, again look at the Zakuani only image and his left side actions are quite wide.
I am not a Supporter | I am not a Fan | I am a Sounder
Sounder At Heart | Follow Dave on Twitter @bedirthan
I think that's natural
so to speak. If 7-10% of the population is naturally right handed/footed then it’s not surprising to see teams stronger on the right side where the body can shield defenders from the ball while on their stronger foot. Also, this means advancing versus the opposition’s left, or weaker, side (you often hear of England’s “problematic left back position” but rarely does a team lack for adequate right-sided defenders). A visual display like this is probably most useful if you factor in that the right side will most likely be the stronger, more frequented side.
That said, as you point out one take-away is probably how valuable an increased left-sided attack may be (think choosing a left-handed pitcher in baseball).

by 




















