One of the bigger pieces of NBA analytics industry news over the offseason was Brian Kopp leaving STATS, Inc. to head up Catapult Sportās North American operations. Kopp had spearheaded the efforts to first get the SportVU system into all 29 NBA arenas. More importantly, from a public perspective, was his role in ensuring that access to a subset of SportVU-derived data was available on NBA.com.
Entering the second season of the SportVU era, I spoke with Ryan Warkins, Assistant Vice President of STATS and Koppās primary replacement, about the transition, changes and improvements in the underlying technology, adoption within the league and a hint of whatās going to be new for this year.
Q: Ā Are you the new Mr. SportVU?
Ryan Warkins: I prefer not to go by that term! Itās been a transition from me being the day-to-day support for [NBA] teams to me being out there to grow the business by both by supporting the teams and putting together our own [internal] team that can fulfill that support. Weāve added a new Basketball Manager. He [Charlie Rohlf] was actually a four-year manager at Duke under Coach K. Heās a world-class programmer as well. So his ability to know the game and then decipher and make sense of the data has been awesome in terms of our growth. Adding him to the team has definitely made my job and the transition [after Kopp left for Catapult] easier.
Ā
Having someone who āspeaks basketballā to interface with teams?
The things he can do to create new data points, instead of me trying to explain how something should look, he can just go ahead an do it.
Can you tell me more about the your current role as well as what is entailed by the ābasketball managerā role? How much of the league is actively taking advantage of that support?
My day-to-day role is overseeing the application development. Our web platform that people [inside the league] are using on a day-to-day basis to gain access to the data.
And then on the SportVU side, the underlying technology to push the product forward. How do we make this better?Ā How do we push the product forward in terms of the live environment and get access to the data sooner? I oversee that andĀ a team of three data analysts and Charlie.
Charlieās role is then to work with the developer on both [internal] teams, application and SportVU development to make sure heās bridging the gap between client [team/franchise] need and whatās actually possible from a development standpoint. So with Charlie and the analysts they support the daily calls from the teams. So if a team calls in and says āhey we want to run this kind of queryā Charlie or the analysts will field that call, will understand what they [the NBA team] is looking for and will work to get that data into the teamās hands.
In terms of use, Iād say about tw0-thirds of the league is actively using our web platform. And there is also a small subset of teams that are taking in our raw [XML] files and are doing the coding and analysis in-house. We donāt have a lot of daily interaction with them, we donāt really know what they are doing with the data. Maybe they are doing a lot, maybe not so much and they just donāt have questions for us but itās hard for us to pinpoint.
Ā
What are some of the improvements that have been made in the system itself in the last few months?
Weāve looked hard at our algorithms and how we can use more physics within the data to read more ācharacter flawsā looking at the jerseys.[1. SportVU identifies players on the floor by reading jersey numbers]Ā If the system reads a ā6ā of a jersey that looks a lot like an ā8ā crumpled up. So figuring out how that happens and patterns of movement we can go ahead and fix some of those in terms of the technologies a rely less on the operators.[2. Two individuals present at each game which assist the system by responding to prompts such as when the system is unable to identify players on the floor] Weāve made improvements to understanding what happens when players run by each other or bump into each other on the floor just making sure to understand that when a player is running full speed heās not going to [suddenly]Ā change direction. So understanding that when players cross paths keeping those paths consistent, things like that.
Weāve also been working with the league on workflow. Weāre hoping by January 1st to be able to produce everything by the end of aĀ game. Right now weāre delivering stuff the next morning to teams in terms of some of our more intensive algorithms. Weāre looking to move that forward to allow for teams to have access to our whole suite of data at the end of games. A lot of that will move downstream and will allow our live data to be that much more powerful.
Does this mean better real-time data to the fan or are you focusing primarily on the data to the teams?
Both. We have daily interactions with the team at NBA.com, working with their group to decide which data to expose to the public. Some of it is sensitive information. Certain [NBA] teams donāt want certain data points out there, so thatās always a balance. Thereās also the danger of too much information for groups of fans. There are the die-hard group that really wants to dive into this data, but how do we make this data interesting to someone like my father who might not even be a box-score guy? How do you make that jump and make it applicable? We could be watching a game and I could say āthe Bulls ran that ICE to perfection, really downed that pick-and-roll effectively, Hinrich did a great job fighting over the screenā thatās already too much information for my dad who wants to complain about people traveling. So how do you bridge that gap?
Ā
How much of a priority isĀ integrationĀ with the media, having SportVU-based metrics become part of the game presentation?
Thatās part of the overall goal. Again, weāre walking that tightrope both of whatās sensitive to the teams and what isnāt and also whatās applicable to the broadcaster. Each broadcaster is different.Ā Certain guys youāll get them the data and theyāll want to use it in their presentation. Some just want to talk about the game. Both can be interesting and compelling to the fan, itās just different styles.
What can we bring now that we have a full year of data? Itās all about context because some of these stats are so new. What if I told you this player had 54 touches in a game, what does that mean? Well now we have a full season of tracked data and have a better understanding of whatās good and whatās not. We use this as a baseline. Using team ranks or player rankings makes it much more digestible. Thatās how the media is going to use it and itās just a matter of finding out the proper workflow. I would say at the start of the season, broadcasters will probably be not as engaged, but itĀ will grow more and more as the season goes along.
Ā
Certainly some broadcast teams are much more open to dropping nuggets of quote-unquote āanalyticsā into their commentary than others?
When Iām talking to talent and coaches, I try not to use the term āanalytics.ā Itās data reporting. Itās stuff theyāve been manually tracking for years. Itās a simple as something like āhow many ball-screens did a player use in the course of a game?ā You start using a term like āanalyticsā and coaches think āyouāre trying to push an agenda on me, youāre trying to make a point using numbers and numbers only.ā
What I like to communicate is āweāve had this reporting structure for a long time. Let us help you evaluate what youāre already doing. If you then want to throw it in a regression on whatās important and how it impacts the game, we can do that. But you tell me the information you want, and weāll generate a report for that.ā Thatās how I try to communicate with coaches and broadcasters.
When it comes to communicating with front offices, itāsĀ a more strictly analytical play. Itās trying to communicate it in a way so that no one gets scared or threatened.
The A-word, ābasketball peopleā sometimes recoil if they think youāre trying to convince them of something with a spreadsheet rather than a tape cut-down, while what youāre offering is the best of both?
We do some research on the stuff to make sure itās statistically valid and weāre not just throwing statistics out there, but you have to know your audience. You mentioned people getting scared, but Zach Lowe says this all the time, when people rag on these āanalytic guysā, these guys all watch a ton of basketball. Theyāre watching, theyāre learning, theyāre listening to coaches and they are committed to learning the game the right way and then supplying an additional tool to that toolbox. All they [analysts] are trying to do is present more information.
Iām never going to go sit down next to Tom ThibodeauĀ and tell him how to run his offense. I want to learn what heās doing, why heās doing it and how I can help evaluate how effective itās been. Thatās the best way for it to work and thatās where I see us beginning to make ground and not be seen as āa threatā but rather be seen as someone who can help them make better decisions.
Ā
So in the way Zach likes to put it, how do the āgeeksā talk to the ājocks?ā Well the āgeeksā have to speak ābasketballā not the other way around is the only way it will work?
Thatās definitely the case. Youāve got changes in front offices, Ben Falk going from Portland to Philadelphia, so obviously they [analysts] are becoming more influential.
I wouldnāt want a team of all data analysts who never played the game. Thereās great value in that āold schoolā mentality. But maybe thereās things you miss by just watching tape, and maybe we can help with analytics and data to help you watch tape differently, make it more efficient. Maybe you only need to watch a 6 or 7 minute clip of what Carmelo is doing on the floor. If youāre trying to key on his tendencies maybe we can point you towards those tendencies and make your workflow much better.
Ā
Having been knee deep in this data for several years now, what are some things that most surprised you and most changed the way you thought about things?
Thatās a good question. let me think about it some and weāll come back to it.
Ā
In terms of teams āpushing backā on the public data, what goes into that balancing act? Is it teams being very circumspect or teams being protective of the questions they are asking so as to not give their hands away?
We are very protective of teamsā data requests. If for example [two teams] came to us with the same request, weād never say āwell we have this query we just ran for Team X available.ā Weāll wait for them to ask for it specifically.
Now there is a laundry list of things that general basketball minds would agree are some of the next steps theyād like to see, and those weāll just go ahead and develop. If seven different teams have asked us about a particular type of data, weāre going to develop that and make it available for everybody. We walk that tightrope all the time.
From day one, the team business has driven all the data creation. So weāve done it from a team standpoint and then we look at it and see what can we adjust and what can we water down from the fan perspective. You might want to see everything the same way a team sees it, but not everybody does. How do we make it digestible for the casual fan?
If you give a broadcaster, now that we know about various pick & roll coverageĀ āĀ hard hedge, soft hedge, blitz, ādownā under, overĀ -now that we know that, thatās awesome information. But thatās a three minute segment to explain all of itĀ just to integrate it into a broadcast. So how do we break that down into maybe a 15-second bit that might last a whole possession? If you canāt get it in in 10-15 seconds and the casual fan doesnāt already know it, they probably arenāt going to use it. So itās a matter of building up that portfolio of knowledge for things that can be used on a daily basis.
If I can go back to the question you asked before
About things that have surprised you?
Yes.
One of the things we looked a lot last year was pick-and-roll coverages. At that point, we were only able to identify the four players involved: the ball-handler, the screener, the on-ball defender and the screen defender. What was really interesting is that we were looking at Noah and Hinrich [compared to] Hinrich and Boozer, Hinrich and Boozer were a ābetterā combination in terms of defending the pick-and-roll on a points per possession basis, We thought āhow can this be?ā Joakim Noah is a much better defender, you have to watch 10 seconds of basketball to know that.
So when we looked at it, who is the other big on the floor? When it was Boozer [defending the screen] it was Joakim Noah [as the other big on the floor.] Noah was more impactful when heās loading up the strong side on āICEā coverage than when heās put in a ball-screen. [In ICE} he can defend those next two passes, so heās more important than when Boozer is just trying to force [the ball-handler] to the sideline and keep him out of the middle.
So it was really interesting to think about, how does that work for other teams? Do you want to put the best [defensive] big man in the ball-screen so you weaken the help-side defense? How do you do that strategically? And we went from thinking about just two players to thinking about how the whole team worked together defensively.
The other cool thing is to come up with these algorithms we create, we watch tons and tons of tape [to check accuracy]. So weāre asking do coaches play that big a role? So when youāre grading algorithms[3. A process where the team manually charts a number of games to test the accuracy of a given query to identify if it is capturing all the desired information while not picking up extraneous āfalse positiveā data], you watch how the Bulls play pick-and-roll, they play ādown.ā They go over the screen. So itās a lot of āICEā combinations. We were watching a Bulls-Wizards games and the Wizards were doing the same thing in pick-and-roll coverage, it was always ādownā, ādownā, ādownā pushing [the ball-handler]Ā sideline.
Middle of the third quarter, I noticed something weird. [Washington]Ā was switching all the time now. There was an offensive adjustment, because [the Bulls]Ā werenāt getting a good relationship between the two players in a traditional little-big pick-and-roll, they started going wing-wing pick-and-roll. The interesting thing was, when those combinations changed, they [the Wizards] changed how they were defending pick-and-roll. They went from down and ICE everything to switching everything just in the course of the game. And you see that when youāre grading it and watching it at that level of detail, you start to see the impact that coaches can have on the game, the nuance.
Ā
There has been a fair amount of academic work using SportVU data, is that something STATS is still involved with?
There are still some groups out there using the data. Weāre working more closely with them this year to know how they are using the data. Itās still a big part of the community and itās great exposure for the data and the system to be used in that way. To give people a chance to play with the data and advance the game of basketball.
But weāre more focused on what is applicable. A lot of that stuff [academic research] is really great, but we have to make sure itās actionable intelligence in a decision-making process. How can people use it and actually make the game better rather than just being a theory?
Charlie is a lot more dialed into that community, he wants to get his PhD, he has that academic background. Personally, thatās less interesting to me, I get more excited about pushing the team business forward and integrating with the media, whereas Charlie is really interested in the pure research side of it, using machine learning techniques and data mining technologies.
Ā
So youāre less interested in āsolvingā basketball than in more āad hoc usefulā analysis?
More or less.
Ā
What can you say about the research going on out there?
I wouldnāt want to steal their thunder. But think about defense and the stuff Kirk [Goldsberry] is working on, those are the big ones.
Ā
Is the defense research along the lines of the āghost playerā stuff Zach wrote about in Toronto a few years ago?
Probably not to that level. What you have to remember was that wasnāt done as pure research. If you were doing it as research, you wouldnāt have access to the coaches. Thatās what gets a little lost even to this day. People think it was just computer programmers that went and did that, but whatās actually the case is those were really smart guys fortunate enough to have a front office that was really into data and analytics so they were able to get access to coaches to program that around their schemes.
For us [STATS] to do that, that gets back our earlier conversation about me telling you how to play defense instead of letting me help evaluate how well youāre playing defense.
Ā
Is the SportVU system progressing into the college game much this season?
Weāre trying to approach the problem from the conference level, so you get both teams involved in every game. The value becomes exponential when you add larger sample of data and I think thatās what we saw with the NBA this past year. So the question is how do you get the entire ACC or SEC to jump on board so we get more data and it applies more places than just Dukeās 20 home games.
Ā
Are there conferences coming online this year, or is that an in-progress initiative?
In-progress.
Ā
I would imagine that would have implications for predraft rankings and scouting.
Yeah, thatās when you know people are using the data is when they start asking āhey what about college dataā then you know they are measuring player performance based on that data and they think theyāve found something so they want to measure incoming players on those metrics.
Ā
How frequently do you get those types of questions?
Quite a bit.
Ā
Is that the next thing people say āthis is cool. When do I see it for college players.ā
Yup. And then you ask ābut are you willing to pay for it?ā and the answer is usually āof course not, but I really want itā
Ā
So youād say two-thirds of the teams are fairly actively using the system?
That other third breaks down about half of that third is doing everything in-house and then the other half, everyone has access to the web plaftorm and a lot of teams are heavy users, but some teams just want custom excel-based reports that we send to them after every game, and we work with them that way. They donāt want to log in to look at something, but they are happy to look at in a Word or PDF format.
Ā
Brian and I talked about the partnership with Catapult last spring and now that heās working there is that progressing?
Obviously, I have a good relationship with Brian and we continue to work together on a daily basis. Itās really merging best-of-breeds in terms of practice tracking and in-game performance metrics. Itās measuring athlete load and athlete performance, how do we manage that over the court of the season. LeBron going to Cleveland, how do they manage his minute. More importantly how do they manage his athletic load? If you compare a guy like LeBron with Ā Luol Deng, they could do the exact things on the floor, thatās still going to be harder on LeBron because of his sheer size.
Unless youāre Gregg Popovich and you just do whatever you want, youāre probably not going to be able to influence LeBronās playing time ā coaches are putting their lineups together to win every game. So how trainers and strength and conditioning coaches use practice to learn about how to we best prepare players for the next game? And now with Catapult and SportVU youāre able to manage that across the full infrastructure.
So what can you tell me aboutĀ stuff thatās new this year?Ā
I canāt talk too much about it because we havenāt finalized our plans for the new things coming on NBA.com this season. Hopefully it weāll be doing more and more over the course of the season and not just releasing it all at the start of the season and having it be static over the course of the season.
Ā
Can I make a date to talk again closer to the season to talk again once that becomes finalized? From a fan perspective, a walk through the new goodies is something people would be interested!
We could do something closer to the season. I can tell you the biggest differences are that itās going to be more visual and more interactive. Quick snippets that people can share easily. Making it much easier to go to a player page and look at player tendencies and play types. Making it easier for player comparisons and stuff like that.