WNBA play-by-play stats are offered in CSV format that has all events occur in a game. Additionally, it includes active lineups, shot distances and shot locations in X, Y coordinates. You get individual game logs and a combined file for the season which includes all games from regular season and playoffs.
Each season dataset comes up with 2 types of files: 1) Individual CSV files for all games played in the regular season and the playoffs. 2) A season-to-date CSV file where all CSV game files are combined. Having this file, you can analyze the whole season stats in one sheet. In brief, our database-friendly (each play presented in a row) log includes every in-game movement such as: “Active players on the court”, “event time (remaining/elapsing)”, “play length & id”, “activity type (substitution/shot/free throw/turnover/foul committed & drawn/rebound/assist/jump ball etc.)”, “shot location” and “shot coordinates”.
Open the sample dataset where descriptions for all columns have been already provided. Keep in mind that, those descriptions do not appear on the season game logs, so we recommend you to keep the sample file easily accessible until you get along with our play-by-play logs.
To give you an idea: Inside the combined CSV file of 2018 season, there are 220 individual games and 85,732 rows; which makes ~390 rows per game. The size of 2018 complete dataset is 57.3 MB and becomes 5.3 MB when zipped.
WNBA basketball court is 50 feet wide, 94 feet long.
Please read the column descriptions on the sample file to get a more detailed explanation of coordinates.
Why do the logs have 2 different types of shot coordinates?
We provide shot coordinates in two versions:
1) Original coordinates (belongs to a specific x-axis, y-axis system) that are also being used by the WNBA official source.
2) Converted coordinates that reflect real basketball court dimensions.
We have developed a proprietary algorithm in which, substitutions and the relevant game events (points, fouls, getting fouled, assist, steals and etc.) that are assigned to players are taken into account. Note that; despite being a very rare situation, if the player does not record anything or did not take part in any game event while he’s on the court, the chances are our algorithm might not be 100% accurate.