ENG Historical Football Dataset – 2020-2021



Exportable to Excel, this English Premier League team dataset includes match-by-match statistics and betting odds for every football match played. Stat columns cover offense and defense: goals scored, total number of shots, expected goals (xG), tackles, interceptions, cards and many more. Betting columns cover moneyline odds, over/unders, and Asian handicap, alongside Elo ratings for each team. Match-specific information such as each team's starting eleven, the referee and the venue is also included.


1. Which leagues are covered in the football/soccer datasets at BigDataBall?
Our dataset currently focuses on the “Big 5” European leagues: the English Premier League, German Bundesliga, Italian Serie A, French Ligue 1, and Spanish La Liga. It includes detailed team-level match statistics exclusively for league matches in these top-tier leagues.

2. What type of data is included in the dataset?
Our datasets include all main statistics for each football match in the covered leagues, starting from the 2019-2020 season. This covers offensive and defensive stats: goals scored, total number of shots, expected goals (xG), tackles, interceptions, cards and many more. It also features betting-related information such as moneyline odds, over/unders, and Asian handicap, alongside Elo ratings for each team. Finally, it has match-specific information such as team lineups, referee and venue.

3. How are the betting odds in the dataset calculated?
The odds in our dataset are closing odds, calculated as the average of the odds offered by top bookmakers.
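
As a rough illustration of the averaging described above, here is a minimal Python sketch. It assumes decimal odds and a plain arithmetic mean; the bookmaker names and prices are made up for the example and are not taken from the dataset, and the exact bookmakers and averaging method used are not specified here.

```python
from statistics import mean

# Hypothetical closing decimal odds (home, draw, away) from three made-up bookmakers.
closing_odds = {
    "book_a": (2.10, 3.40, 3.60),
    "book_b": (2.00, 3.50, 3.80),
    "book_c": (2.20, 3.30, 3.70),
}

def average_odds(odds_by_book):
    """Arithmetic mean of each outcome's decimal odds across bookmakers."""
    # zip(*...) regroups the per-bookmaker tuples into per-outcome columns.
    columns = zip(*odds_by_book.values())
    return tuple(round(mean(col), 2) for col in columns)

avg_home, avg_draw, avg_away = average_odds(closing_odds)
```

With these made-up prices, each outcome ends up with a single averaged price that sits between the lowest and highest bookmaker quote.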

4. What is the Elo rating mentioned in the dataset?
The Elo rating in our dataset represents the team’s Elo rating at the start of each match. Elo is a widely recognized metric that measures the strength of a team based on its past results. It is valuable for understanding team performance trends and is used extensively in sports analytics and betting.
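
To show how pre-match Elo ratings are typically used, the sketch below applies the textbook Elo expected-score formula to two ratings. This is an assumption for illustration: the ratings are hypothetical, and the Elo variant behind the dataset may differ (for example, by adding a home-advantage adjustment).

```python
def elo_expected_score(rating_a: float, rating_b: float) -> float:
    """Textbook Elo expected score for team A against team B.

    A win counts as 1, a draw as 0.5 and a loss as 0, so this is an
    expected score rather than a pure win probability.
    """
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))

# Hypothetical pre-match ratings, not taken from the dataset.
home_elo, away_elo = 1850.0, 1700.0
home_expected = elo_expected_score(home_elo, away_elo)
away_expected = elo_expected_score(away_elo, home_elo)
```

The two expected scores always sum to 1, and two equally rated teams each get 0.5.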

5. What do “Expected Goals (xG)” and “Expected Assists (xA)” mean in the dataset?
Expected Goals (xG) is a statistical measure used to assess the quality of scoring opportunities. It assigns a probability to each goal-scoring chance, indicating how likely it is that the chance would be scored. A high xG value suggests a high likelihood of scoring. This metric helps understand how many goals a team or player should have scored on average, given the quality and quantity of the shots taken.
Expected Assists (xA) measures the likelihood that a given pass will become an assist. It considers factors such as the type of pass, the location from where it was made, and the subsequent actions of the receiver. xA provides insight into a player’s playmaking abilities, indicating their effectiveness in creating goal-scoring opportunities for teammates.
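
As a small worked example of the xG definition above, the Python sketch below sums per-shot scoring probabilities into a match-level xG figure and compares it with goals actually scored. The shot values and goal count are hypothetical and chosen only for illustration; the same summation applies to per-pass xA values.

```python
# Hypothetical per-shot xG values for one team in one match (made up for illustration).
shot_xg = [0.05, 0.30, 0.76, 0.10]

def total_xg(shots):
    """Sum per-shot scoring probabilities into a match-level xG figure."""
    return sum(shots)

match_xg = total_xg(shot_xg)
goals_scored = 2  # hypothetical actual goals
# A positive difference means the team scored more than its chances suggested.
xg_difference = goals_scored - match_xg
```

Here the four chances add up to roughly 1.21 expected goals, so a team that scored twice would have outperformed its xG in this match.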