Jump to content
*** Cheltenham Festival Competition: Well done to 1st. Skittle, 2nd. glavintobuy & 3rd. Cragwood ***
** March Poker League Result : 1st muttley, 2nd andybell666, 3rd ian309 **
** Football Tipster Competition Result : 1st MABS, 2nd PercyP, 3rd Kingdom for, 4th Catijohn, 5th runadrum **
** March Naps Competition Result: 1st GARY66, 2nd TRAINMAD091, 3rd ZIDANE123, 4th MICK33. KO Cup Winner SILVER FOX, Most Winners: ALASTAIR **

Predict Premier League outcomes with optimal equations

Recommended Posts

In this article, I' will show how we can predict football outcomes by using optimal equations. We can apply this method to any football division, but we will use the England Premier League as an example.

The method includes three steps:

1.       We use the data from past years to build a new dataset containing variables with proved predicting ability. Then, we cut each variable (for now on, I'll refer to them as "predictors") into small groups and get the percentages for various outcomes within each group for the specific division (e.g., Premier League). By doing this, we can assess the predicting ability of each predictor.

2.       We fit a third-degree polynomial to our data and extract the coefficients, the correlation factor, the standard error, and the R-Squared values. Then we find the optimal equation for each outcome.

3.       Having the best equation for an outcome and the predictors' values for the new games, we can estimate fairly accurately, the probability of this outcome to occur.


I will skip the first step, messy and time-consuming, by providing the tables with the X-value and the Y-value for every predictor. I used data from 2006-2007 to 2019-2020, so our data sample is big enough (4.463 games).

Suppose we define as outcomes the expected goals number for the home and guest team. In that case, we can calculate the emergence probability of any possible combinations of outcomes on a football game. Then we can compare our fair odds with bookmakers' odds and, if we see a positive expected value, place a bet.

You will find the entire article here in Kaggle because it contains some coding in R-Language and it is impossible to run here.

I'll apologise for the inconvenience. I would like very much to have your comments in here.




Link to post
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

  • Create New...