Fantasy Premier League Machine Learning

Repository for preparation, analysis, model fitting and prediction for fantasy premier league data.

Credit to https://github.com/vaastav/Fantasy-Premier-League for building a fantastic data resource.

Project aims

First stage of the project is to train models and perform model selection, predicting player scores for each game week of a premier league season. We are only using features that are available on https://fantasy.premierleague.com. We take a model agnostic approach to the models we use, this repo will support scikit-learn and pytorch models.

Second stage of the project is to implement heuristic algorithms that build teams within the game restrictions to maximise the expeceted points of a team for either short-term or long-term gains.

We aim to provide a platform for fast experimental iteration and model comparison using a combination of pytorch-lightning for defining standardised datamodules and dataloaders, hydra-zen for config management and boiler-plate free hyperparameter optimization with Optuna or Hyperopt. All experiments are logged in a standardised fashion using MLflow

Quick start

Clone repository https://github.com/vaastav/Fantasy-Premier-League which contains the raw data
Edit fpl_ml/user_config.py setting the directories for the repo set in (1.). Output directory where we will write processed data, and the mlruns directory where experiments will be logged to.
Run python run_prepare_data.py, this takes the raw data from vaastav/Fantasy-Premier-League and processes it into tabular data. In the prepared data, columns of features prefixed with X_ and the target variable is total_points
Run python run_train.py. Currently runs a hyperparameter grid search for RandomForestRegressor and GradientBoostedRegressor.
Above your mlruns directory run mlflow ui. This will launch a local server for visualising the experiments that have been run.

Example output

Example below shows performance on validation set for predicting player scores for the 2019-2020 season. Each datapoint is the number of points a player got in a given game week. The purple line is the identity (x == y). Datapoints can come from any season. Future efforts will hold out a season for the validation set and a season for the test set.

Current features

Data preparation
Train sklearn models
Logging with mlflow
Hydra-zen configs

TODO:

Unittesting
Pytorch support
Heueristic simulations for building teams
Containerisation

Name	Name	Last commit message	Last commit date
Latest commit behzadk Linting (#32 ) Feb 4, 2024 1ed5ee2 · Feb 4, 2024 History 58 Commits
configs	configs	Config update (#29 )	Feb 4, 2024
fpl_ml_projects	fpl_ml_projects	Linting (#32 )	Feb 4, 2024
ml_core	ml_core	Linting (#32 )	Feb 4, 2024
.gitignore	.gitignore	update gitignore (#21 )	Nov 11, 2023
LICENSE	LICENSE	Initial commit	Oct 28, 2023
README.md	README.md	Update README.md	Oct 29, 2023
environment.yml	environment.yml	overhaul: entry to train, entry configs, stores (#27 )	Dec 17, 2023
run_evaluation.py	run_evaluation.py	Evaluation and simulation entry points (#31 )	Feb 4, 2024
run_prepare_data.py	run_prepare_data.py	Linting (#32 )	Feb 4, 2024
run_prepare_demo_data.py	run_prepare_demo_data.py	overhaul: entry to train, entry configs, stores (#27 )	Dec 17, 2023
run_simulation.py	run_simulation.py	Linting (#32 )	Feb 4, 2024
run_train.py	run_train.py	overhaul: entry to train, entry configs, stores (#27 )	Dec 17, 2023
user_config.py	user_config.py	Refactor data preparation (#19 )	Oct 29, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fantasy Premier League Machine Learning

Project aims

Quick start

Example output

Current features

TODO:

About

Releases

Packages

Languages

License

behzadk/fpl_ml

Folders and files

Latest commit

History

Repository files navigation

Fantasy Premier League Machine Learning

Project aims

Quick start

Example output

Current features

TODO:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages