MLOps Pipeline

This repository marks the official start of my MLOps journey. Instead of focusing on a single piece of production-grade machine learning, we will build a full end-to-end pipeline.

We will train simple regression models on the NYC taxi ride dataset and build an MLOps pipeline that covers model training, hyperparameter optimization, experiment tracking, orchestration, deployment, and monitoring. This repository is inspired by the mlops-zoomcamp course by DataTalks.Club.

Since the MLOps tool landscape is very wide, there will be follow-up work on this with various tech stacks.

Tech Stack

Notes

Setting up a VM on GCP
Dataset
MLflow Experiment Tracking
MLflow Experiment Tracking on GCP
Workflow Orchestration with Prefect
Model Deployment as a web service with Docker, Kubernetes, and GKE
Model Deployment with model from model registry
Streaming Model Deployment (Online)
Batch Model Deployment (Offline)
Scheduling batch scoring jobs with Prefect
Monitoring and debugging with Evidently

Setup

Install requirements

conda create -n mlops-orbit python=3.9
conda activate mlops-orbit

pip install -r requirements.txt

For remote VM

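Forward the MLflow tracking server port, which listens on 0.0.0.0:5000 on the VM.

Forward the Jupyter port (127.0.0.1:8888) if you are using it.

Forward the Prefect server port (127.0.0.1:4200).

You can also configure the forwarding in ~/.ssh/config. The example below maps the MLflow port to local port 5001: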

Host gcp-mlflow-tracking-server
    # VM public IP
    HostName xx.xx.xx.xxx
    # VM user
    User pytholic
    # Private SSH key file
    IdentityFile ~/.ssh/mlops-zoomcamp
    StrictHostKeyChecking no
    # MLflow: local port 5001 -> port 5000 on the VM
    LocalForward 5001 0.0.0.0:5000
    # Prefect server
    LocalForward 4200 127.0.0.1:4200
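
If you prefer not to edit ~/.ssh/config, the same tunnels can be opened with a one-off ssh command. The sketch below only builds and prints that command so you can inspect it first. The helper name build_tunnel_cmd and the VM_IP, VM_USER, and KEY_FILE variables are illustrative assumptions, and forwarding Jupyter on local port 8888 is a choice made here rather than something the config above defines.

```shell
# Placeholders: override these in your environment before running.
VM_IP="${VM_IP:-xx.xx.xx.xxx}"                       # VM public IP (placeholder)
VM_USER="${VM_USER:-pytholic}"                       # VM user
KEY_FILE="${KEY_FILE:-$HOME/.ssh/mlops-zoomcamp}"    # private SSH key file

# Hypothetical helper: prints the ssh command instead of running it.
build_tunnel_cmd() {
  # -N: forward ports only, do not run a remote command
  # -L local_port:remote_host:remote_port
  echo ssh -N -i "$KEY_FILE" \
    -L 5001:0.0.0.0:5000 \
    -L 8888:127.0.0.1:8888 \
    -L 4200:127.0.0.1:4200 \
    "$VM_USER@$VM_IP"
}

# Dry run: show the command that would open the tunnels.
build_tunnel_cmd
```

Once you run the printed command, MLflow should be reachable at http://localhost:5001 and Prefect at http://localhost:4200 for as long as the ssh session stays open, assuming both servers are running on the VM.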

