This is an experimental attempt to create a production-ready pipeline for fine-tuning language models using LoRA (Low-Rank Adaptation) with MLflow tracking and Prefect orchestration.
This project implements a complete workflow for fine-tuning the Qwen2.5-1.5B-Instruct model using LoRA, tracking experiments with MLflow, and serving the fine-tuned model via a FastAPI endpoint. The pipeline is orchestrated using Prefect and can be deployed on Kubernetes.
## Tech Stack

- Python 3.12
- Qwen2.5-1.5B-Instruct: Base language model
- PEFT/LoRA: Parameter-efficient fine-tuning technique (see the sketch after this list)
- MLflow: Experiment tracking and model registry
- Prefect: Workflow orchestration
- FastAPI: Inference API
- Docker: Containerization
- Kubernetes: Orchestration platform
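As a rough illustration of how LoRA plugs into the training code, here is a minimal PEFT sketch. The rank, alpha, dropout, and target modules below are illustrative assumptions, not the hyperparameters this project actually uses.

```python
# Minimal LoRA setup with PEFT -- hyperparameters and target modules are
# illustrative assumptions, not the values used by this project.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-1.5B-Instruct")

lora_config = LoraConfig(
    r=16,                     # rank of the low-rank update matrices
    lora_alpha=32,            # scaling factor applied to the update
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)

model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the small adapter matrices train
```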
## Project Structure

```
.
├── data
│   └── dataset.jsonl   # Training dataset with synthetic data
├── src
│   ├── finetuning      # Fine-tuning scripts
│   ├── inference       # Inference API
│   └── workflows       # Prefect workflow definitions
└── models              # Saved model artifacts
```
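`data/dataset.jsonl` holds the synthetic training data. Its exact schema is not documented here; the record below is a hypothetical instruction-tuning shape, shown for illustration only.

```python
# Hypothetical record shape for data/dataset.jsonl; inspect the file for
# the actual field names before relying on this schema.
import json

record = {
    "instruction": "Summarize the following text.",
    "input": "MLflow is an open source platform for the ML lifecycle.",
    "output": "MLflow helps track experiments, package code, and deploy models.",
}

with open("data/dataset.jsonl", "a", encoding="utf-8") as f:
    f.write(json.dumps(record) + "\n")
```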
## Prerequisites

- Python 3.12+
- Docker
- Kubernetes cluster (optional)
- MLflow server
## Local Setup

```bash
# Start the MLflow tracking server
mlflow server --host 0.0.0.0 --port 5000

# Install Python dependencies
pip install -r requirements.txt
```
```bash
# Start the Prefect server
prefect server start

# In another terminal, create a work pool
prefect work-pool create process-pool --type process

# Start a worker for that pool
prefect worker start --pool process-pool

# or, if using a Kubernetes work pool instead
prefect worker start --pool k8s-pool --type kubernetes
```
```bash
# Run the fine-tuning flow
python src/workflows/fine_tune_flow.py
```
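For orientation, here is a minimal sketch of how a Prefect flow such as `src/workflows/fine_tune_flow.py` might be structured. The task breakdown and names are assumptions; only the flow name `LLM Pipeline` is taken from the deployment commands further below.

```python
# Hypothetical outline of src/workflows/fine_tune_flow.py; the real task
# breakdown may differ. Only the flow name matches the deployment below.
import json

from prefect import flow, task


@task
def load_dataset(path: str) -> list[dict]:
    with open(path, encoding="utf-8") as f:
        return [json.loads(line) for line in f]


@task
def fine_tune(records: list[dict]) -> str:
    # LoRA training would run here; return the saved adapter directory.
    return "models/qwen2.5-lora"


@flow(name="LLM Pipeline")
def fine_tune_flow() -> None:
    records = load_dataset("data/dataset.jsonl")
    adapter_path = fine_tune(records)
    print(f"Adapter saved to {adapter_path}")


if __name__ == "__main__":
    fine_tune_flow()
```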
## Kubernetes Deployment

```bash
# Install MLflow from the community Helm chart
helm install mlflow community-charts/mlflow --namespace llm-finetuning --create-namespace

# Set up a self-hosted Prefect server
helm install prefect-server prefect/prefect-server --namespace llm-finetuning

# Set up a worker for the self-hosted Prefect server
helm install prefect-worker prefect/prefect-worker --namespace llm-finetuning -f prefect-worker-values.yaml

# Apply the job that registers the workflow deployment
kubectl apply -f prefect-deploy-job.yaml
```
Deployments defined in `prefect.yaml` can also be applied and run directly with the Prefect CLI:

```bash
# Apply the deployment defined in prefect.yaml
prefect deploy -n complete-pipeline-deployment

# Run the deployment
prefect deployment run 'LLM Pipeline/complete-pipeline-deployment'
```
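Deployments can also be triggered programmatically; a small sketch using Prefect's `run_deployment` helper, with the deployment name taken from the CLI example above:

```python
# Trigger the deployment from Python; assumes the Prefect API is reachable
# and the deployment has already been applied with `prefect deploy`.
from prefect.deployments import run_deployment

flow_run = run_deployment(name="LLM Pipeline/complete-pipeline-deployment")
print(flow_run.id, flow_run.state)
```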
## Experiment Tracking

```bash
# Set the tracking URI
export MLFLOW_TRACKING_URI=http://localhost:5000

# View experiments in the MLflow UI
mlflow ui
```
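Inside the training code, runs would be logged roughly as follows. The experiment name, parameters, and metric values are illustrative assumptions, not the project's actual instrumentation.

```python
# Illustrative MLflow instrumentation; names and values are assumptions,
# not the project's actual logging calls.
import mlflow

mlflow.set_tracking_uri("http://localhost:5000")
mlflow.set_experiment("qwen2.5-lora-finetuning")

with mlflow.start_run():
    mlflow.log_params({"lora_r": 16, "lora_alpha": 32, "epochs": 3})
    mlflow.log_metric("train_loss", 1.23, step=100)
    mlflow.log_artifacts("models/qwen2.5-lora")  # upload the adapter files
```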
## Docker Images

```bash
# Build the fine-tuning image
docker build -f Dockerfile.finetuning -t llm-finetuning:latest .

# Build the inference image
docker build -f Dockerfile.inference -t llm-inference:latest .
```
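The inference image wraps the fine-tuned model in a FastAPI service. Below is a self-contained sketch of what such an endpoint could look like; the route, payload shape, and stubbed generation are assumptions, not the actual code in `src/inference`.

```python
# Hypothetical sketch of the inference API; the real service in
# src/inference loads the PEFT-wrapped Qwen2.5 model instead of this stub.
from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()


class GenerateRequest(BaseModel):
    prompt: str
    max_new_tokens: int = 128


@app.post("/generate")
def generate(req: GenerateRequest) -> dict:
    # A real implementation would call model.generate() here.
    return {"completion": f"[generated text for: {req.prompt!r}]"}
```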
### For a local registry

Note: the registry below publishes on host port 5000; if the MLflow server from the setup above is already bound to that port, map the registry to a different host port and tag the images accordingly.

```bash
# Run a local Docker registry
docker run -d -p 5000:5000 --restart always --name registry registry:2

# Build and push the fine-tuning image
docker build -f Dockerfile.finetuning -t localhost:5000/llm-finetuning:latest .
docker push localhost:5000/llm-finetuning:latest

# Build and push the inference image
docker build -f Dockerfile.inference -t localhost:5000/llm-inference:latest .
docker push localhost:5000/llm-inference:latest
```
## Future Work

- Add support for more base models
- Implement distributed training
- Add model evaluation metrics
- Implement A/B testing for deployed models
## License

MIT