This project demonstrates a containerized machine learning deployment using Ray Serve, featuring a scikit-learn Iris classifier, custom Prometheus metrics, and a pre-configured Grafana dashboard.
- `models/`: Serialized model artifacts (`.pkl`).
- `data/`: Model metadata and labels (`.json`).
- `monitoring/`: Prometheus and Grafana configuration.
- `serve_model.py`: Core deployment logic with custom metrics and health checks.
- `serve_config.yaml`: Ray Serve deployment configuration.
- `locustfile.py`: Load testing script for simulating traffic.
- `docker-compose.yml`: Full-stack orchestration (Ray, Prometheus, Grafana, Locust).
Start the entire environment (Ray Serve, Prometheus, and Grafana) in detached mode:
```bash
docker compose up --build -d
```

Wait about 20-30 seconds for the Ray cluster and Serve application to fully initialize.
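If you'd rather not guess at the timing, you can poll the Serve HTTP proxy until it reports healthy. A minimal sketch, assuming the proxy is published on localhost:8000 and that your Ray version exposes the proxy's `/-/healthz` endpoint (adjust the URL if your setup differs):

```python
import time
import urllib.error
import urllib.request


def wait_for_serve(url: str = "http://localhost:8000/-/healthz", timeout: float = 90.0) -> None:
    """Poll the Serve proxy's health endpoint until it responds or we time out."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            with urllib.request.urlopen(url, timeout=2) as resp:
                if resp.status == 200:
                    print("Serve is ready.")
                    return
        except (urllib.error.URLError, OSError):
            pass  # proxy not up yet; retry
        time.sleep(2)
    raise TimeoutError(f"Serve did not become healthy within {timeout}s")


if __name__ == "__main__":
    wait_for_serve()
```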
Use the provided test client to send a prediction request:
```bash
uv run python query_model.py
```
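query_model.py is the source of truth for the request format. For reference, a hand-rolled client might look like this sketch (the `features` payload shape and the root route are assumptions; adjust them to match what serve_model.py actually expects):

```python
import json
import urllib.request

# Hypothetical payload: the four Iris features
# (sepal length, sepal width, petal length, petal width).
payload = {"features": [5.1, 3.5, 1.4, 0.2]}

req = urllib.request.Request(
    "http://localhost:8000/",  # Serve HTTP proxy; route may differ in serve_config.yaml
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.loads(resp.read()))  # e.g. {"prediction": "setosa"}
```

Once requests are flowing, the stack exposes three monitoring endpoints: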
- Ray Dashboard: http://localhost:8265 — Monitor cluster status, logs, and integrated Grafana metrics (under the "Metrics" tab).
- Grafana: http://localhost:3000 — Dedicated visualization platform.
  - Default login: `admin`/`admin` (anonymous viewing enabled).
  - Dashboard: Navigate to Dashboards -> Ray -> Default Dashboard.
- Prometheus: http://localhost:9090 — Query raw metrics (e.g., `rate(ray_iris_predictions_total[1m])` for prediction throughput).
The application exports the following metrics:
- `ray_iris_predictions_total`: Counter of predictions, labeled by Iris species.
- `ray_iris_prediction_latency_ms`: Histogram of prediction processing time.
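The actual definitions live in serve_model.py. The sketch below shows how such metrics are typically declared with Ray Serve's metrics API; the model path, label list, and histogram buckets here are assumptions:

```python
import pickle
import time

from ray import serve
from ray.serve import metrics
from starlette.requests import Request


@serve.deployment
class IrisClassifier:
    def __init__(self, model_path: str = "models/iris.pkl"):  # hypothetical path
        with open(model_path, "rb") as f:
            self.model = pickle.load(f)
        self.labels = ["setosa", "versicolor", "virginica"]
        # Ray adds a "ray_" prefix when exporting to Prometheus, which is how
        # these surface as ray_iris_predictions_total / ray_iris_prediction_latency_ms.
        self.predictions = metrics.Counter(
            "iris_predictions_total",
            description="Predictions served, labeled by Iris species.",
            tag_keys=("species",),
        )
        self.latency_ms = metrics.Histogram(
            "iris_prediction_latency_ms",
            description="Prediction processing time in milliseconds.",
            boundaries=[1, 2, 5, 10, 25, 50, 100],  # assumed bucket edges
        )

    async def __call__(self, request: Request) -> dict:
        features = (await request.json())["features"]
        start = time.perf_counter()
        species = self.labels[int(self.model.predict([features])[0])]
        elapsed_ms = (time.perf_counter() - start) * 1000
        self.predictions.inc(tags={"species": species})
        self.latency_ms.observe(elapsed_ms)
        return {"prediction": species}


app = IrisClassifier.bind()
```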
To simulate traffic and see the metrics in the Ray Dashboard and Grafana (a sketch of a minimal locustfile follows these steps):
- Open Locust: http://localhost:8089
- Number of users: 10
- Spawn rate: 2
- Host: `http://ray-head:8000`
- Start swarming.
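locustfile.py defines the actual traffic pattern; a minimal equivalent looks roughly like this (the payload and wait times are assumptions):

```python
from locust import HttpUser, between, task


class IrisUser(HttpUser):
    # Each simulated user pauses 0.5-2s between requests (assumed pacing).
    wait_time = between(0.5, 2)

    @task
    def predict(self):
        # Posts against the Host configured in the UI (http://ray-head:8000).
        # Hypothetical payload; mirror what query_model.py sends.
        self.client.post("/", json={"features": [5.1, 3.5, 1.4, 0.2]})
```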
To stop all services and remove containers:
```bash
docker compose down -v
```