huggingface_model_deployment

TODO:

Upload model to HuggingFace modelhub
Productionize and modularize code
Makefile
Dockerize
- Integrate poetry
- Separate the fastAPI backend and gradio frontend services in different containers
- Handle them with Docker compose
[IN PROGRESS] CI/CD
- Use a self-hosted CI/CD runner
- Use a self-hosted docker image repository
- Self-host the deployment environment
- Properly tag docker images
- Create a release on new changes to the main branch
- Clean-up old images/containers on the prod server
Reduce image size by using a lighter version of PyTorch
Pytest / doctest - unit/system tests
Nox/Tox
Mypy - typechecking
Linting
Logging
Prometheus, Grafana, Loki
Training/Validation/Testing scripts and modularity
Documentation (MkDocs / Sphinx) + Docstrings
Kubernetes/Kserve/Helmchart + GPU
Model tracking, drift, automated retraining
Airflow/Dagster/Argo
Create an diagram for the entire architecture (CI/CD, model retraining, etc.)
OPTIONAL: Deploy in AWS - EC2 or ECS
OPTIONAL: Terraform infrastructure

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
.github/workflows		.github/workflows
docker		docker
notebooks		notebooks
src		src
.dockerignore		.dockerignore
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
docker-compose.yml		docker-compose.yml
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml