NYP FYP CNC Chatbot

A chatbot to help staff identify and use correct sensitivity labels in communications. Built with Python, Gradio, Pandoc, Tesseract OCR, and OpenAI.

The software is in a beta state, expect bugs quirks and potentially broken code.

🚀 Quick Start

Recommended: Use Docker and Docker Compose for setup and running.

RTFM at https://www.docker.com/ if not sure

Prerequisites

Docker and Docker Compose (v2+)
OpenAI API key (add to .env)

See https://platform.openai.com/api-keys

(For local dev: Python 3.11+ and Git)
Setup & Run (Docker, Docker Compose)

git clone https://github.com/chweekueh1/nyp-fyp-project
cd nyp-fyp-project
cp .env.dev .env   # Add your OpenAI API key to .env
python setup.py --docker-build
python setup.py --docker-run

setup.py is just a wrapper over Docker commands, so run them directly if you are unable to run the setup script on Windows.

Note that certain paths in the source code are hard coded.

🐳 Docker & Multi-Container

Uses separate containers for dev, test, prod, and docs. Requires Docker Compose for multi-container workflows and benchmarks. See Docker Compose install.

Common commands:

python setup.py --docker-build         # Build dev container
python setup.py --docker-run           # Run app
python setup.py --docker-test          # Run tests
python setup.py --docs                 # Build & serve docs (http://localhost:8080)

Note that the sites are currently exposed by nginx reverse proxy (generated by Gradio), which is exposed on http://0.0.0.0:7680 -> site_url. Documentation and other Docker containers may use other ports.

🧪 Testing

To be implemented

📁 Data Storage

User data is stored in ~/.nypai-chatbot/ (local) or /home/appuser/.nypai-chatbot/ (Docker).

You would need to create the following under the project root since we are currently using a volume mount:

|-- data
|---- cache
|---- memory_persistence
|---- reports
|---- vector_store

📚 Documentation

Build and serve docs:

python setup.py --docs

Docs available at http://127.0.0.1:8080

Technical detail: this just grabs docstrings and renders it in Sphinx.

⏳️ Benchmarking

Benchmarks for various function and API calls in the codebase can be triggered via:

python setup.py --run-benchmarks

It will output to the <project root>/data directory as benchmark.md once complete. This directory also has a JSON and SQLITE file recording Docker build details.

🔧 Code Quality

Pre-commit hooks with ruff for linting and formatting:

Note: The pre-commit flag in the setup script might not work depending on the directory you are in when you invoke the script.

If this is the case, make use of these steps instead

Activate and create Python virtual environment named .venv and activate it. See Python Docs for more on virtual environments.

pip install -r requirements/requirements-precommit.txt

Then you can run git commit and git push within the context of the virtual environment and it will automatically run the configure pre-commit hooks.

You can also manually run the pre-commit hooks at any time. See here for details.

🐛 Troubleshooting

API Key Issues: Check .env and your OpenAI API key. Port Conflicts: Default is 7860; Gradio will use the next available port. Dependencies: Pandoc, ffmpeg, hyperfine (handled by Docker).

📝 License

MIT License. See LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 150 Commits
docker		docker
requirements		requirements
src		src
styles		styles
.dockerignore		.dockerignore
.env.dev		.env.dev
.gitignore		.gitignore
.gitleaks.toml		.gitleaks.toml
.markdownlint.yaml		.markdownlint.yaml
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
ruff.toml		ruff.toml
sanity_check.py		sanity_check.py
setup_sanity.py		setup_sanity.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

NYP FYP CNC Chatbot

The software is in a beta state, expect bugs quirks and potentially broken code.

🚀 Quick Start

Prerequisites

🐳 Docker & Multi-Container

🧪 Testing

📁 Data Storage

📚 Documentation

⏳️ Benchmarking

🔧 Code Quality

🐛 Troubleshooting

📝 License

About

Uh oh!

Releases

Packages

Languages

License

chweekueh1/nyp-fyp-project

Folders and files

Latest commit

History

Repository files navigation

NYP FYP CNC Chatbot

The software is in a beta state, expect bugs quirks and potentially broken code.

🚀 Quick Start

Prerequisites

🐳 Docker & Multi-Container

🧪 Testing

📁 Data Storage

📚 Documentation

⏳️ Benchmarking

🔧 Code Quality

🐛 Troubleshooting

📝 License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages