This repository contains a simple demo that builds a streaming chat interface for Meta's Llama 3.1 8B Instruct model. Please note that this demo is rough around the edges; we use it to quickly check out new TPU-enabled machines.
Run the following commands to sync the dependencies and activate the virtual environment:
```sh
rye sync
source .venv/bin/activate
```
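`rye sync` creates the `.venv` virtual environment from the project's `pyproject.toml` and lock file; the second command then activates it.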
Next, log in with the Hugging Face CLI using the following command:
```sh
huggingface-cli login
```
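Note that the Llama 3.1 checkpoints on the Hugging Face Hub are gated, so your account needs to have accepted Meta's license terms before the model can be downloaded. If you prefer a non-interactive login, the CLI also accepts a token directly, e.g. `huggingface-cli login --token $HF_TOKEN`.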
After that, run the following command to start the app:
```sh
chainlit run src/main.py
```
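While iterating on the app, you can add chainlit's `-w` flag (`chainlit run src/main.py -w`) to enable watch mode, which reloads the server when files change.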
This sample uses the Llama 3.1 8B Instruct model with a basic system prompt to implement a chat scenario. It doesn't keep any conversation history beyond the messages in the current chat session; I've left building history functionality up to you as an exercise (one possible starting point is sketched below).
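As a starting point for that exercise, here is a minimal sketch of one way to persist a session's messages when a chat ends. It assumes the app keeps its running message list under a `history` key in chainlit's `cl.user_session`; that key name, the file path, and the JSONL format are all hypothetical, not part of the demo.

```python
import json
from pathlib import Path

import chainlit as cl

# Hypothetical location for persisted conversations.
HISTORY_FILE = Path("chat_history.jsonl")

@cl.on_chat_end
async def on_chat_end() -> None:
    # Assumes the app stores the running message list under "history".
    history = cl.user_session.get("history") or []
    if history:
        # Append one JSON record per session so past chats can be reloaded later.
        with HISTORY_FILE.open("a", encoding="utf-8") as f:
            f.write(json.dumps(history) + "\n")
```

Loading past records back at chat start, and deciding how much of them fits in the model's context window, is the other half of the exercise.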
This sample supports streaming the response back to the user using the streaming utilities from the transformers library.
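For reference, the usual way to do this with transformers is `TextIteratorStreamer`: `generate()` runs on a background thread while the streamer yields decoded text chunks that chainlit can forward with `stream_token`. The sketch below shows that general pattern; the model id, generation settings, and handler shape are assumptions, not a copy of `src/main.py`.

```python
from threading import Thread

import chainlit as cl
from transformers import AutoModelForCausalLM, AutoTokenizer, TextIteratorStreamer

MODEL_ID = "meta-llama/Meta-Llama-3.1-8B-Instruct"  # assumed checkpoint name

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

@cl.on_message
async def on_message(message: cl.Message) -> None:
    # Build a single-turn prompt; the real app also prepends a system prompt.
    messages = [{"role": "user", "content": message.content}]
    input_ids = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    )

    # The streamer yields decoded text chunks while generate() runs
    # on a background thread.
    streamer = TextIteratorStreamer(tokenizer, skip_prompt=True, skip_special_tokens=True)
    thread = Thread(
        target=model.generate,
        kwargs=dict(input_ids=input_ids, streamer=streamer, max_new_tokens=512),
    )
    thread.start()

    # Forward each chunk to the UI as it arrives.
    reply = cl.Message(content="")
    for chunk in streamer:
        await reply.stream_token(chunk)
    await reply.send()
    thread.join()
```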