Skip to content

asnaylor/nersc_llm_prototype

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

NERSC Perlmutter Triton + TensorRT-LLM Demo

This demo uses content from this repo https://github.com/NVIDIA/GenerativeAIExamples/tree/main

Setup

To copy across the model files and download the container image on Perlmutter run on a login node:

./deploy.sh setup

Deploy

Once the setup is complete, start up an interactive slurm job (replacing your account):

salloc -N 1 -C gpu -G 4 --gpu-bind=closest -t 01:00:00 -q interactive -A <account>

Inside the slurm job run:

./deploy.sh run

Connect & Test

Open up notebook.ipynb to connect to the LLM container and test.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published