GitHub - genji970/llm_api_service_deploying_in_AWS: Collects PDF data with Ray, fine-tunes a pretrained GPT-2 model, builds a REST API on top of the fine-tuned model, and deploys it on AWS. An LLM service pipeline built for deployment on AWS.

how to run

Clone the llm_project_train folder and the Dockerfile.
pip install -r llm_project_train/master/requirements.txt
python -m llm_project_train.master.main True/False
Clone the service folder (api_for_service).
Move the saved model from llm_project_train into the service folder.
pip install -r api_for_service/requirements.txt
python -m main
The API starts running.
Copy the URL printed in the terminal, append /docs, and open it in a browser to test the chat system: http://127.0.0.1:8000/docs
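
The last step above can also be checked from a script. This is a minimal sketch, not part of the repo: the helper names `docs_url` and `docs_reachable` are hypothetical, and it only assumes the API serves its docs page at /docs on the address shown above.

```python
from urllib.error import URLError
from urllib.parse import urljoin
from urllib.request import urlopen


def docs_url(base_url: str) -> str:
    """Return the docs URL for an API base URL (hypothetical helper)."""
    # Normalize to exactly one trailing slash so urljoin appends "docs".
    return urljoin(base_url.rstrip("/") + "/", "docs")


def docs_reachable(base_url: str, timeout: float = 3.0) -> bool:
    """Return True if the docs page answers with HTTP 200."""
    try:
        with urlopen(docs_url(base_url), timeout=timeout) as response:
            return response.status == 200
    except (URLError, OSError):
        # Connection refused, timeout, or an HTTP error status.
        return False


if __name__ == "__main__":
    base = "http://127.0.0.1:8000"  # local address from the steps above
    print(docs_url(base), "reachable:", docs_reachable(base))
```

If the script reports that the page is not reachable, the API process is probably not running yet or is listening on a different port.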

Or simply use the prebuilt Docker image:

docker pull ghcr.io/genji970/api:latest
docker run -d -p 8000:8000 --name api_container ghcr.io/genji970/api_image:latest
Then open http://<EC2_PUBLIC_IP>:8000 in a browser.

Detail

This repo consists of two parts: the llm_project_train folder together with the Dockerfile, and the api_for_service folder. (I merged two separate projects into one.)

Data_generating -> model_build -> master

In the Data_generating folder, the training dataset is built and saved as a CSV file.
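
As an illustration of the CSV step, here is a small sketch using only the standard library. The single `text` column, the function names, and the file name are assumptions for the example; the actual layout used in Data_generating may differ.

```python
import csv
from pathlib import Path


def save_train_dataset(texts, csv_path):
    """Write one training text per row under a single 'text' column (assumed layout)."""
    path = Path(csv_path)
    path.parent.mkdir(parents=True, exist_ok=True)
    # newline="" lets the csv module handle line endings inside quoted fields.
    with path.open("w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=["text"])
        writer.writeheader()
        for text in texts:
            writer.writerow({"text": text})
    return path


def load_train_dataset(csv_path):
    """Read the texts back in the order they were written."""
    with Path(csv_path).open(newline="", encoding="utf-8") as f:
        return [row["text"] for row in csv.DictReader(f)]
```

Text extracted from PDFs often contains commas and line breaks, which is why the sketch relies on the `csv` module's quoting rather than joining strings by hand.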

In the model_build folder, GPT-2 is loaded from Hugging Face and fine-tuned.
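
A rough sketch of what loading and fine-tuning GPT-2 can look like with the `transformers` and `torch` libraries. It is not the code in model_build: the function names, hyperparameters, and plain training loop are assumptions chosen to keep the example short.

```python
def batches(items, batch_size):
    """Yield consecutive slices of `items` with at most `batch_size` elements."""
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


def fine_tune_gpt2(texts, output_dir, epochs=1, batch_size=2, lr=5e-5):
    """Fine-tune the pretrained 'gpt2' checkpoint on raw texts and save it (sketch)."""
    # Imported lazily so `batches` stays usable without torch/transformers installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("gpt2")
    tokenizer.pad_token = tokenizer.eos_token  # GPT-2 ships without a pad token
    model = AutoModelForCausalLM.from_pretrained("gpt2")
    optimizer = torch.optim.AdamW(model.parameters(), lr=lr)

    model.train()
    for _ in range(epochs):
        for batch_texts in batches(texts, batch_size):
            enc = tokenizer(batch_texts, return_tensors="pt", padding=True,
                            truncation=True, max_length=128)
            labels = enc["input_ids"].clone()
            labels[enc["attention_mask"] == 0] = -100  # ignore padding in the loss
            loss = model(**enc, labels=labels).loss
            loss.backward()
            optimizer.step()
            optimizer.zero_grad()

    # Writes the weights and config, plus the tokenizer files, into output_dir.
    model.save_pretrained(output_dir)
    tokenizer.save_pretrained(output_dir)
    return output_dir
```

A call such as `fine_tune_gpt2(texts, "saved_model")` downloads the checkpoint on first use, so it needs network access; the folder name `saved_model` is only a placeholder.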

After fine-tuning, the weights and the full model structure are saved as a saved-model folder. You have to move this saved-model folder into the service project (the service folder).
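
The move can be done by hand or scripted. A minimal sketch with the standard library follows; `move_saved_model` and both paths in the usage note are hypothetical and should be replaced with the real folder names.

```python
import shutil
from pathlib import Path


def move_saved_model(saved_model_dir, service_dir):
    """Move the saved model folder into the service folder and return its new path."""
    source = Path(saved_model_dir)
    if not source.is_dir():
        raise FileNotFoundError(f"saved model folder not found: {source}")
    target_parent = Path(service_dir)
    target_parent.mkdir(parents=True, exist_ok=True)
    target = target_parent / source.name
    if target.exists():
        # Fail loudly instead of silently mixing two model versions.
        raise FileExistsError(f"refusing to overwrite: {target}")
    shutil.move(str(source), str(target))
    return target
```

For example, `move_saved_model("llm_project_train/saved_model", "api_for_service")` would leave the model at `api_for_service/saved_model`, assuming those are the folder names in use.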

When you run the service project, the REST API starts.
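
To give an idea of the shape of such a service, here is a small FastAPI sketch. The `/chat` route, the request schema, and the helper names are assumptions for illustration and are not taken from api_for_service; `generate` stands for any function that maps a prompt string to generated text.

```python
def make_reply(generate, message, max_chars=500):
    """Run the text generator on a stripped message and cap the reply length."""
    prompt = message.strip()
    if not prompt:
        raise ValueError("message must not be empty")
    return generate(prompt)[:max_chars]


def create_app(generate):
    """Build a FastAPI app with one hypothetical POST /chat endpoint."""
    # Imported lazily so `make_reply` can be tested without fastapi installed.
    from fastapi import FastAPI, HTTPException
    from pydantic import BaseModel

    class ChatRequest(BaseModel):
        message: str

    app = FastAPI(title="chat api sketch")

    @app.post("/chat")
    def chat(body: ChatRequest):
        try:
            return {"reply": make_reply(generate, body.message)}
        except ValueError as exc:
            # Empty input is a client error, not a server failure.
            raise HTTPException(status_code=400, detail=str(exc))

    return app
```

An app built this way can be served with `uvicorn.run(create_app(my_generate), host="0.0.0.0", port=8000)`, where `my_generate` is a placeholder for a wrapper around the fine-tuned model; FastAPI then exposes the endpoint on the /docs page automatically.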

used

python==3.10.12, torch, ray, Hugging Face, langchain (not yet), docker, csv, FastAPI, AWS EC2, etc.

