slow, and random result size when asking for a list of 50 items #422
dportabella
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I am testing the latest localGPT version, with the defaults (original SOURCE_DOCUMENTS, ingest.py and run_localGPT.py). No other documents ingested. The only difference, in run_localGPT.py I changed max_ctx_size and max_length from 2048 to 20480.
I have a machine with proxmox (server virtualization). Inside proxmox, I have a VM with 24 GB mem, 20 processors (1 socket, 20 cores), and NVDIA GeForce RTX 4070 with 16 GB VRAM (directly attached from host to the VM). Is this a good setup for fast replies?
I ask localGPT the following question: "tell me a list of 50 influential books. Write a json array. each item in the array is a object with the fields name, author, ISBN."
It takes 3 minutes to answer. Why is it so slow?
Also, it sometimes returns 5 books, and sometimes it returns 20 books (I ask for 50 books).
Why so?
Beta Was this translation helpful? Give feedback.
All reactions