slow, and random result size when asking for a list of 50 items #422
dportabella
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I am testing the latest localGPT version, with the defaults (original SOURCE_DOCUMENTS, ingest.py and run_localGPT.py). No other documents ingested. The only difference, in run_localGPT.py I changed max_ctx_size and max_length from 2048 to 20480.
I have a machine with proxmox (server virtualization). Inside proxmox, I have a VM with 24 GB mem, 20 processors (1 socket, 20 cores), and NVDIA GeForce RTX 4070 with 16 GB VRAM (directly attached from host to the VM). Is this a good setup for fast replies?
I ask localGPT the following question: "tell me a list of 50 influential books. Write a json array. each item in the array is a object with the fields name, author, ISBN."
It takes 3 minutes to answer. Why is it so slow?
Also, it sometimes returns 5 books, and sometimes it returns 20 books (I ask for 50 books).
Why so?
Beta Was this translation helpful? Give feedback.
All reactions