I am using the following `.env.local` with `llama-2-7b.Q4_K_S.gguf` and the Llama prompt template:
```env
MODELS=`[
  {
    "name": "llama-2-7b.Q4_K_S.gguf",
    "chatPromptTemplate": "<s>[INST] <<SYS>>\n{{preprompt}}\n<</SYS>>\n\n{{#each messages}}{{#ifUser}}{{content}} [/INST] {{/ifUser}}{{#ifAssistant}}{{content}} </s><s>[INST] {{/ifAssistant}}{{/each}}",
    "parameters": {
      "temperature": 0.1,
      "top_p": 0.95,
      "repetition_penalty": 1.2,
      "top_k": 50,
      "truncate": 1000,
      "max_new_tokens": 2048,
      "stop": ["</s>"]
    },
    "endpoints": [
      { "url": "http://127.0.0.1:8080", "type": "llamacpp" }
    ]
  }
]`
```
I am trying to get this working with chat-ui, but it doesn't work: the chat-ui frontend freezes. However, the server is receiving requests from the client.
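To narrow it down, it may help to confirm the llama.cpp server responds outside of chat-ui. A minimal sketch, assuming the server example binary is listening on `127.0.0.1:8080` and exposes its standard `/completion` endpoint:

```bash
# Query the llama.cpp server directly, bypassing chat-ui.
# The prompt below is illustrative; any short Llama-2-formatted prompt works.
curl http://127.0.0.1:8080/completion \
  -H "Content-Type: application/json" \
  -d '{"prompt": "<s>[INST] Hello [/INST]", "n_predict": 64}'
```

If this returns a completion but chat-ui still hangs, the problem is likely on the chat-ui side rather than the model server.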
Quick question: how did you start your llama.cpp server? Did you specify `-np 3` in the parameters?
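For reference, a typical launch might look like the sketch below; the binary name and model path are illustrative, and `-np` sets the number of parallel request slots:

```bash
# Illustrative llama.cpp server launch; adjust paths to your setup.
# -m: model file, -c: context size, -np: parallel slots, --port: HTTP port.
./server -m models/llama-2-7b.Q4_K_S.gguf -c 2048 -np 3 --port 8080
```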
> -np 3

@nsarrazin Yes, I have specified `-np 2`.
Likely resolved by my PR #867! Check out my branch and see if it helps ❤️