File tree Expand file tree Collapse file tree 2 files changed +5
-5
lines changed Expand file tree Collapse file tree 2 files changed +5
-5
lines changed Original file line number Diff line number Diff line change @@ -110,9 +110,9 @@ The RAG application has a number of environment variables that can be set to con
110
110
| | | ` gq: Tell me about Linux ` |
111
111
| | | ` linux -> macos -> microsoft windows ` |
112
112
| | | ` linux -> macos -> microsoft windows gq: Tell me about Linux ` |
113
- | LLM | Path to LLM | x86-64: [ Mistral-7B-OpenOrca- AWQ] ( https://huggingface .co/TheBloke/Mistral-7B-OpenOrca- AWQ ) |
114
- | | | arm64 : [ Mistral-7B-OpenOrca- GGUF] ( https://huggingface .co/TheBloke/Mistral-7B-OpenOrca -GGUF ) |
115
- | EMBEDDINGS | Embeddings database path | [ neuml/txtai-wikipedia-slim] ( https://huggingface .co/NeuML/txtai-wikipedia-slim ) |
113
+ | LLM | Path to LLM | x86-64: [ Llama-3.1-8B-Instruct- AWQ-INT4 ] ( https://hf .co/hugging-quants/Meta-Llama-3.1-8B-Instruct- AWQ-INT4 ) |
114
+ | | | arm64 : [ Llama-3.1-8B-Instruct- GGUF] ( https://hf .co/bartowski/Meta-Llama-3.1-8B-Instruct -GGUF ) |
115
+ | EMBEDDINGS | Embeddings database path | [ neuml/txtai-wikipedia-slim] ( https://hf .co/NeuML/txtai-wikipedia-slim ) |
116
116
| MAXLENGTH | Maximum generation length | 2048 for topics, 4096 for RAG |
117
117
| CONTEXT | RAG context size | 10 |
118
118
| DATA | Optional directory to index data from | None |
Original file line number Diff line number Diff line change @@ -290,9 +290,9 @@ def __init__(self):
290
290
self .llm = LLM (
291
291
os .environ .get (
292
292
"LLM" ,
293
- "TheBloke/Mistral-7B-OpenOrca- AWQ"
293
+ "hugging-quants/Meta-Llama-3.1-8B-Instruct- AWQ-INT4 "
294
294
if platform .machine () in ("x86_64" , "AMD" )
295
- else "TheBloke/Mistral-7B-OpenOrca- GGUF/mistral-7b-openorca. Q4_K_M.gguf" ,
295
+ else "bartowski/Meta-Llama-3.1-8B-Instruct- GGUF/Meta-Llama-3.1-8B-Instruct- Q4_K_M.gguf" ,
296
296
)
297
297
)
298
298
You can’t perform that action at this time.
0 commit comments