Skip to content

Commit f12098c

Browse files
committed
Update default models, closes #20
1 parent 1ae0ad7 commit f12098c

File tree

2 files changed

+5
-5
lines changed

2 files changed

+5
-5
lines changed

README.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -110,9 +110,9 @@ The RAG application has a number of environment variables that can be set to con
110110
| | | `gq: Tell me about Linux` |
111111
| | | `linux -> macos -> microsoft windows` |
112112
| | | `linux -> macos -> microsoft windows gq: Tell me about Linux` |
113-
| LLM | Path to LLM | x86-64: [Mistral-7B-OpenOrca-AWQ](https://huggingface.co/TheBloke/Mistral-7B-OpenOrca-AWQ) |
114-
| | | arm64 : [Mistral-7B-OpenOrca-GGUF](https://huggingface.co/TheBloke/Mistral-7B-OpenOrca-GGUF) |
115-
| EMBEDDINGS | Embeddings database path | [neuml/txtai-wikipedia-slim](https://huggingface.co/NeuML/txtai-wikipedia-slim) |
113+
| LLM | Path to LLM | x86-64: [Llama-3.1-8B-Instruct-AWQ-INT4](https://hf.co/hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4) |
114+
| | | arm64 : [Llama-3.1-8B-Instruct-GGUF](https://hf.co/bartowski/Meta-Llama-3.1-8B-Instruct-GGUF) |
115+
| EMBEDDINGS | Embeddings database path | [neuml/txtai-wikipedia-slim](https://hf.co/NeuML/txtai-wikipedia-slim) |
116116
| MAXLENGTH | Maximum generation length | 2048 for topics, 4096 for RAG |
117117
| CONTEXT | RAG context size | 10 |
118118
| DATA | Optional directory to index data from | None |

rag.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -290,9 +290,9 @@ def __init__(self):
290290
self.llm = LLM(
291291
os.environ.get(
292292
"LLM",
293-
"TheBloke/Mistral-7B-OpenOrca-AWQ"
293+
"hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4"
294294
if platform.machine() in ("x86_64", "AMD")
295-
else "TheBloke/Mistral-7B-OpenOrca-GGUF/mistral-7b-openorca.Q4_K_M.gguf",
295+
else "bartowski/Meta-Llama-3.1-8B-Instruct-GGUF/Meta-Llama-3.1-8B-Instruct-Q4_K_M.gguf",
296296
)
297297
)
298298

0 commit comments

Comments
 (0)