How can I use Llama.cpp quantized models? #247
-
Am I correct in saying that if I want to use llama.cpp models, I would have to request the LLaMA weights from Meta (not get them from the notorious magnet link), run them through the quantization tooling in https://github.com/ggerganov/llama.cpp, and then use the resulting output in privateGPT? Or am I missing something? And is it a legal grey area to distribute that final output (the quantized models), given that the weights are not free for commercial use?
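For reference, the quantization step you describe is roughly the following, a minimal sketch assuming a 7B model checkout under `models/7B/`; the script and binary names (`convert-pth-to-ggml.py`, `quantize`) and the numeric type flags are from llama.cpp at the time of writing and may have changed in later versions:

```shell
# Clone and build llama.cpp (provides the conversion script and quantize binary)
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
make

# Place the original PyTorch weights under models/7B/, then convert to ggml FP16
# (the trailing "1" selects f16 output)
python3 convert-pth-to-ggml.py models/7B/ 1

# Quantize the FP16 ggml file down to 4-bit ("2" selects the q4_0 type)
./quantize models/7B/ggml-model-f16.bin models/7B/ggml-model-q4_0.bin 2
```

The resulting `ggml-model-q4_0.bin` is the quantized file a tool like privateGPT would point at. Running this requires the multi-gigabyte original weights, which is exactly the access/licensing question raised above.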
-
If you look around, you can find that others have already posted complete quantised ggml-format models, including Alpaca, online.