quantize: be able to override metadata by key #6321

Merged
merged 2 commits into from Mar 26, 2024

ikawrakow
Contributor

This can be useful, e.g., for low-bit quantization, where experience has shown that one can improve the model by changing the f_rms_norm parameter. Instead of having to specify the metadata override each time the model is used, with this PR one can encode the override during quantization using

./quantize --override-kv KEY=TYPE:VALUE other_commandline_arguments

The --override-kv argument can be repeated multiple times.
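
To illustrate the KEY=TYPE:VALUE format, here is a minimal Python sketch of how such an argument could be split and typed. It is not the code from this PR: the function name parse_override_kv is hypothetical, the accepted type names (int, float, bool) are an assumption, and the key used in the usage note is only an example.

```python
from typing import Tuple, Union

Value = Union[int, float, bool]


def parse_override_kv(arg: str) -> Tuple[str, str, Value]:
    """Split a KEY=TYPE:VALUE string into (key, type_name, value).

    Hypothetical helper for illustration only; the type names handled
    here (int, float, bool) are an assumption, not taken from the PR.
    """
    # Everything before the first '=' is the metadata key.
    key, sep, rest = arg.partition("=")
    if not sep or not key:
        raise ValueError(f"expected KEY=TYPE:VALUE, got {arg!r}")

    # Everything before the first ':' after the key is the type name.
    type_name, sep, raw = rest.partition(":")
    if not sep:
        raise ValueError(f"missing TYPE: prefix in {arg!r}")

    if type_name == "int":
        return key, type_name, int(raw)
    if type_name == "float":
        return key, type_name, float(raw)
    if type_name == "bool":
        if raw not in ("true", "false"):
            raise ValueError(f"bool value must be true or false, got {raw!r}")
        return key, type_name, raw == "true"

    raise ValueError(f"unsupported type {type_name!r}")
```

For instance, parse_override_kv("llama.attention.layer_norm_rms_epsilon=float:1e-5") yields the key, the type name "float", and the parsed value as a Python float. Repeating --override-kv on the command line would correspond to calling this helper once per argument.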

@ggerganov ggerganov merged commit d25b1c3 into master Mar 26, 2024
31 of 36 checks passed
hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 1, 2024
* quantize: be able to override metadata by key

* minor : spacing

---------

Co-authored-by: Iwan Kawrakow <[email protected]>
Co-authored-by: Georgi Gerganov <[email protected]>
hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 3, 2024
@pipixia244
Contributor

Could this also override string-type metadata? That is not supported yet.

tybalex pushed a commit to tybalex/function.cpp that referenced this pull request Apr 17, 2024