
llama-cpp: merge upstream changes #299271

Merged
merged 3 commits into NixOS:master from josephst:fix-llamacpp Mar 29, 2024

Conversation

josephst
Member

Description of changes

Upstream llama-cpp has recently made some changes to its package.nix, namely embedding the Metal shaders on Darwin (avoiding a dependency on xcrun and Xcode) and renaming the cuBLAS backend to CUDA. This PR brings those changes into the llama-cpp package in nixpkgs.
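
As a rough sketch of what those two changes amount to in Nix terms (a minimal sketch, not the exact nixpkgs diff; `cudaSupport` and `metalSupport` are illustrative parameter names):

```nix
# Minimal sketch with illustrative option names; not the exact nixpkgs expression.
{ lib, stdenv, cudaSupport ? false, metalSupport ? stdenv.isDarwin }: {
  cmakeFlags = [
    # Upstream renamed the backend flag: LLAMA_CUBLAS -> LLAMA_CUDA.
    (lib.cmakeBool "LLAMA_CUDA" cudaSupport)
    # Embed the Metal shader source into the binary at build time, so the
    # Darwin build no longer shells out to xcrun/Xcode.
    (lib.cmakeBool "LLAMA_METAL_EMBED_LIBRARY" metalSupport)
  ];
}
```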

Things done

  • Built on platform(s)
    • x86_64-linux
    • aarch64-linux
    • x86_64-darwin
    • aarch64-darwin
  • For non-Linux: Is sandboxing enabled in nix.conf? (See Nix manual)
    • sandbox = relaxed (on Darwin aarch64)
    • sandbox = true
  • Tested, as applicable:
  • Tested compilation of all packages that depend on this change using nix-shell -p nixpkgs-review --run "nixpkgs-review rev HEAD". Note: all changes have to be committed, also see nixpkgs-review usage
  • Tested basic functionality of all binary files (usually in ./result/bin/)
  • 24.05 Release Notes (or backporting 23.05 and 23.11 Release notes)
    • (Package updates) Added a release notes entry if the change is major or breaking
    • (Module updates) Added a release notes entry if the change is significant
    • (Module addition) Added a release notes entry if adding a new NixOS module
  • Fits CONTRIBUTING.md.

Add a 👍 reaction to pull requests you find important.

Port of ggerganov/llama.cpp#6118, although compiling shaders with Xcode is disabled, as it requires disabling the sandbox (and only works on macOS anyway).
@philiptaron
Contributor

I'll be able to build this on aarch64-darwin tonight if ofBorg doesn't snipe me first.

@drupol
Contributor

drupol commented Mar 28, 2024

@abysssol Do you think you could have a look at this?

@abysssol
Contributor

abysssol commented Mar 28, 2024

@drupol I took a look at the changes, but I'm afraid I can't offer much insight, as I'm not familiar with llama-cpp (neither the nix package, the flake, nor the project itself). If you want me to look into something specific, you'll need to be more detailed about what you actually want me to do.

The only thing I noticed is that a version update is probably necessary.
The PR renaming LLAMA_CUBLAS to LLAMA_CUDA is newer than release b2481, the version currently used in the nix package. So LLAMA_CUDA probably won't have any effect, since it's being passed to an outdated llama-cpp that hasn't yet renamed the build flag.
LLAMA_METAL_EMBED_LIBRARY probably doesn't have this problem, since the PR adding support for it is older than release b2481.
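
To illustrate the version concern (a sketch with illustrative values, not the actual derivation): passing the new flag to a release that predates the rename is effectively a no-op, since CMake merely warns about manually specified variables that go unused, so the flag has to track the pinned version.

```nix
# Sketch only; versions follow the nixpkgs convention of dropping the "b" prefix.
{ lib }:
let
  version = "2481"; # pinned release at review time, older than the rename PR
in {
  cmakeFlags =
    if lib.versionAtLeast version "2568"  # b2568 is the release this PR later bumps to
    then [ "-DLLAMA_CUDA=ON" ]    # new flag name, recognized by newer releases
    else [ "-DLLAMA_CUBLAS=ON" ]; # old name; -DLLAMA_CUDA would be ignored here
}
```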

@josephst
Member Author

josephst left a comment


Updated to latest upstream package - b2568
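
For context, a bump like this is small in package.nix terms; a hedged sketch (the hash is a placeholder and the attribute layout is illustrative):

```nix
# Sketch of the version bump; hash is a placeholder, layout illustrative.
{ fetchFromGitHub }: rec {
  version = "2568";
  src = fetchFromGitHub {
    owner = "ggerganov";
    repo = "llama.cpp";
    rev = "b${version}"; # upstream tags releases as bNNNN
    hash = "sha256-0000000000000000000000000000000000000000000="; # placeholder
  };
}
```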

@drupol merged commit 40fe01a into NixOS:master Mar 29, 2024
22 of 23 checks passed
@josephst deleted the fix-llamacpp branch April 6, 2024 15:59