Pull requests: ggerganov/llama.cpp
main-interactive-mode: optionally allow for special tokens from user in interactive mode for fill-in-middle etal
#7097 opened May 6, 2024 by hanishkvc
Documenting debugging one test without anything else in the loop.
#7096 opened May 6, 2024 by josh-ramer
opencl alignment size should be converted from bits to bytes
#7090 opened May 5, 2024 by albertjin
Add left recursion check: quit early instead of going into an infinite loop
#7083 opened May 5, 2024 by nuchi
fix: use vm_allocate instead of posix_memalign for Metal on macOS
#7078 opened May 4, 2024 by giladgd
convert-hf : save memory with lazy evaluation
Labels: enhancement (New feature or request), high priority (Very important issue), need feedback (Testing and feedback with results are needed)
#7075 opened May 4, 2024 by compilade
7 tasks done
Script to convert Grok-1 weights from raw JAX pickle files.
#7058 opened May 3, 2024 by heiner
convert.py: When --vocab-only is passed, generate false but valid params
#7027 opened May 1, 2024 by 20kdc
docs: Fix typo and update description for --embeddings flag
#7026 opened May 1, 2024 by louixs
new tokenizer-verifier tool to check gguf tokenizer parameters
#6988 opened Apr 29, 2024 by anisse