Pull requests: ggml-org/llama.cpp
ggml-cpu: add RISC-V Vector support for RWKV WKV6 operation
ggml
changes relating to the ggml tensor library for machine learning
#17716
opened Dec 3, 2025 by
ixgbe
common : add parser for ministral/mistral large 3
documentation
Improvements or additions to documentation
examples
server
testing
Everything test related
fix: convert_hf_to_gguf - map new mistral-common valid_tokenizer_files output to avoid crash with --mistral-format
python
python script changes
#17712
opened Dec 3, 2025 by
SmartestWashingMachine
vulkan: Use one row per workgroup for f32 mmv
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17711
opened Dec 3, 2025 by
jeffbolznv
build: for GGML_BACKEND_DL, ggml need not depend on backend
ggml
changes relating to the ggml tensor library for machine learning
#17709
opened Dec 3, 2025 by
jeffbolznv
build: enable parallel builds in msbuild using MTT
build
Compilation issues
#17708
opened Dec 3, 2025 by
jeffbolznv
common: Deepseek V3.2 tool call parser
testing
Everything test related
#17707
opened Dec 3, 2025 by
hksdpc255
CANN: Support fusion operator that supports mul and add
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#17706
opened Dec 3, 2025 by
TianHao324
Draft
cuda: optimize SOLVE_TRI using registers and FMAF
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#17703
opened Dec 2, 2025 by
wsbagnsv1
vulkan: add more num_blocks instantiations in rms_norm
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17701
opened Dec 2, 2025 by
jeffbolznv
llama-server: fix duplicate HTTP headers in multiple models mode
examples
server
#17698
opened Dec 2, 2025 by
ServeurpersoCom
model : add ASR support for LFM2-Audio-1.5B
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
testing
Everything test related
ggml-zendnn : add ZenDNN backend for AMD CPUs
documentation
Improvements or additions to documentation
examples
ggml
changes relating to the ggml tensor library for machine learning
#17690
opened Dec 2, 2025 by
z-vishal
Use OpenAI-compatible /v1/models endpoint by default
examples
server
#17689
opened Dec 2, 2025 by
allozaur
vulkan: enable mmvq for q2_k on NVIDIA
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17675
opened Dec 2, 2025 by
jeffbolznv
vulkan: perf_logger improvements
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17672
opened Dec 2, 2025 by
jeffbolznv
vulkan: fix top_k bug when there are ties in the input
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#17659
opened Dec 1, 2025 by
jeffbolznv
ggml: use 'exists( const std::filesystem::path&, std::error_code&)' instead of 'exists( const std::filesystem::path&)' to enhance robustness
ggml
changes relating to the ggml tensor library for machine learning
#17653
opened Dec 1, 2025 by
flyinskyin2013
ggml: added missing cast sections in memcpy
ggml
changes relating to the ggml tensor library for machine learning
vibe-coded
Created with heavy use of LLM assistants, requires human verification
#17651
opened Dec 1, 2025 by
GermanAizek
ggml-cpu: remove duplicate conditional check 'iid'
ggml
changes relating to the ggml tensor library for machine learning
#17650
opened Dec 1, 2025 by
GermanAizek
sgemm: reuse loaded vector in AVX dot product calculation
ggml
changes relating to the ggml tensor library for machine learning
vibe-coded
Created with heavy use of LLM assistants, requires human verification
#17648
opened Dec 1, 2025 by
GermanAizek
llama-vocab: replace postfix with prefix increment for iterators
vibe-coded
Created with heavy use of LLM assistants, requires human verification
#17646
opened Dec 1, 2025 by
GermanAizek