Releases: ggerganov/llama.cpp

b1378

12 Oct 20:41
1e0e873
CLBlast: Fix matrix-vector multiplication (#3544)

b1377

12 Oct 15:56
370359e
examples: support LLaVA v1.5 (multimodal model) (#3436)

* WIP: start implementing LLaVA

* rm scratch buf for now, will revert after cleanup

* LLaVA image encoder is working. will combine with llama

* Add llava inference code, but it's buggy. debugging

* LLaVA is working e2e, needs to optimize memory allocation + cleanup

* Use ggml_allocr + rm unnecessary code

* fix: crlf -> lf

* fix: new line at EoF

* fix: trailing whitespace

* Add readme

* Update readme

* Some cleanup

* Are you happy editorconfig?

* rm unused batch image preprocessing

* rm unused import

* fix: rm designated initializers

* introduce pad-to-square mode for non-square images

* are you happy editorconfig?

* gitignore /llava

* Handle cases where image file does not exist

* add llava target to Makefile

* add support for 13b model variant

* Maybe seed is unlucky?

* Check if apples are compared to apples

* are you happy editorconfig?

* Use temperature = 0.1 by default

* command line: use gpt_params_parse()

* minor

* handle default n_predict

* fix typo

* llava : code formatting, rename files, fix compile warnings

* do not use Wno-cast-qual for MSVC

---------

Co-authored-by: Georgi Gerganov <[email protected]>

b1375

12 Oct 11:57
d28e572
cmake : fix add_compile_options on macOS

b1372

12 Oct 07:29
b016596
server : add completion mode (no chat) (#3582)

b1370

12 Oct 06:51
57dd55e
server : fix kv cache management (#3588)

b1369

11 Oct 21:29
b8fe4b5
main : fix session loading bug (#3400)

b1368

11 Oct 20:19
a8bdd65
server : add parameter -tb N, --threads-batch N (#3584)

Co-authored-by: Michael Coppola <[email protected]>

b1367

11 Oct 20:11
70c29da
common : fix mirostat state when using multiple sequences (#3543)

* Fix mirostat state when using multiple sequences

* Fix mirostat by completely refactoring sampling!

* Try to fix zig build.

* Export function to fetch/create default sampler states

Code formatting cleanups and add some comments

Silence a warning about id not being used when logging is disabled

* Apply some renaming suggestions.

Fix comments that were out of sync with the pull.

* Use more consistent naming convention for sampling contexts

b1366

11 Oct 18:53
8c70a5f
batched : add bench tool (#3545)

* batched : add bench tool

* batched : minor fix table

* batched-bench : add readme + n_kv_max is now configurable

* batched-bench : init warm-up batch

* batched-bench : pass custom set of PP, TG and PL

* batched-bench : add mmq CLI arg

b1365

11 Oct 11:41
24ba3d8
examples : add batched.swift + improve CI for swift (#3562)