Releases · ggerganov/llama.cpp

12 Oct 20:41

b1378

1e0e873

CLBlast: Fix matrix-vector multiplication (#3544)

12 Oct 15:56

github-actions

b1377

370359e

examples: support LLaVA v1.5 (multimodal model) (#3436)

* WIP: start implementing LLaVA

* rm scratch buf for now, will revert after cleanup

* LLaVA image encoder is working. will combine with llama

* Add llava inference code, but it's buggy. debugging

* LLaVA is working e2e, needs to optimize memory allocation + cleanup

* Use ggml_allocr + rm unnecessary code

* fix: crlf -> lf

* fix: new line at EoF

* fix: trailing whitespace

* Add readme

* Update readme

* Some cleanup

* Are you happy editorconfig?

* rm unused batch image preprocessing

* rm unused import

* fix: rm designated initializers

* introduce pad-to-square mode for non-square images

* are you happy editorconfig?

* gitignore /llava

* Handle cases where image file does not exist

* add llava target to Makefile

* add support for 13b model variant

* Maybe seed is unlucky?

* Check if apples are compared to apples

* are you happy editorconfig?

* Use temperature = 0.1 by default

* command line: use gpt_params_parse()

* minor

* handle default n_predict

* fix typo

* llava : code formatting, rename files, fix compile warnings

* do not use Wno-cast-qual for MSVC

---------

Co-authored-by: Georgi Gerganov <[email protected]>

12 Oct 11:57

github-actions

b1375

d28e572

cmake : fix add_compile_options on macOS

12 Oct 07:29

github-actions

b1372

b016596

server : add completion mode (no chat) (#3582)

12 Oct 06:51

github-actions

b1370

57dd55e

server : fix kv cache management (#3588)

11 Oct 21:29

github-actions

b1369

b8fe4b5

main : fix session loading bug (#3400)

11 Oct 20:19

github-actions

b1368

a8bdd65

server : add parameter -tb N, --threads-batch N (#3584)

Co-authored-by: Michael Coppola <[email protected]>

11 Oct 20:11

github-actions

b1367

70c29da

common : fix mirostat state when using multiple sequences (#3543)

* Fix mirostat state when using multiple sequences

* Fix mirostat by completely refactoring sampling!

* Try to fix zig build.

* Export function to fetch/create default sampler states

Code formatting cleanups and add some comments

Silence a warning about id not being used when logging is disabled

* Apply some renaming suggestions.

Fix comments that were out of sync with the pull.

* Use more consistent naming convention for sampling contexts

11 Oct 18:53

github-actions

b1366

8c70a5f

batched : add bench tool (#3545)

* batched : add bench tool

* batched : minor fix table

* batched-bench : add readme + n_kv_max is now configurable

* batched-bench : init warm-up batch

* batched-bench : pass custom set of PP, TG and PL

* batched-bench : add mmq CLI arg

11 Oct 11:41

github-actions

b1365

24ba3d8

examples : add batched.swift + improve CI for swift (#3562)

Releases: ggerganov/llama.cpp

b1378

b1377

b1375

b1372

b1370

b1369

b1368

b1367

b1366

b1365