Releases: mudler/LocalAI
v2.0.0_beta
What's Changed
Breaking Changes 🛠
- 🔥 add LLaVA support and GPT vision API, Multiple requests for llama.cpp, return JSON types by @mudler in #1254
- refactor: rename llama-stable to llama-ggml by @mudler in #1287
Bug fixes 🐛
- fix: respect OpenAI spec for response format by @mudler in #1289
- fix: handle grpc and llama-cpp with REBUILD=true by @mudler in #1328
- fix: propagate CMAKE_ARGS when building grpc by @mudler in #1334
Exciting New Features 🎉
- feat(certificates): add support for custom CA certificates by @vitorstone in #880
- feat(conda): conda environments by @mudler in #1144
- refactor: move backends into the backends directory by @mudler in #1279
- feat: allow to run parallel requests by @mudler in #1290
- feat(transformers): add embeddings with Automodel by @mudler in #1308
- ci(core): add -core images without python deps by @mudler in #1309
- feat: initial watchdog implementation by @mudler in #1341
👒 Dependencies
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1231
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1236
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1285
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1288
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1291
Other Changes
- Update .gitignore for backend/llama.cpp by @dave-gray101 in #1235
- llama index example by @sfxworks in #1237
- chianlit example by @sfxworks in #1238
- Fixes the bug 1196 by @diego-minguzzi in #1232
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1242
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1256
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1265
- deps(go-piper): update to 2023.11.6-3 by @M0Rf30 in #1257
- feat(llama.cpp): support lora with scale and yarn by @mudler in #1277
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1272
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1280
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1282
- feat: queue up requests if not running parallel requests by @mudler in #1296
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1297
- fix(api/config): allow YAML config with .yml by @Papawy in #1299
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1300
- llava.yaml (yaml format standardization) by @lunamidori5 in #1303
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1304
- ⬆️ Update mudler/go-piper by @localai-bot in #1305
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1306
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1310
- fix: ExLlama Backend Context Size & Rope Scaling by @ok2sh in #1311
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1313
- docs: Initial import from localai-website by @mudler in #1312
- fix: move python header comments below shebang in some backends by @B4ckslash in #1321
- Feat: OSX Local Codesigning by @dave-gray101 in #1319
- docs: Add llava, update hot topics by @mudler in #1322
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1323
- docs: Update Features->Embeddings page to reflect backend restructuring by @B4ckslash in #1325
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1330
- fix: rename transformers.py to avoid circular import by @mudler in #1337
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1340
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1345
- feat(petals): add backend by @mudler in #1350
- fix: go-piper add libucd at linking time by @M0Rf30 in #1357
- docs: Add docker instructions, add community projects section in README by @mudler in #1359
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1351
- docs: Update getting started and GPU section by @mudler in #1362
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1363
- ci: limit concurrent jobs by @mudler in #1364
- fix/docs: Python backend dependencies by @B4ckslash in #1360
- ci: split into reusable workflows by @mudler in #1366
- fix: OSX Build Fix Part 1: Metal by @dave-gray101 in #1365
- docs: add fine-tuning example by @mudler in #1374
- docs: site/how-to clean up by @lunamidori5 in #1342
New Contributors
- @vitorstone made their first contribution in #880
- @sfxworks made their first contribution in #1237
- @diego-minguzzi made their first contribution in #1232
- @M0Rf30 made their first contribution in #1257
- @Papawy made their first contribution in #1299
- @ok2sh made their first contribution in #1311
- @B4ckslash made their first contribution in #1321
Full Changelog: v1.40.0...v2.0.0_beta
v1.40.0
This release is a preparation before v2 - the efforts now will be to refactor, polish and add new backends. Follow up on: #1126
Hot topics
This release now brings the llama-cpp
backend which is a c++ backend tied to llama.cpp. It follows more closely and tracks recent versions of llama.cpp. It is not feature compatible with the current llama
backend but plans are to sunset the current llama
backend in favor of this one. This one will be probably be the latest release containing the older llama
backend written in go and c++. The major improvement with this change is that there are less layers that could be expose to potential bugs - and as well it ease out maintenance as well.
Support for ROCm/HIPBLAS
This release bring support for AMD thanks to @65a . See more details in #1100
More CLI commands
Thanks to @jespino now the local-ai binary has more subcommands allowing to manage the gallery or try out directly inferencing, check it out!
What's Changed
Bug fixes 🐛
- fix(openai): Populate ID and Created fields in OpenAI compatible responses by @jespino in #1164
- Fix backend/cpp/llama CMakeList.txt on OSX by @dave-gray101 in #1212
Exciting New Features 🎉
- feat(llama.cpp): enable ROCm/HIPBLAS support by @65a in #1100
- Cleaning up
examples/
models and starter.env
files by @jamesbraza in #1124 - Update docker-compose.yaml by @scenaristeur in #1131
- Feats: bruno example, gallery improvements for new scraper by @dave-gray101 in #1161
- feat(cli): Adding models subcommand with list and install subcommands by @jespino in #1165
- feat: add tts command line subcommand by @jespino in #1169
- Adding transcript subcommand by @jespino in #1171
- feat: llama.cpp gRPC C++ backend by @mudler in #1170
- feat(metrics): Adding initial support for prometheus metrics by @jespino in #1176
- ci: add GPU tests by @mudler in #1095
👒 Dependencies
- fix(deps): update module github.com/onsi/gomega to v1.28.0 by @renovate in #1113
- ⬆️ Update go-skynet/go-llama.cpp by @localai-bot in #1106
- fix(deps): update github.com/tmc/langchaingo digest to e16b777 by @renovate in #1101
- fix(deps): update github.com/go-skynet/go-llama.cpp digest to 79f9587 by @renovate in #1085
- fix(deps): update module github.com/shirou/gopsutil/v3 to v3.23.9 by @renovate in #1120
- fix(deps): update module github.com/sashabaranov/go-openai to v1.15.4 by @renovate in #1122
- fix(deps): update module github.com/rs/zerolog to v1.31.0 by @renovate in #1102
- ⬆️ Update go-skynet/go-llama.cpp by @localai-bot in #1130
- fix(deps): update github.com/go-skynet/go-llama.cpp digest to 6018c9d by @renovate in #1129
- ⬆️ Update go-skynet/go-llama.cpp by @localai-bot in #1136
- fix(deps): update github.com/go-skynet/go-llama.cpp digest to 1676dcd by @renovate in #1135
- fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to 56c0d28 by @renovate in #1140
- fix(deps): update module github.com/onsi/ginkgo/v2 to v2.13.0 by @renovate in #1152
- fix(deps): update module google.golang.org/grpc to v1.58.3 by @renovate in #1160
- fix(deps): update github.com/go-skynet/go-llama.cpp digest to aeba71e by @renovate in #1155
- fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to 10f9b49 by @renovate in #1158
- fix(deps): update module github.com/sashabaranov/go-openai to v1.16.0 by @renovate in #1159
- fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to 22de3c5 by @renovate in #1172
- fix(deps): update github.com/tmc/langchaingo digest to a02d4fd by @renovate in #1175
- fix(deps): update module github.com/gofiber/fiber/v2 to v2.50.0 by @renovate in #1177
- fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to 9a19c74 by @renovate in #1179
- fix(deps): update github.com/tmc/langchaingo digest to c636b3d by @renovate in #1188
- fix(deps): update module google.golang.org/grpc to v1.59.0 by @renovate in #1189
- chore(deps): update actions/checkout action to v4 by @renovate in #1006
- feat(llama.cpp): update by @mudler in #1200
- ⬆️ Update go-skynet/go-llama.cpp by @localai-bot in #1156
- fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to c25dc51 by @renovate in #1191
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1204
- fix(deps): update module github.com/onsi/gomega to v1.28.1 by @renovate in #1205
Other Changes
- fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to 6711bdd by @renovate in #1079
- ci: cleanup worker by @mudler in #1166
- docs(examples): Add mistral example by @mudler in #1214
- feat(llama.cpp): Bump llama.cpp, adapt grpc server by @mudler in #1211
- cleanup: drop bloomz and ggllm as now supported by llama.cpp by @mudler in #1217
- ci: use self-hosted to build container images by @mudler in #1206
- ci: run only cublas on selfhosted by @mudler in #1224
- ⬆️ Update ggerganov/llama.cpp by @localai-bot in #1207
New Contributors
- @65a made their first contribution in #1100
- @localai-bot made their first contribution in #1106
- @scenaristeur made their first contribution in #1131
- @jespino made their first contribution in #1164
Full Changelog: v1.30.0...v1.40.0
v1.30.0
What's Changed
This is an exciting LocalAI release! Besides bug-fixes and enhancements this release brings the new backend to a whole new level by extending support to vllm
and vall-e-x
for audio generation!
Bug fixes 🐛
- fix(llama): resolve lora adapters correctly from the model file by @mudler in #964
- fix(diffusers): correctly check alpha by @mudler in #967
- fix(utf8): prevent multi-byte utf8 characters from being mangled by @samm81 in #981
- fix(vall-e-x): make audiopath relative to models by @mudler in #1012
- fix(vall-e-x): copy vall-e-x next to the local-ai binary in the container image by @mudler in #1082
Exciting New Features 🎉
- feat(diffusers): Add lora by @mudler in #965
- feat: expose all the jobs via
/models/jobs
endpoint by @Jirubizu in #983 - feat: Model Gallery Endpoint Refactor / Mutable Galleries Endpoints by @dave-gray101 in #991
- Allow to manually set the seed for the SD pipeline by @maxjcohen in #998
- feat: add vall-e-x by @mudler in #1007
- feat(vllm): Initial vllm backend implementation by @mudler in #1026
- feat(speculative-sampling): allow to specify a draft model in the model config by @mudler in #1052
- fix(vllm): set default top_p with vllm by @mudler in #1078
- feat(python-grpc): allow to set max workers with PYTHON_GRPC_MAX_WORKERS by @mudler in #1081
- feat(vllm): Allow to set quantization by @mudler in #1094
👒 Dependencies
- fix(deps): update github.com/go-skynet/go-llama.cpp digest to 9072315 by @renovate in #963
- ⬆️ Update go-skynet/go-llama.cpp by @ci-robbot in #968
- fix(deps): update module github.com/gofiber/fiber/v2 to v2.49.0 by @renovate in #966
- fix(deps): update github.com/tmc/langchaingo digest to c85d396 by @renovate in #962
- fix(deps): update module github.com/valyala/fasthttp to v1.49.0 by @renovate in #971
- fix(deps): update go-llama.cpp by @mudler in #980
- fix(deps): update github.com/go-skynet/go-llama.cpp digest to bf3f946 by @renovate in #979
- fix(deps): update module github.com/sashabaranov/go-openai to v1.15.1 by @renovate in #984
- fix(deps): update module github.com/shirou/gopsutil/v3 to v3.23.8 by @renovate in #989
- fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to b6e38d6 by @renovate in #988
- fix(deps): update github.com/go-skynet/go-llama.cpp digest to c5622a8 by @renovate in #992
- ⬆️ Update go-skynet/go-llama.cpp by @ci-robbot in #996
- fix(deps): update github.com/go-skynet/go-llama.cpp digest to d8c8547 by @renovate in #997
- ⬆️ Update go-skynet/go-llama.cpp by @ci-robbot in #1002
- fix(deps): update module github.com/gofiber/fiber/v2 to v2.49.1 by @renovate in #1001
- chore(deps): remove unused package by @appleboy in #1003
- ⬆️ Update go-skynet/go-llama.cpp by @ci-robbot in #1005
- fix(deps): update github.com/go-skynet/go-llama.cpp digest to 05dc4b6 by @renovate in #1004
- feat(llama.cpp): update by @mudler in #1024
- fix(deps): update module github.com/sashabaranov/go-openai to v1.15.2 by @renovate in #1022
- fix(deps): update github.com/go-skynet/go-llama.cpp digest to 4145bd5 by @renovate in #1025
- ⬆️ Update go-skynet/go-llama.cpp by @ci-robbot in #1027
- fix(deps): update github.com/tmc/langchaingo digest to 9c8845b by @renovate in #1029
- ⬆️ Update go-skynet/go-llama.cpp by @ci-robbot in #1043
- fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to f0735ef by @renovate in #1034
- fix(deps): update module github.com/sashabaranov/go-openai to v1.15.3 by @renovate in #1035
- chore(deps): update docker/build-push-action action to v5 by @renovate in #1039
- chore(deps): update docker/login-action action to v3 by @renovate in #1040
- fix(deps): update github.com/go-skynet/go-llama.cpp digest to cc8a123 by @renovate in #1041
- ⬆️ Update go-skynet/go-llama.cpp by @ci-robbot in #1048
- chore(deps): update docker/metadata-action action to v5 by @renovate in #1045
- fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to cf4eb53 by @renovate in #1047
- ⬆️ Update go-skynet/go-llama.cpp by @ci-robbot in #1062
- fix(deps): update module google.golang.org/grpc to v1.58.1 by @renovate in #1020
- fix(deps): update module github.com/gofiber/fiber/v2 to v2.49.2 by @renovate in #1049
- fix(deps): update module github.com/valyala/fasthttp to v1.50.0 by @renovate in #1060
- fix(deps): update github.com/go-skynet/go-llama.cpp digest to b471eb7 by @renovate in #1050
- fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to e86c637 by @renovate in #1059
- ⬆️ Update go-skynet/go-llama.cpp by @ci-robbot in #1084
- fix(llama.cpp): update, run go mod tidy by @mudler in #1088
- fix(deps): update module google.golang.org/grpc to v1.58.2 by @renovate in #1090
- fix(deps): update github.com/tmc/langchaingo digest to 2c309cf by @renovate in #1097
Other Changes
- [query_data example] max_chunk_overlap in PromptHelper must be in 0..1 range by @quoing in #1000
- feat(diffusers): don't set seed in params and respect device by @mudler in #1010
- Update README to reflect changes in Continue's config file by @rdeaton in #1014
- 1038 - Streamlit bot with LocalAI by @majoshi1 in #1072
- Cleaned up chatbot-ui READMEs by @jamesbraza in #1075
- Requested Changes from GPT4ALL to Luna-AI-Llama2 by @lunamidori5 in #1092
- Add the CONTRIBUTING.md by @Aisuko in #1098
New Contributors
- @Jirubizu made their first contribution in #983
- @maxjcohen made their first contribution in #998
- @appleboy made their first contribution in #1003
- @quoing made their first contribution in #1000
- @rdeaton made their first contribution in #1014
- @majoshi1 made their first contribution in #1072
- @jamesbraza made their first contribution in #1075
- @lunamidori5 made their first contribution in #1092
Full Changelog: v1.25.0...v2.0.0
v1.25.0
What's Changed
Breaking Changes 🛠
Bug fixes 🐛
- fix: disable usage by default (still experimental) by @mudler in #929
- fix: drop racy code, refactor and group API schema by @mudler in #931
Exciting New Features 🎉
- feat(Makefile): allow to restrict backend builds by @mudler in #890
- feat(diffusers): various enhancements by @mudler in #895
- feat: make initializer accept gRPC delay times by @mudler in #900
- feat(diffusers): add DPMSolverMultistepScheduler++, DPMSolverMultistepSchedulerSDE++, guidance_scale by @mudler in #903
- feat(diffusers): overcome prompt limit by @mudler in #904
- feat(diffusers): add img2img and clip_skip, support more kernels schedulers by @mudler in #906
- Usage Features by @dave-gray101 in #863
- feat(diffusers): be consistent with pipelines, support also depthimg2img by @mudler in #926
- feat: add --single-active-backend to allow only one backend active at the time by @mudler in #925
- feat: add llama-stable backend by @mudler in #932
- feat: allow to customize rwkv tokenizer by @dave-gray101 in #937
- feat: backend monitor shutdown endpoint, process based by @dave-gray101 in #938
- feat: Allow to load lora adapters for llama.cpp by @mudler in #955
👒 Dependencies
- ⬆️ Update nomic-ai/gpt4all by @ci-robbot in #878
- fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to 4e55940 by @renovate in #893
- fix(deps): update github.com/go-skynet/go-llama.cpp digest to 18f25c2 by @renovate in #894
- ⬆️ Update nomic-ai/gpt4all by @ci-robbot in #899
- fix(deps): update github.com/tmc/langchaingo digest to eb0cbd3 by @renovate in #902
- fix(deps): update github.com/go-skynet/go-llama.cpp digest to f03869d by @renovate in #901
- fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to a630935 by @renovate in #898
- fix(deps): update module github.com/shirou/gopsutil/v3 to v3.23.7 - autoclosed by @renovate in #924
- fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to 36f7fb5 by @renovate in #908
- fix(deps): update github.com/tmc/langchaingo digest to fef0821 by @renovate in #922
- ⬆️ Update nomic-ai/gpt4all by @ci-robbot in #911
- fix(deps): update module github.com/google/uuid to v1.3.1 by @renovate in #936
- fix(deps): update module github.com/onsi/ginkgo/v2 to v2.12.0 by @renovate in #949
- fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to 27a8b02 by @renovate in #947
- ⬆️ Update nomic-ai/gpt4all by @ci-robbot in #953
- ⬆️ Update go-skynet/go-llama.cpp by @ci-robbot in #954
- fix(deps): update github.com/go-skynet/go-llama.cpp digest to bf63302 by @renovate in #930
- fix(deps): update github.com/tmc/langchaingo digest to 1e2a401 by @renovate in #948
- fix(deps): update module github.com/otiai10/openaigo to v1.6.0 by @renovate in #960
Other Changes
- infra: add setup-go@4, test against 1.20.x (go.mod) and stable (1.21) by @dave-gray101 in #935
- Example: Continue (dev) by @gruberdev in #940
- initial draft of an importable Insomnia profile for developers by @dave-gray101 in #942
New Contributors
- @gruberdev made their first contribution in #940
Full Changelog: v1.24.1...v1.25.0
v1.24.1
v1.24.0
What's Changed
Bug fixes 🐛
Exciting New Features 🎉
- feat: add bark and AutoGPTQ by @mudler in #871
- feat: Add Diffusers by @mudler in #874
- feat: add API_KEY list support by @neboman11 in #877
- feat: Add exllama by @mudler in #881
- feat: pre-configure LocalAI galleries by @mudler in #886
👒 Dependencies
- ⬆️ Update nomic-ai/gpt4all by @ci-robbot in #870
- fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to 0f2bb50 by @renovate in #869
- fix(deps): update github.com/tmc/langchaingo digest to fd8b7f0 by @renovate in #882
- fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to 4d855af by @renovate in #875
Other Changes
- docs: base-Update comments in .env for cublas, openblas, clblas by @scott4290 in #867
New Contributors
- @scott4290 made their first contribution in #867
- @neboman11 made their first contribution in #877
Full Changelog: v1.23.2...v.1.24.0
v1.23.2
What's Changed
Bug fixes 🐛
Exciting New Features 🎉
- feat: "simple" chat/edit/completion template system prompt from config by @dave-gray101 in #856
- feat: add ngqa and RMSNormEps parameters by @mudler in #860
- feat: update integer, number and string rules - allow primitives as root types by @mudler in #862
👒 Dependencies
- fix(deps): update github.com/tmc/langchaingo digest to 271e9bd by @renovate in #857
- fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to c449b71 by @renovate in #858
- ⬆️ Update nomic-ai/gpt4all by @ci-robbot in #859
- fix(deps): update github.com/go-skynet/go-llama.cpp digest to 50cee77 by @renovate in #861
Full Changelog: v1.23.1...v1.23.2
v1.23.1
What's Changed
Bug fixes 🐛
- Fix: Model Gallery Downloads by @dave-gray101 in #835
- fix(examples): use pinned versions in the k8sgpt example by @mudler in #845
- fix(gallery): preload from file should by in YAML format by @mudler in #846
- Revert ":arrow_up: Update go-skynet/go-llama.cpp" by @mudler in #850
- fix(gallery): use inline YAML by @mudler in #851
Exciting New Features 🎉
- feature: model list :: filter query string parameter by @dave-gray101 in #830
👒 Dependencies
- ⬆️ Update nomic-ai/gpt4all by @ci-robbot in #847
Other Changes
- fix(deps): update module github.com/rs/zerolog to v1.30.0 by @renovate in #836
- ⬆️ Update go-skynet/go-llama.cpp by @ci-robbot in #824
- ⬆️ Update nomic-ai/gpt4all by @ci-robbot in #825
- fix(deps): update github.com/go-skynet/go-llama.cpp digest to 8c51308 by @renovate in #822
- fix(deps): update github.com/tmc/langchaingo digest to 7df4fe5 by @renovate in #826
- refactor: Remove remaining uses of depreciated package
io/ioutil
by @dave-gray101 in #837 - Create .gitattributes to force git clone to keep the LF line endings on .sh files by @energynumbers in #838
- Added "BMO Chatbot" to "Projects already using LocalAI to run local models" section." by @longy2k in #828
- ✨📝 Add OpenOps to README's project list by @azigler in #832
- ⬆️ Update nomic-ai/gpt4all by @ci-robbot in #841
- fix(deps): update github.com/tmc/langchaingo digest to 8f10160 by @renovate in #843
- feat: Update logo by @mudler in #849
- fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to cbdcde8 by @renovate in #833
- Update to working k8sgpt + localai example in documentation by @tharpooljha in #852
New Contributors
- @energynumbers made their first contribution in #838
- @longy2k made their first contribution in #828
- @azigler made their first contribution in #832
- @tharpooljha made their first contribution in #852
Full Changelog: v1.23.0...v1.23.1
v1.23.0
What's Changed
Breaking Changes 🛠
Bug fixes 🐛
- fix missing openai_api_base on langchain-chroma example by @wendyliga in #818
- fix: add rope settings during model load, fix CUDA by @mudler in #821
- fix: select function calls if 'name' is set in the request by @mudler in #827
- fix: symlink libphonemize in the container by @mudler in #831
Exciting New Features 🎉
- Added CPU information to entrypoint.sh by @finger42 in #794
- feat: cancel stream generation if client disappears by @tmm1 in #792
- fix: use bytes in gRPC proto instead of strings by @mudler in #813
👒 Dependencies
- ⬆️ Update nomic-ai/gpt4all by @ci-robbot in #807
- fix(deps): update github.com/tmc/langchaingo digest to 7d5f9fd by @renovate in #768
- fix(deps): update github.com/donomii/go-rwkv.cpp digest to c898cd0 by @renovate in #748
- fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to 9100b2e by @renovate in #753
- fix(deps): update module github.com/sashabaranov/go-openai to v1.14.1 by @renovate in #783
- ⬆️ Update nomic-ai/gpt4all by @ci-robbot in #814
- fix(deps): update module google.golang.org/grpc to v1.57.0 by @renovate in #815
- fix(deps): update github.com/go-skynet/go-llama.cpp digest to 6ba16de by @renovate in #820
- fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to 39acbc8 by @renovate in #817
Other Changes
- fix: update gitignore and make clean by @dave-gray101 in #798
- fix(deps): update github.com/go-skynet/go-llama.cpp digest to 562d2b5 by @renovate in #766
New Contributors
- @finger42 made their first contribution in #794
- @tmm1 made their first contribution in #792
- @wendyliga made their first contribution in #818
Full Changelog: v1.22.0...v1.23.0
v1.22.0
What's Changed
Breaking Changes 🛠
Bug fixes 🐛
- fix: move metal file to grpcs assets by @mudler in #777
- fix: make completions endpoint more close to OpenAI specification by @mudler in #790
Exciting New Features 🎉
- feat: add llama-master backend by @mudler in #752
- [build] pass build type to cmake on libtransformers.a build by @TonDar0n in #741
- feat: resolve JSONSchema refs (planners) by @mudler in #774
- feat: backends improvements by @mudler in #778
- feat(llama2): add template for chat messages by @dave-gray101 in #782
👒 Dependencies
- ⬆️ Update go-skynet/go-bert.cpp by @ci-robbot in #758
- ⬆️ Update nomic-ai/gpt4all by @ci-robbot in #759
- ⬆️ Update nomic-ai/gpt4all by @ci-robbot in #769
- ⬆️ Update go-skynet/go-llama.cpp by @ci-robbot in #770
- ⬆️ Update nomic-ai/gpt4all by @ci-robbot in #775
- ⬆️ Update go-skynet/go-llama.cpp by @ci-robbot in #723
- ⬆️ Update nomic-ai/gpt4all by @ci-robbot in #785
Other Changes
- example(functions): Add OpenAI functions example by @mudler in #767
- [Telegram-bot example] Fix lint for command docker-compose by @exsplashit in #787
New Contributors
- @TonDar0n made their first contribution in #741
- @exsplashit made their first contribution in #787
Full Changelog: v1.21.0...v1.22.0