Releases: BerriAI/litellm
v1.40.7
Full Changelog: v1.40.6...v1.40.7
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.40.7
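Once the container is up, you can sanity-check it with a request to the same /chat/completions endpoint the load tests below exercise. A minimal sketch, assuming you have already configured a model on the proxy; the model name and the sk-1234 key here are placeholder assumptions, not defaults:
curl http://localhost:4000/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-1234" \
  -d '{
    "model": "gpt-3.5-turbo",
    "messages": [{"role": "user", "content": "ping"}]
  }'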
Don't want to maintain your internal proxy? Get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 97 | 126.50565680197539 | 6.4278560269757214 | 0.003340881510902142 | 1924 | 1 | 82.64289499999222 | 1316.4627209999935 |
Aggregated | Passed ✅ | 97 | 126.50565680197539 | 6.4278560269757214 | 0.003340881510902142 | 1924 | 1 | 82.64289499999222 | 1316.4627209999935 |
v1.40.6
🚨 Note: LiteLLM Proxy added `opentelemetry` as a dependency in this release. We recommend waiting for a stable release before upgrading your production instances.
✅ LiteLLM Python SDK users: you should be unaffected by this change (`opentelemetry` was only added for the proxy server).
🔥 LiteLLM 1.40.6 - Proxy 100+ LLMs at scale with our production-grade OpenTelemetry logger. Trace LLM API calls, DB requests, and cache requests 👉 Start here: https://docs.litellm.ai/docs/proxy/logging#logging-proxy-inputoutput-in-opentelemetry-format (a config sketch follows below)
🐞 [Fix] - Allow redacting messages from Slack alerting https://docs.litellm.ai/docs/proxy/alerting#advanced---redacting-messages-from-alerts
🔨 [Refactor] - Refactor proxy_server.py to use common function for add_litellm_data_to_request
✨ [Feat] OpenTelemetry - Log Exceptions from Proxy Server
✨ [Feat] OpenTelemetry - Log Redis Cache Reads / Writes
✨ [Feat] OpenTelemetry - Log DB Exceptions
✨ [Feat] OpenTelemetry - Instrument DB Reads
🐞 [Fix] UI - Allow custom logout url and show proxy base url on API Ref Page
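For the OpenTelemetry logger highlighted above, here is a rough sketch of one way to wire it up. The `callbacks: ["otel"]` setting follows the linked docs; the exporter env var values and the collector endpoint are illustrative assumptions, not verified defaults:
# write a minimal proxy config that turns on the OTEL callback
cat > litellm_config.yaml <<'EOF'
litellm_settings:
  callbacks: ["otel"]
EOF

# run the proxy with the config mounted; exporter/endpoint values are assumptions
docker run \
  -v $(pwd)/litellm_config.yaml:/app/config.yaml \
  -e STORE_MODEL_IN_DB=True \
  -e OTEL_EXPORTER="otlp_http" \
  -e OTEL_ENDPOINT="http://collector:4318" \
  -p 4000:4000 \
  ghcr.io/berriai/litellm:main-v1.40.6 \
  --config /app/config.yaml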
What's Changed
- feat(bedrock_httpx.py): add support for bedrock converse api by @krrishdholakia in #4033
- feature - Types for mypy - issue #360 by @mikeslattery in #3925
- [Fix] - Allow redacting `messages` from slack alerting by @ishaan-jaff in #4047
- Fix to support all file types supported by Gemini by @nick-rackauckas in #4055
- [Feat] OTEL - Instrument DB Reads by @ishaan-jaff in #4058
- [Refactor] - Refactor proxy_server.py to use common function for `add_litellm_data_to_request` by @ishaan-jaff in #4065
- [Feat] OTEL - Log Exceptions from Proxy Server by @ishaan-jaff in #4067
- Raw request debug logs - security fix by @krrishdholakia in #4068
- [FEAT] OTEL - Log Redis Cache Read / Writes by @ishaan-jaff in #4070
- [FEAT] OTEL - LOG DB Exceptions by @ishaan-jaff in #4071
- [Fix] UI - Allow custom logout url and show proxy base url on API Ref Page by @ishaan-jaff in #4072
New Contributors
- @mikeslattery made their first contribution in #3925
- @nick-rackauckas made their first contribution in #4055
Full Changelog: v1.40.5...v1.40.6
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.40.6
Don't want to maintain your internal proxy? Get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 130.0 | 151.53218399526997 | 6.362696017911015 | 0.0 | 1903 | 0 | 109.01354200001379 | 1319.1295889999992 |
Aggregated | Passed ✅ | 130.0 | 151.53218399526997 | 6.362696017911015 | 0.0 | 1903 | 0 | 109.01354200001379 | 1319.1295889999992 |
v1.40.5
What's Changed
- Table format fix and Typo by @SujanShilakar in #4037
- feat: add langfuse metadata via proxy request headers by @ndrsfel in #3990
- Add Ollama as a provider in proxy ui by @sha-ahammed in #4020
- modified docs proxy->logging->langfuse by @syGOAT in #4035
- fix tool usage null content using vertexai by @themrzmaster in #4039
- Fixed openai token counter bug by @Raymond1415926 in #4036
- feat(router.py): enable setting 'order' for a deployment in model list by @krrishdholakia in #4046
- docs: add llmcord.py to projects by @jakobdylanc in #4060
- Fix log message in Custom Callbacks doc by @iwamot in #4061
- refactor: replace 'traceback.print_exc()' with logging library by @krrishdholakia in #4049
- feat(aws_secret_manager.py): Support AWS KMS for Master Key encryption by @krrishdholakia in #4054
- [Feat] Enterprise - Enforce Params in request to LiteLLM Proxy by @ishaan-jaff in #4043
- feat - OTEL set custom service names and custom tracer names by @ishaan-jaff in #4048
New Contributors
- @ndrsfel made their first contribution in #3990
- @sha-ahammed made their first contribution in #4020
- @syGOAT made their first contribution in #4035
- @Raymond1415926 made their first contribution in #4036
- @jakobdylanc made their first contribution in #4060
- @iwamot made their first contribution in #4061
Full Changelog: v1.40.4...v1.40.5
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.40.5
Don't want to maintain your internal proxy? Get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 98 | 123.75303621190369 | 6.512790176735744 | 0.0 | 1949 | 0 | 80.83186400000386 | 1991.117886999973 |
Aggregated | Passed ✅ | 98 | 123.75303621190369 | 6.512790176735744 | 0.0 | 1949 | 0 | 80.83186400000386 | 1991.117886999973 |
v1.40.4
What's Changed
- feat: clarify slack alerting message by @nibalizer in #4023
- [Admin UI] Analytics - fix div by 0 error on /model/metrics by @ishaan-jaff in #4021
- Use DEBUG level for curl command logging by @grav in #2980
- feat(create_user_button.tsx): allow admin to invite user to proxy via user-email/pwd invite-links by @krrishdholakia in #4028
- [FIX] Proxy redirect to `PROXY_BASE_URL/ui` after logging in by @ishaan-jaff in #4027
- [Feat] Audit Logs for Key, User, ProxyModel CRUD operations by @ishaan-jaff in #4030
New Contributors
- @nibalizer made their first contribution in #4023
Full Changelog: v1.40.3...v1.40.4
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.40.4
Don't want to maintain your internal proxy? Get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 74 | 89.43947919222931 | 6.450062450815326 | 0.0 | 1930 | 0 | 64.37952199996744 | 1143.0389689999743 |
Aggregated | Passed ✅ | 74 | 89.43947919222931 | 6.450062450815326 | 0.0 | 1930 | 0 | 64.37952199996744 | 1143.0389689999743 |
v1.40.3-stable
What's Changed
- feat: clarify slack alerting message by @nibalizer in #4023
New Contributors
- @nibalizer made their first contribution in #4023
Full Changelog: v1.40.3...v1.40.3-stable
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.40.3-stable
Don't want to maintain your internal proxy? Get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 140.0 | 166.81647102860174 | 6.3100225495221665 | 0.0 | 1888 | 0 | 109.54055500008053 | 2288.330084999984 |
Aggregated | Passed ✅ | 140.0 | 166.81647102860174 | 6.3100225495221665 | 0.0 | 1888 | 0 | 109.54055500008053 | 2288.330084999984 |
v1.40.3
What's Changed
- [FIX] Proxy - only log cache credentials in debug mode by @ishaan-jaff in #4024
Full Changelog: v1.40.2...v1.40.3
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.40.3
Don't want to maintain your internal proxy? Get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 130.0 | 168.35103872813087 | 6.385058663866248 | 0.0 | 1909 | 0 | 109.50845100001061 | 8353.559378 |
Aggregated | Passed ✅ | 130.0 | 168.35103872813087 | 6.385058663866248 | 0.0 | 1909 | 0 | 109.50845100001061 | 8353.559378 |
v1.40.2-stable
Full Changelog: v1.40.1.dev4...v1.40.2-stable
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.40.2-stable
Don't want to maintain your internal proxy? Get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 100.0 | 135.25610868094057 | 6.399866394760457 | 0.0 | 1915 | 0 | 82.61822200000779 | 2219.8920350000435 |
Aggregated | Passed ✅ | 100.0 | 135.25610868094057 | 6.399866394760457 | 0.0 | 1915 | 0 | 82.61822200000779 | 2219.8920350000435 |
v1.40.2
What's Changed
- Add simple OpenTelemetry tracer by @yujonglee in #3974
- [FEAT] Add native OTEL logging to LiteLLM by @ishaan-jaff in #4010
- [Docs] Use OTEL logging on LiteLLM Proxy by @ishaan-jaff in #4011
- fix(bedrock): raise nested error response by @pharindoko in #3989
- [Feat] Admin UI - Add, Edit all LiteLLM callbacks on UI by @ishaan-jaff in #4014
- feat(assistants/main.py): add assistants api streaming support by @krrishdholakia in #4012
- feat(utils.py): Support `stream_options` param across all providers by @krrishdholakia in #4015 (curl sketch after this list)
- fix(utils.py): fix cost calculation for openai-compatible streaming object by @krrishdholakia in #4009
- [Fix] Admin UI Internal Users by @ishaan-jaff in #4016
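For the `stream_options` change above, a quick curl sketch against the proxy (placeholder model and key, as before). With `"include_usage": true`, the final streamed chunk should carry a usage object, following the OpenAI-style param this feature supports:
curl http://localhost:4000/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-1234" \
  -d '{
    "model": "gpt-3.5-turbo",
    "messages": [{"role": "user", "content": "hi"}],
    "stream": true,
    "stream_options": {"include_usage": true}
  }'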
Full Changelog: v1.40.1...v1.40.2
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.40.2
Don't want to maintain your internal proxy? Get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 72 | 86.0339053382131 | 6.392727588765549 | 0.0 | 1913 | 0 | 61.2748209999836 | 896.4834699999642 |
Aggregated | Passed ✅ | 72 | 86.0339053382131 | 6.392727588765549 | 0.0 | 1913 | 0 | 61.2748209999836 | 896.4834699999642 |
v1.40.1.dev4
What's Changed
- Add simple OpenTelemetry tracer by @yujonglee in #3974
- [FEAT] Add native OTEL logging to LiteLLM by @ishaan-jaff in #4010
- [Docs] Use OTEL logging on LiteLLM Proxy by @ishaan-jaff in #4011
- fix(bedrock): raise nested error response by @pharindoko in #3989
- [Feat] Admin UI - Add, Edit all LiteLLM callbacks on UI by @ishaan-jaff in #4014
- feat(assistants/main.py): add assistants api streaming support by @krrishdholakia in #4012
- feat(utils.py): Support `stream_options` param across all providers by @krrishdholakia in #4015
- fix(utils.py): fix cost calculation for openai-compatible streaming object by @krrishdholakia in #4009
- [Fix] Admin UI Internal Users by @ishaan-jaff in #4016
Full Changelog: v1.40.1...v1.40.1.dev4
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.40.1.dev4
Don't want to maintain your internal proxy? Get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 110.0 | 130.49834083376624 | 6.432223242582805 | 0.0 | 1925 | 0 | 92.76206099997353 | 2155.1117690000297 |
Aggregated | Passed ✅ | 110.0 | 130.49834083376624 | 6.432223242582805 | 0.0 | 1925 | 0 | 92.76206099997353 | 2155.1117690000297 |
v1.40.1.dev2
What's Changed
- Add simple OpenTelemetry tracer by @yujonglee in #3974
Full Changelog: v1.40.1...v1.40.1.dev2
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.40.1.dev2
Don't want to maintain your internal proxy? Get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 140.0 | 177.0382996107586 | 6.334561220731733 | 0.0 | 1896 | 0 | 114.13910500004931 | 1784.0317350000134 |
Aggregated | Passed ✅ | 140.0 | 177.0382996107586 | 6.334561220731733 | 0.0 | 1896 | 0 | 114.13910500004931 | 1784.0317350000134 |