New Models / Updated Modelsβ
- Gemini (VertexAI + Google AI Studio)
/chat/completion
/realtime
- VertexAI
/chat/completion
- Fix llama streaming error - where model response was nested in returned streaming chunk - PR
- Ollama
/chat/completion
- structure responses fix - PR
- Bedrock
- Nvidia NIM
/chat/completion
[NEED DOCS ON SUPPORTED PARAMS]- Add tools, tool_choice, parallel_tool_calls support - PR
- Novita AI
- New Provider added for
/chat/completion
routes - PR
- New Provider added for
- Azure
/image/generation
- Fix azure dall e 3 call with custom model name - PR
- Cohere
/embeddings
- Migrate embedding to use
/v2/embed
- adds support for output_dimensions param - PR
- Migrate embedding to use
- Anthropic
/chat/completion
- Web search tool support - native + openai format - PR [NEEDS DOCS]
- VLLM
/chat/completion
- Support embedding input as list of integers - PR [NEEDS DOCS]
- OpenAI
LLM API Endpointsβ
- Responses API
- Fix delete API support - PR
- Rerank API
/v2/rerank
now registered as βllm_api_routeβ - enabling non-admins to call it - PR
- Realtime API
- Gemini Multimodal Live API support - PR
Spend Tracking Improvementsβ
/chat/completion
,/messages
/audio/transcription
/embeddings
- Azure AI - Add cohere embed v4 pricing - PR
Management Endpoints / UIβ
- Models
- Ollama - adds api base param to UI
- Logs
- Add team id, key alias, key hash filter on logs - https://github.com/BerriAI/litellm/pull/10831
- Guardrail tracing now in Logs UI - https://github.com/BerriAI/litellm/pull/10893
- Teams
- Patch for updating team info when team in org and members not in org - https://github.com/BerriAI/litellm/pull/10835
- Guardrails
- Add Bedrock, Presidio, Lakers guardrails on UI - https://github.com/BerriAI/litellm/pull/10874
- See guardrail info page - https://github.com/BerriAI/litellm/pull/10904
- Allow editing guardrails on UI - https://github.com/BerriAI/litellm/pull/10907
- Test Key
- select guardrails to test on UI
Logging / Alerting Integrationsβ
- StandardLoggingPayload
- Generic API Logger
- Support passing application/json header
- Arize Phoenix
- PagerDuty
- Pagerduty is now a free feature - PR
- Alerting
- Sending slack alerts on virtual key/user/team updates is now free - PR
Guardrailsβ
- Guardrails
- New
/apply_guardrail
endpoint for directly testing a guardrail - PR [NEEDS DOCS]
- New
- Lakera
/v2
endpoints support - PR
- Presidio
- Aim Security
- Support for anonymization in AIM Guardrails - PR
Performance / Loadbalancing / Reliability improvementsβ
- Allow overriding all constants using a .env variable - PR
- Maximum retention period for spend logs
General Proxy Improvementsβ
- Authentication
- HandleΒ Bearer $LITELLM_API_KEYΒ inΒ x-litellm-api-keyΒ custom header PR
- New Enterprise pip package -
litellm-enterprise
- fixes issue whereenterprise
folder was not found when using pip package - Proxy CLI
- Add
models import
command - PR
- Add
- OpenWebUI
- Configure LiteLLM to Parse User Headers from Open Web UI
- LiteLLM Proxy w/ LiteLLM SDK
- Option to force/always use the litellm proxy when calling via LiteLLM SDK
New Contributorsβ
- @imdigitalashish made their first contribution in PR #10617
- @LouisShark made their first contribution in PR #10688
- @OscarSavNS made their first contribution in PR #10764
- @arizedatngo made their first contribution in PR #10654
- @jugaldb made their first contribution in PR #10805
- @daikeren made their first contribution in PR #10781
- @naliotopier made their first contribution in PR #10077
- @damienpontifex made their first contribution in PR #10813
- @Dima-Mediator made their first contribution in PR #10789
- @igtm made their first contribution in PR #10814
- @shibaboy made their first contribution in PR #10752
- @camfarineau made their first contribution in PR #10629
- @ajac-zero made their first contribution in PR #10439
- @damgem made their first contribution in PR #9802
- @hxdror made their first contribution in PR #10757
- @wwwillchen made their first contribution in PR #10894