LiteLLM


A self-hosted proxy server (AI gateway) for calling all LLMs in the OpenAI format

LiteLLM is a versatile, robust solution for developers and AI platform teams who want to streamline how they work with large language models (LLMs). It ships in two forms: a Python SDK for direct integration into application code, and a proxy server (AI gateway) for centralized, team-wide access. LiteLLM standardizes access to numerous LLM providers, including OpenAI, Azure, Anthropic, Mistral, and many others, exposing them all through a unified OpenAI-compatible API. Existing code written against OpenAI can therefore be pointed at other LLMs with minimal modification. LiteLLM also simplifies error handling by mapping provider-specific exceptions to their OpenAI equivalents, and it integrates with observability tools such as Lunary and Langfuse.
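To illustrate the unified interface, here is a minimal sketch using the Python SDK. The model names are assumptions for illustration, and the API keys are placeholders you would replace with your own:

```python
import os
from litellm import completion

# Placeholder credentials; in practice these come from your environment.
os.environ["OPENAI_API_KEY"] = "sk-..."
os.environ["ANTHROPIC_API_KEY"] = "sk-ant-..."

messages = [{"role": "user", "content": "Summarize LiteLLM in one sentence."}]

# The call shape is identical across providers; only the model string changes.
openai_response = completion(model="gpt-4o", messages=messages)
anthropic_response = completion(
    model="anthropic/claude-3-5-sonnet-20240620", messages=messages
)

# Both responses follow the OpenAI schema, so downstream parsing is unchanged.
print(openai_response.choices[0].message.content)
print(anthropic_response.choices[0].message.content)
```

Because both responses use the OpenAI schema, the code that consumes them does not change when the provider does.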

A key benefit of LiteLLM is its routing and load-balancing layer: requests can be distributed across multiple model deployments, and failed calls can automatically fall back to alternative models, which improves availability and keeps latency predictable. The proxy can also modify requests and responses in flight and cache LLM responses to reduce cost and latency. Together, the broad model support, unified interface, robust error handling, and advanced routing make LiteLLM a valuable tool for organizations managing modern AI infrastructure.
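As a sketch of the routing side, the snippet below configures LiteLLM's Router with two deployments behind one alias plus a fallback model. The deployment names, endpoints, and keys are all placeholders:

```python
from litellm import Router

model_list = [
    {  # primary deployment, here an Azure one (hypothetical deployment name)
        "model_name": "gpt-4o",  # the alias clients will request
        "litellm_params": {
            "model": "azure/my-gpt4o-deployment",
            "api_key": "azure-key-placeholder",
            "api_base": "https://my-endpoint.openai.azure.com",
        },
    },
    {  # second deployment under the same alias, for load balancing
        "model_name": "gpt-4o",
        "litellm_params": {
            "model": "gpt-4o",
            "api_key": "openai-key-placeholder",
        },
    },
    {  # fallback target on a different provider
        "model_name": "claude-fallback",
        "litellm_params": {
            "model": "anthropic/claude-3-5-sonnet-20240620",
            "api_key": "anthropic-key-placeholder",
        },
    },
]

router = Router(
    model_list=model_list,
    # If every "gpt-4o" deployment fails, retry on the Claude alias.
    fallbacks=[{"gpt-4o": ["claude-fallback"]}],
)

response = router.completion(
    model="gpt-4o",  # request the alias; the router picks a healthy deployment
    messages=[{"role": "user", "content": "Hello"}],
)
print(response.choices[0].message.content)
```

The same kind of model list can be supplied to the standalone proxy through a config file (`litellm --config config.yaml`), after which any OpenAI-compatible client can simply point its base URL at the proxy.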

https://github.com/BerriAI/litellm
