LiteLLM is a library that simplifies calling Anthropic, Azure, Huggingface, Replicate, and many other LLM providers through a single unified interface.
Installation and setup
Chat models
Embeddings
LiteLLMEmbeddings embeds text across 100+ providers with a single consistent interface. All configuration is explicit, with no environment variables required.
model. The interface stays the same:
LiteLLMEmbeddingsRouter:
Document loaders
LiteLLMOCRLoader loads documents via a LiteLLM proxy’s OCR endpoint (e.g. Azure Document Intelligence). The proxy handles all provider-specific authentication and configuration.
API reference
For detailed documentation of all classes and configurations, see the langchain-litellm API reference.| Class | Description |
|---|---|
| ChatLiteLLM | LangChain chat model wrapper for LiteLLM |
| ChatLiteLLMRouter | Router-backed chat model for load balancing and fallbacks |
| LiteLLMEmbeddings | Embed text across 100+ providers with a single consistent interface |
| LiteLLMEmbeddingsRouter | Router-backed embeddings for load balancing across deployments |
| LiteLLMOCRLoader | Document loader via LiteLLM proxy’s OCR endpoint |
Connect these docs to Claude, VSCode, and more via MCP for real-time answers.

