Quick Start | Documentation | OpenAI API Compatibility | Discord
[!IMPORTANT] Llama Stack is now OGX. The name changed, and so did the mission — model-agnostic, multi-SDK, production-grade. Read the full announcement →
Open-source agentic API server for building AI applications. OpenAI-compatible. Any model, any infrastructure.
OGX is a drop-in replacement for the OpenAI API that you can run anywhere — your laptop, your datacenter, or the cloud. Use any OpenAI-compatible client or agentic framework. Swap between Llama, GPT, Gemini, Mistral, or any model without changing your application code.
from openai import OpenAI
client = OpenAI(base_url="http://localhost:8321/v1", api_key="fake")
response = client.chat.completions.create(
model="llama-3.3-70b",
messages=[{"role": "user", "content": "Hello"}],
)
/v1/chat/completions, /v1/completions, and /v1/embeddings endpoints, compatible with any OpenAI client/v1/vector_stores and /v1/files for managed document storage and search/v1/batches for offline batch processing/v1/messages) or Google GenAI SDK (/v1alpha/interactions) natively alongside the OpenAI APIOGX has a pluggable provider architecture. Develop locally with Ollama, deploy to production with vLLM, or connect to a managed service — the API stays the same.
See the provider documentation for the full list.
Install and run a OGX server:
# One-line install
curl -LsSf https://github.com/ogx-ai/ogx/raw/main/scripts/install.sh | bash
# Or install via uv
uv pip install ogx[starter]
# Start the server (uses the starter distribution with Ollama)
uv run ogx stack run starter
Then connect with any OpenAI, Anthropic, or Google GenAI client — Python, TypeScript, curl, or any framework that speaks these APIs.
See the Quick Start guide for detailed setup.
Client SDKs:
OGX provides official client SDKs for Python and TypeScript:
| Language | SDK | Package |
|---|---|---|
| Python | ogx-client-python | |
| TypeScript | ogx-client-typescript |
Alternative Python SDK:
For users who prefer an OpenAPI Generator-based SDK, an alternative Python client is available:
The official ogx_client SDK is recommended for most use cases. The ogx_open_client package offers an alternative for teams with specific OpenAPI tooling requirements.
We hold regular community calls every Thursday at 09:00 AM PST — see the Community Event on Discord for details.
Thanks to all our amazing contributors!