| 1 | """Shared OpenAI-compatible chat completions client. Used by vllm + llama.cpp. |
| 2 | |
| 3 | Both servers expose `/v1/chat/completions` with the same request/response shape, |
| 4 | so this module is the single point of HTTP contact for both backends. |
nothing calls this directly
no test coverage detected