Endpoints
Chat completions
Create OpenAI-compatible chat completions with Capriole AI models.
POST
Use this endpoint when your application, SDK, or gateway expects the OpenAI Chat Completions request and response format.
Send a standard Chat Completions payload with
model and messages. Capriole AI rewrites the model value to the selected provider model ID, forwards compatible fields such as messages, tools, tool_choice, response_format, and stream, and returns the upstream JSON or SSE stream unchanged. For streaming requests, Capriole AI sets stream_options.include_usage=true before forwarding so usage can be recorded.
For Capriole-native text generation, use POST /v1/chat. Use POST /v1/chat/completions when OpenAI-compatible wire behavior is required.Authorizations
Use an API key created in the Capriole AI page. Send it as Authorization: Bearer sk-....
Body
application/json
Public model identifier returned by GET /v1/models
Available options:
openai/gpt-5.5, openai/gpt-5.4-mini, anthropic/claude-opus-4-8, anthropic/claude-opus-4-7, google/gemini-3.1-pro-preview, google/gemini-3.5-flash OpenAI-compatible conversation messages.
OpenAI-compatible tool definitions.
OpenAI-compatible tool choice.
OpenAI-compatible response format.
Stream the response as OpenAI-compatible server-sent events.
OpenAI-compatible stream options. For streaming requests, Capriole AI sets include_usage=true before upstream dispatch for usage accounting.