Endpoints
Chat completions
Create OpenAI-compatible chat completions with Capriole AI models.
POST
Use this endpoint when your application, SDK, or gateway expects the OpenAI Chat Completions request and response format. Send a standard Chat Completions payload withDocumentation Index
Fetch the complete documentation index at: https://docs.capriole.ai/llms.txt
Use this file to discover all available pages before exploring further.
model and messages. Capriole AI rewrites the model value to the selected provider model ID, forwards compatible fields such as messages, tools, tool_choice, response_format, and stream, and returns the upstream JSON or SSE stream unchanged. For streaming requests, Capriole AI sets stream_options.include_usage=true before forwarding so usage can be recorded.
For Capriole-native text generation, use POST /v1/chat. Use POST /v1/chat/completions when OpenAI-compatible wire behavior is required.Authorizations
Use an API key created in the Capriole AI page. Send it as Authorization: Bearer sk-....
Body
application/json
Public model identifier returned by GET /v1/models
Available options:
openai/gpt-5.5, openai/gpt-5.4-mini, anthropic/claude-opus-4-8, anthropic/claude-opus-4-7, google/gemini-3.1-pro-preview, google/gemini-3.5-flash OpenAI-compatible conversation messages.
OpenAI-compatible tool definitions.
OpenAI-compatible tool choice.
OpenAI-compatible response format.
Stream the response as OpenAI-compatible server-sent events.
OpenAI-compatible stream options. For streaming requests, Capriole AI sets include_usage=true before upstream dispatch for usage accounting.