Create chat response
Endpoints
Chat
Creates a text response from a model.
POST
Create chat response
Use this endpoint to send plain text input to a model and receive a plain text response. Capriole AI web chat and the public API are separate product surfaces. In web chat, Claude Fast and Claude Thinking are product modes over the same Claude Opus 4.7 model: Fast is the lower-latency chat lane over that model, and Thinking is the deeper-analysis chat lane. The public API does not expose those web chat modes as separate model IDs; for Claude API requests, useDocumentation Index
Fetch the complete documentation index at: https://docs.capriole.ai/llms.txt
Use this file to discover all available pages before exploring further.
anthropic/claude-opus-4-7.Authorizations
Use an API key created in the Capriole AI page. Send it as Authorization: Bearer sk-....
Body
application/json
Public model identifier returned by GET /v1/models
Available options:
openai/gpt-5.5, openai/gpt-5.4-mini, google/gemini-3.1-pro-preview, google/gemini-3.5-flash, anthropic/claude-opus-4-7 Plain text user input
Enable provider-native web search when the selected model supports it.
Optional sampling temperature.
Required range:
x >= 0Optional maximum number of output tokens.
Optional maximum number of provider retries.
Required range:
x >= 0Optional provider request timeout in seconds.