Create chat response
Endpoints
Chat
Creates a text response from a model.
POST
Create chat response
Use this endpoint to send plain text input to a model and receive a plain text response.
Capriole AI web chat and the public API are separate product surfaces. In web chat, Claude Fast and Claude Thinking are product modes over Claude Opus 4.8. The public API does not expose those web chat modes as separate model IDs; for Claude API requests, use
anthropic/claude-opus-4-8. Existing anthropic/claude-opus-4-7 integrations remain supported.Authorizations
Use an API key created in the Capriole AI page. Send it as Authorization: Bearer sk-....
Body
application/json
Public model identifier returned by GET /v1/models
Available options:
openai/gpt-5.5, openai/gpt-5.4-mini, anthropic/claude-opus-4-8, anthropic/claude-opus-4-7, google/gemini-3.1-pro-preview, google/gemini-3.5-flash Plain text user input
Enable provider-native web search when the selected model supports it.
Optional sampling temperature.
Required range:
x >= 0Optional maximum number of output tokens.
Optional maximum number of provider retries.
Required range:
x >= 0Optional provider request timeout in seconds.