Create chat completion
Creates a model response for the given chat conversation. Supports streaming via server-sent events (SSE) when `stream: true` is set in the request body.
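As a rough illustration of the streaming mode, the sketch below pulls the payloads out of `data:` lines in a block of SSE text. The helper name `parseSseData` is hypothetical, and the `[DONE]` end-of-stream sentinel is an assumption borrowed from similar APIs; this page does not confirm the exact chunk format.

```typescript
// Hypothetical helper: extracts the payloads of `data:` fields from SSE text.
// Assumption (not confirmed by this page): the stream ends with a `[DONE]` sentinel.
function parseSseData(text: string): string[] {
  const payloads: string[] = [];
  for (const rawLine of text.split(/\r?\n/)) {
    // Skip blank lines, comments (lines starting with ":") and other fields.
    if (!rawLine.startsWith("data:")) continue;
    const payload = rawLine.slice("data:".length).trim();
    if (payload === "[DONE]") break; // assumed end-of-stream marker
    payloads.push(payload);
  }
  return payloads;
}
```

This sketch treats every `data:` line as a complete event and expects whole lines as input. A production client would also need to buffer partial lines across network chunks and join multi-line `data:` fields.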
Response headers
`Inference-Id`: Unique ID for this request. Include it when contacting support.
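One way to make the header useful is to capture it alongside the status code whenever a request fails. The sketch below assumes a `fetch`-style headers object. `HeaderReader` and `supportReference` are hypothetical names introduced for illustration.

```typescript
// Minimal structural type so the sketch works with fetch's Headers
// or any other object exposing a compatible get() method.
interface HeaderReader {
  get(name: string): string | null;
}

// Builds a short string to include in a support request.
// Assumption: the Inference-Id header is readable by the client
// (in browsers this may depend on the server's CORS configuration).
function supportReference(status: number, headers: HeaderReader): string {
  const id = headers.get("Inference-Id") ?? "unavailable";
  return `status=${status} inference-id=${id}`;
}
```

With `fetch`, this could be called as `supportReference(response.status, response.headers)`.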
Error codes
| Status | Meaning |
|---|---|
| 402 | Insufficient credits |
| 429 | Rate limited |
| 504 | No backend available |
| 529 | No healthy backends are available for the requested model |
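A client might map these statuses to messages and decide whether a retry is worthwhile. The sketch below copies the meanings from the table above. The retry policy (treating 429, 504 and 529 as transient) and the backoff values are assumptions, not documented behavior, and all function names are hypothetical.

```typescript
// Meanings copied from the error code table.
const ERROR_MEANINGS = new Map<number, string>([
  [402, "Insufficient credits"],
  [429, "Rate limited"],
  [504, "No backend available"],
  [529, "No healthy backends are available for the requested model"],
]);

// Assumption: 429, 504 and 529 are transient and worth retrying.
// 402 is excluded because a retry cannot succeed until credits are added.
const RETRYABLE = new Set<number>([429, 504, 529]);

function describeError(status: number): string {
  return ERROR_MEANINGS.get(status) ?? `Unexpected status ${status}`;
}

function shouldRetry(status: number): boolean {
  return RETRYABLE.has(status);
}

// Exponential backoff: 500 ms, 1 s, 2 s, 4 s, then capped at 8 s (illustrative values).
function backoffMs(attempt: number): number {
  return Math.min(500 * 2 ** attempt, 8000);
}
```

If the service returns a `Retry-After` header on 429 responses, honoring it would be preferable to a fixed schedule; this page does not say whether it does.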
Authorization
BearerAuth: API key passed as a Bearer token
In: header
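In practice this means sending the key in the standard `Authorization` header. A minimal sketch follows; `buildHeaders` is a hypothetical helper, and the empty-key check is a convenience added here rather than something the API requires.

```typescript
// Builds the headers for a JSON request authenticated with a Bearer token.
function buildHeaders(apiKey: string): Record<string, string> {
  if (apiKey.trim() === "") {
    throw new Error("API key must not be empty");
  }
  return {
    Authorization: `Bearer ${apiKey}`,
    "Content-Type": "application/json",
  };
}
```

Reading the key from an environment variable rather than hard-coding it keeps it out of source control.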
Request Body
application/json
TypeScript Definitions
Use the request body type in TypeScript.
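The full type definition is not reproduced on this page. As a hedged sketch, the type below covers only the fields that appear here: `model` and `messages` from the example request, and `stream` from the endpoint description. The shape of a message (`role`, `content`) is inferred from the example response and is an assumption for requests. `buildRequestBody` is a hypothetical helper.

```typescript
// Assumed message shape, inferred from the example response on this page.
interface ChatMessage {
  role: string;
  content: string;
}

// Sketch of the request body, limited to fields mentioned on this page.
// The real schema very likely accepts more fields.
interface ChatCompletionRequest {
  model: string;
  messages: ChatMessage[];
  stream?: boolean; // when true, the response is streamed via SSE
}

// Serializes a request body; stream defaults to false.
function buildRequestBody(
  model: string,
  messages: ChatMessage[],
  stream: boolean = false,
): string {
  const body: ChatCompletionRequest = { model, messages, stream };
  return JSON.stringify(body);
}
```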
Response Body
application/json
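The example response for this endpoint can be described with a type like the one below. It mirrors the field names in the example exactly. Which fields are optional is an assumption, since the example shows placeholders only, and `firstContent` is a hypothetical helper.

```typescript
// Sketch of the response body, mirroring the example response for this endpoint.
interface ChatCompletionResponse {
  id: string;
  object: string;
  created: number;
  model: string;
  choices: Array<{
    index: number;
    message: {
      role: string;
      content: string;
      tool_calls?: unknown[]; // assumed optional; element shape is not shown here
      tool_call_id?: string; // assumed optional
    };
    finish_reason: string;
  }>;
  usage: {
    prompt_tokens: number;
    completion_tokens: number;
    total_tokens: number;
  };
}

// Returns the content of the first choice, or null when there are no choices.
function firstContent(res: ChatCompletionResponse): string | null {
  const first = res.choices[0];
  return first ? first.message.content : null;
}
```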
curl -X POST "https://example.com/v1/chat/completions" \
  -H "Authorization: Bearer <token>" \
  -H "Content-Type: application/json" \
  -d '{ "model": "string", "messages": [ {} ] }'

{
  "id": "string",
  "object": "string",
  "created": 0,
  "model": "string",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "string",
        "content": "string",
        "tool_calls": [
          {}
        ],
        "tool_call_id": "string"
      },
      "finish_reason": "string"
    }
  ],
  "usage": {
    "prompt_tokens": 0,
    "completion_tokens": 0,
    "total_tokens": 0
  }
}

List models GET
Lists the currently available models with pricing and capability metadata.
Create completion POST
Creates a completion for the provided prompt. Supports streaming via SSE when `stream: true`.

**Response headers**

- `Inference-Id`: Unique ID for this request. Include this when contacting support.

**Error codes**

| Status | Meaning |
|--------|---------|
| `402` | Insufficient credits |
| `429` | Rate limited |
| `504` | No backend available |
| `529` | No healthy backends are available for the requested model |