Create async chat completion
Submits a chat completion request for asynchronous execution. Returns a job ID immediately with HTTP 202. Poll the corresponding GET endpoint with the job ID to retrieve the result. Streaming is not supported for async requests.
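The submit-then-poll flow described above can be sketched roughly as follows. This is a minimal illustration, not the gateway's official client: the GET path, the "processing" status, and the names poll_job and http_fetch_job are assumptions that do not appear in this page. Only the "pending" status and the id field come from the sample response.

```python
import json
import time
import urllib.request
from typing import Any, Callable, Dict

# Assumption: any status outside this tuple means the job has finished.
# Only "pending" appears in the sample response; "processing" is a guess.
IN_PROGRESS = ("pending", "processing")


def poll_job(
    fetch_job: Callable[[str], Dict[str, Any]],
    job_id: str,
    interval: float = 2.0,
    timeout: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Call fetch_job(job_id) until the job leaves an in-progress status."""
    deadline = time.monotonic() + timeout
    while True:
        job = fetch_job(job_id)
        if job.get("status") not in IN_PROGRESS:
            return job
        if time.monotonic() >= deadline:
            raise TimeoutError(
                f"job {job_id} still {job.get('status')!r} after {timeout}s"
            )
        sleep(interval)


def http_fetch_job(base_url: str, token: str) -> Callable[[str], Dict[str, Any]]:
    """Build a fetcher for the GET endpoint (the path below is an assumption)."""

    def fetch(job_id: str) -> Dict[str, Any]:
        req = urllib.request.Request(
            f"{base_url.rstrip('/')}/v1/async/chat/completions/{job_id}",  # assumed path
            headers={"Authorization": f"Bearer {token}"},
            method="GET",
        )
        with urllib.request.urlopen(req) as resp:
            return json.load(resp)

    return fetch
```

Passing the fetcher in as a function keeps the polling loop independent of the HTTP layer, so the loop can be exercised with a stub. Because results expire after the TTL, the polling timeout should normally be shorter than the TTL chosen at submission.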
Bearer token authentication. Use your MPilot virtual-key JWT or admin JWT.
Virtual keys (prefixed with sk-bf-) can also be passed here.
In: header
Header Parameters
x-bf-async-job-result-ttl: Time-to-live in seconds for the job result after completion. Defaults to 3600 (1 hour). Once the TTL expires, the job result is deleted automatically.
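As an illustration of how the authentication and TTL headers fit together, here is a small sketch that builds the POST request with Python's standard library. The helper name build_submit_request and its parameters are hypothetical. The path and the x-bf-async-job-result-ttl header name are taken from the curl example on this page, and the bearer token is assumed to go in the standard Authorization header.

```python
import json
import urllib.request
from typing import Any, Dict, Optional


def build_submit_request(
    base_url: str,
    token: str,
    payload: Dict[str, Any],
    result_ttl_seconds: Optional[int] = None,
) -> urllib.request.Request:
    """Build (but do not send) the async chat completion POST request."""
    headers = {
        # Assumption: the bearer token is sent in the Authorization header.
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }
    # Omitting the TTL header leaves the documented default of 3600 seconds.
    if result_ttl_seconds is not None:
        headers["x-bf-async-job-result-ttl"] = str(int(result_ttl_seconds))
    return urllib.request.Request(
        f"{base_url.rstrip('/')}/v1/async/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers=headers,
        method="POST",
    )
```

The request can then be sent with urllib.request.urlopen(req). A successful submission is expected to return HTTP 202 with a body containing the job id.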
Model in provider/model format (e.g., openai/gpt-4)
List of messages in the conversation
Fallback models in provider/model format
Whether to stream the response
Range: -2 <= value <= 2
Empty Object
Empty Object
Range: -2 <= value <= 2
Format for the response
Empty Object
Range: 0 <= value <= 2
Deterministic sampling seed
Nucleus sampling parameter
Range: 0 <= value <= 1
Number of most likely tokens to return at each position
Range: 0 <= value <= 20
Up to 4 sequences where the API will stop generating tokens
Predicted output content for the model to reference (OpenAI only). Can reduce latency.
Prompt cache retention policy
Allowed values: "in-memory" | "24h"
Web search options for chat completions (OpenAI only)
Allowed values: "low" | "medium" | "high"
Response Body
curl -X POST "https://{AI_GATEWAY_URL}/v1/async/chat/completions" \
  -H "Authorization: Bearer <YOUR_TOKEN>" \
  -H "x-bf-async-job-result-ttl: 3600" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "openai/gpt-4",
    "messages": [
      { "role": "user", "content": "Hello" }
    ]
  }'
{
  "id": "string",
  "status": "pending",
  "expires_at": "2019-08-24T14:15:22Z",
  "created_at": "2019-08-24T14:15:22Z",
  "completed_at": "2019-08-24T14:15:22Z",
  "status_code": 0,
  "result": null,
  "error": {
    "event_id": "string",
    "type": "string",
    "is_bifrost_error": true,
    "status_code": 0,
    "error": {
      "type": "string",
      "code": "string",
      "message": "string",
      "param": "string",
      "event_id": "string"
    },
    "extra_fields": {
      "provider": "openai",
      "model_requested": "string",
      "request_type": "string"
    }
  }
}
{
  "event_id": "string",
  "type": "string",
  "is_bifrost_error": true,
  "status_code": 0,
  "error": {
    "type": "string",
    "code": "string",
    "message": "string",
    "param": "string",
    "event_id": "string"
  },
  "extra_fields": {
    "provider": "openai",
    "model_requested": "string",
    "request_type": "string"
  }
}
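The sample bodies above show two shapes: a job record whose error field nests an error envelope, and a bare error envelope. A hedged sketch of extracting a readable message from either shape is given below. The function name describe_error is hypothetical, and the rule for telling the two shapes apart (a job record has both id and status) is an assumption based only on the samples shown here.

```python
from typing import Any, Dict, Optional


def describe_error(body: Dict[str, Any]) -> Optional[str]:
    """Return a short error description, or None if the body carries no error."""
    # Assumption: job records have "id" and "status"; bare envelopes do not.
    is_job_record = "id" in body and "status" in body
    envelope = body.get("error") if is_job_record else body
    if not envelope:
        return None

    detail = envelope.get("error") or {}
    extra = envelope.get("extra_fields") or {}

    text = detail.get("message") or "unknown error"
    if detail.get("code"):
        text = f"{detail['code']}: {text}"
    if extra.get("provider"):
        text = f"{text} (provider: {extra['provider']})"
    return text
```

Handling both shapes in one helper lets the same code serve the submit call, which may fail with a bare envelope, and the polling call, which returns a job record whose error field is set when the job fails.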