MAI DS R1

Developer documentation

MAI DS R1 for text generation, reasoning, tool calling, and streaming responses.

Model Reference

Chat and reasoning models

Language, code, reasoning, multimodal chat, tool calling, and streaming responses. Endpoint: http://omixa.cloud/api/v1/chat/completions

MAI-DS-R1

Capabilities: chat, streaming
Context window: 163,840 tokens
Max output: 163,840 tokens

Input, per 1M tokens: $1.350000
Output, per 1M tokens: $5.400000
Minimum hold: $0.010000
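As a rough illustration of how the per-token prices above translate into a per-request cost, the sketch below multiplies token counts by the listed rates. The function name `estimate_cost` is hypothetical, and the minimum hold is deliberately not modeled because this page does not define how it is applied.

```python
from decimal import Decimal

# Prices copied from the listing above (USD per 1,000,000 tokens).
INPUT_PRICE_PER_1M = Decimal("1.35")
OUTPUT_PRICE_PER_1M = Decimal("5.40")
TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Return the estimated token cost in USD for one request.

    Hypothetical helper: it covers token charges only and ignores the
    minimum hold, whose billing semantics are not described here.
    """
    input_cost = Decimal(prompt_tokens) * INPUT_PRICE_PER_1M / TOKENS_PER_UNIT
    output_cost = Decimal(completion_tokens) * OUTPUT_PRICE_PER_1M / TOKENS_PER_UNIT
    return input_cost + output_cost


if __name__ == "__main__":
    # Token counts taken from a response's "usage" object.
    print(estimate_cost(prompt_tokens=123, completion_tokens=45))
```

Using `Decimal` rather than floats keeps small per-request amounts exact when they are summed across many calls.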

Integration reference

Connect MAI DS R1

Use Omixa's unified endpoint and your workspace API key. Provider routing, billing, failover, and usage records are handled by Omixa.

POST http://omixa.cloud/api/v1/chat/completions

Provider: azure-foundry
Endpoint type: azure_model_inference_chat
Context window: 163,840 tokens
Max output: 163,840 tokens
Streaming supported

Request schema

Request fields

Only send options supported by this model. Required fields and accepted values are listed below.

Field	Type	Required	Accepted values	Description
`model`	string	Yes	MAI-DS-R1	Use `MAI-DS-R1`. Omixa resolves the active provider route and failover key automatically.
`messages`	array	Yes	Any valid value	Conversation turns. Each item contains `role` and `content`. Roles are `system`, `user`, `assistant`, and tool roles when supported.
`messages[].content`	string or array	Yes	Any valid value	Text prompt or multimodal parts. Multimodal-capable models accept `{type:"text"}`, `{type:"image_url"}`, and inline files supported by Omixa.
`stream`	boolean	No	Any valid value	When true, returns OpenAI-compatible Server-Sent Events until `data: [DONE]`.
`max_tokens` or `max_completion_tokens`	integer	No	Any valid value	Maximum answer tokens. Omixa maps the correct field for Google, Azure chat, and reasoning model families.
`temperature`	number	No	Any valid value	Sampling randomness for non-reasoning models. Reasoning-only models may ignore or reject it, so Omixa strips it where needed.
`top_p`	number	No	Any valid value	Nucleus sampling. Use either `temperature` or `top_p` for predictable tuning.
`stop`	string or array	No	Any valid value	Stop sequence or sequences. Not all provider routes support it.
`response_format`	object	No	Any valid value	Structured output hints such as `{ "type": "json_object" }` when the selected provider supports JSON mode.
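One way to follow the "only send supported options" rule is to build the request body through a small helper that rejects fields not listed in the table. The sketch below is illustrative only: `build_payload` and `OPTIONAL_FIELDS` are hypothetical names, and the checks are client-side conveniences, not a description of how Omixa validates requests.

```python
from typing import Any

MODEL_ID = "MAI-DS-R1"

# Optional fields taken from the request table above.
OPTIONAL_FIELDS = {
    "stream",
    "max_tokens",
    "max_completion_tokens",
    "temperature",
    "top_p",
    "stop",
    "response_format",
}


def build_payload(messages: list[dict[str, Any]], **options: Any) -> dict[str, Any]:
    """Assemble a request body containing only fields listed in the table."""
    if not messages:
        raise ValueError("messages must contain at least one turn")
    for message in messages:
        if "role" not in message or "content" not in message:
            raise ValueError("each message needs 'role' and 'content'")

    unknown = set(options) - OPTIONAL_FIELDS
    if unknown:
        raise ValueError(f"unsupported fields: {sorted(unknown)}")

    payload: dict[str, Any] = {"model": MODEL_ID, "messages": messages}
    # Drop options explicitly set to None so they are not sent at all.
    payload.update({key: value for key, value in options.items() if value is not None})
    return payload
```

For example, `build_payload([{"role": "user", "content": "Hi"}], max_tokens=256)` returns a body with `model`, `messages`, and `max_tokens` only.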

Ready to send

Payload and response

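Start with this model-safe payload. The response shape shown below is the normalized Omixa response for a non-streaming request. Because this payload sets `"stream": true`, the same content arrives instead as `chat.completion.chunk` Server-Sent Events, as described in the streaming note.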

Example JSON payload

{
    "model": "MAI-DS-R1",
    "messages": [
        {
            "role": "system",
            "content": "You are a concise API assistant."
        },
        {
            "role": "user",
            "content": "Explain this model in three practical bullet points."
        }
    ],
    "stream": true,
    "max_tokens": 2048,
    "temperature": 0.7
}

Response shape

{
    "object": "chat.completion",
    "choices": [
        {
            "message": {
                "role": "assistant",
                "content": "..."
            }
        }
    ],
    "usage": {
        "prompt_tokens": 123,
        "completion_tokens": 45,
        "total_tokens": 168
    }
}
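To show how the response shape above might be consumed, the following sketch pulls the assistant text and the total token count out of a parsed, non-streaming response. The function name `extract_reply` is hypothetical, and it assumes the body has already been decoded from JSON into a dictionary.

```python
from typing import Any


def extract_reply(response_body: dict[str, Any]) -> tuple[str, int]:
    """Return (assistant text, total tokens) from a chat.completion body."""
    choices = response_body.get("choices") or []
    if not choices:
        raise ValueError("response contains no choices")

    # The first choice carries the assistant message in the shape shown above.
    content = choices[0]["message"]["content"]
    # Fall back to 0 if the usage block is absent.
    total_tokens = response_body.get("usage", {}).get("total_tokens", 0)
    return content, total_tokens


if __name__ == "__main__":
    sample = {
        "object": "chat.completion",
        "choices": [{"message": {"role": "assistant", "content": "..."}}],
        "usage": {"prompt_tokens": 123, "completion_tokens": 45, "total_tokens": 168},
    }
    print(extract_reply(sample))
```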

Language examples

Copy-ready integration code

Replace the example API key with a workspace key and keep model-specific fields unchanged unless the table above marks them optional.

cURL

curl -N -X POST http://omixa.cloud/api/v1/chat/completions \
  -H "Authorization: Bearer omx_live_xxx" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "MAI-DS-R1",
    "messages": [
        {
            "role": "system",
            "content": "You are a concise API assistant."
        },
        {
            "role": "user",
            "content": "Explain this model in three practical bullet points."
        }
    ],
    "stream": true,
    "max_tokens": 2048,
    "temperature": 0.7
}'

JavaScript fetch

const response = await fetch('http://omixa.cloud/api/v1/chat/completions', {
  method: 'POST',
  headers: {
    'Authorization': 'Bearer omx_live_xxx',
    'Content-Type': 'application/json'
  },
  body: JSON.stringify({
    model: 'MAI-DS-R1',
    messages: [
      { role: 'system', content: 'You are a concise API assistant.' },
      { role: 'user', content: 'Explain this model in three practical bullet points.' }
    ],
    stream: false, // response.json() below needs a single JSON body, not SSE
    max_tokens: 2048, temperature: 0.7
  })
});
if (!response.ok) throw new Error(`Request failed: ${response.status}`);
const data = await response.json();

Python requests

import requests

response = requests.post(
    'http://omixa.cloud/api/v1/chat/completions',
    headers={'Authorization': 'Bearer omx_live_xxx', 'Content-Type': 'application/json'},
    json={
        "model": "MAI-DS-R1",
        "messages": [
            {"role": "system", "content": "You are a concise API assistant."},
            {"role": "user", "content": "Explain this model in three practical bullet points."},
        ],
        # response.json() below needs one JSON body; with True the body is SSE.
        "stream": False,
        "max_tokens": 2048,
        "temperature": 0.7,
    },
    timeout=60,
)
response.raise_for_status()  # surface HTTP errors before parsing
print(response.json())

PHP cURL

$ch = curl_init('http://omixa.cloud/api/v1/chat/completions');
curl_setopt_array($ch, [
    CURLOPT_POST => true,
    CURLOPT_HTTPHEADER => ['Authorization: Bearer omx_live_xxx', 'Content-Type: application/json'],
    CURLOPT_POSTFIELDS => '{
    "model": "MAI-DS-R1",
    "messages": [
        {
            "role": "system",
            "content": "You are a concise API assistant."
        },
        {
            "role": "user",
            "content": "Explain this model in three practical bullet points."
        }
    ],
    "stream": true,
    "max_tokens": 2048,
    "temperature": 0.7
}',
    CURLOPT_RETURNTRANSFER => true,
]);
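$response = curl_exec($ch); // raw SSE text, because "stream" is true
if ($response === false) {
    throw new RuntimeException('Request failed: ' . curl_error($ch));
}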

C# HttpClient

using var client = new HttpClient();
client.DefaultRequestHeaders.Authorization = new System.Net.Http.Headers.AuthenticationHeaderValue("Bearer", "omx_live_xxx");
var json = @"{
    ""model"": ""MAI-DS-R1"",
    ""messages"": [
        {
            ""role"": ""system"",
            ""content"": ""You are a concise API assistant.""
        },
        {
            ""role"": ""user"",
            ""content"": ""Explain this model in three practical bullet points.""
        }
    ],
    ""stream"": true,
    ""max_tokens"": 2048,
    ""temperature"": 0.7
}";
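var response = await client.PostAsync("http://omixa.cloud/api/v1/chat/completions", new StringContent(json, System.Text.Encoding.UTF8, "application/json"));
response.EnsureSuccessStatusCode(); // throw on a non-2xx status before reading the body
var body = await response.Content.ReadAsStringAsync(); // raw SSE text, because "stream" is true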

Go net/http

payload := []byte(`{
    "model": "MAI-DS-R1",
    "messages": [
        {
            "role": "system",
            "content": "You are a concise API assistant."
        },
        {
            "role": "user",
            "content": "Explain this model in three practical bullet points."
        }
    ],
    "stream": true,
    "max_tokens": 2048,
    "temperature": 0.7
}`)
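// Assumes imports "bytes", "log", and "net/http", and that this runs inside a function.
req, err := http.NewRequest("POST", "http://omixa.cloud/api/v1/chat/completions", bytes.NewReader(payload))
if err != nil {
    log.Fatal(err)
}
req.Header.Set("Authorization", "Bearer omx_live_xxx")
req.Header.Set("Content-Type", "application/json")
resp, err := http.DefaultClient.Do(req)
if err != nil {
    log.Fatal(err)
}
defer resp.Body.Close() // the body carries the SSE stream; close it when done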

Streaming note

Set `"stream": true` and read Server-Sent Events line by line until `data: [DONE]`. Each event is OpenAI-compatible `chat.completion.chunk` JSON.
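The line-by-line reading described above could look roughly like the sketch below. It assumes each event follows the OpenAI `chat.completion.chunk` layout, where incremental text sits at `choices[].delta.content`. The function name `iter_content_deltas` is hypothetical, and the parser takes any iterable of decoded lines so it can be exercised without a network call.

```python
import json
from typing import Iterable, Iterator


def iter_content_deltas(lines: Iterable[str]) -> Iterator[str]:
    """Yield text fragments from SSE lines until `data: [DONE]` is seen."""
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip blank separator lines and SSE comments
        data = line[len("data:"):].strip()
        if data == "[DONE]":
            break  # end-of-stream sentinel
        chunk = json.loads(data)
        for choice in chunk.get("choices", []):
            # Assumed OpenAI-style chunk: text lives under delta.content.
            text = (choice.get("delta") or {}).get("content")
            if text:
                yield text


if __name__ == "__main__":
    demo = [
        'data: {"object": "chat.completion.chunk", "choices": [{"delta": {"content": "Hel"}}]}',
        "",
        'data: {"object": "chat.completion.chunk", "choices": [{"delta": {"content": "lo"}}]}',
        "data: [DONE]",
    ]
    print("".join(iter_content_deltas(demo)))
```

With the `requests` library, the lines can come from `requests.post(..., stream=True)` followed by `response.iter_lines(decode_unicode=True)`, passed directly to this function.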

Production checklist

Operational notes

Authenticate with `Authorization: Bearer omx_live_xxx`.
Omixa handles provider keys, routing, billing, failover, and usage recording behind this endpoint.
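For production use, the bearer token is usually better kept out of source code. The sketch below reads the key from an environment variable and builds the headers used throughout this page. The variable name `OMIXA_API_KEY` and the function `auth_headers` are assumptions made for illustration, not names defined by Omixa.

```python
import os


def auth_headers(env_var: str = "OMIXA_API_KEY") -> dict[str, str]:
    """Build request headers from a workspace key stored in the environment.

    The environment variable name is a hypothetical choice for this sketch.
    """
    api_key = os.environ.get(env_var, "").strip()
    if not api_key:
        raise RuntimeError(f"set {env_var} to your workspace API key")
    return {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
```

Failing fast when the variable is missing avoids sending requests with an empty or placeholder key.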