Gemini 3 Flash Preview

Developer documentation

Gemini 3 Flash Preview

Gemini 3 Flash Preview for text generation, reasoning, tool calling, and live streaming responses.

Model Reference

Chat and reasoning models

Language, code, reasoning, multimodal chat, tool calling, and streaming responses. Endpoint: http://omixa.cloud/api/v1/chat/completions

Gemini 3 Flash Preview

gemini-3-flash-preview

Gemini 3 Flash Preview for text generation, reasoning, tool calling, and live streaming responses.

Chat Streaming Tools Context window: 1,048,576 tokens Max output: 65,536 tokens
input per 1m tokens $0.500000
cached input per 1m tokens $0.050000
output per 1m tokens $3.000000
Integration reference

Connect Gemini 3 Flash Preview

Use Omixa's unified endpoint and your workspace API key. Provider routing, billing, failover, and usage records are handled by Omixa.

POST http://omixa.cloud/api/v1/chat/completions
  • Provider: google
  • Endpoint type: google_openai_chat
  • Context window: 1,048,576 tokens
  • Max output: 65,536 tokens
  • Streaming supported
  • Tool/function calling supported
  • Inline images/documents supported through message parts
  • Reasoning controls: minimal, low, medium, high
Request schema

Request fields

Only send options supported by this model. Required fields and accepted values are listed below.

Field Type Required Accepted values Description
model string Yes gemini-3-flash-preview Use `gemini-3-flash-preview`. Omixa resolves the active provider route and failover key automatically.
messages array Yes Any valid value Conversation turns. Each item contains `role` and `content`. Roles are `system`, `user`, `assistant`, and tool roles when supported.
messages[].content string|array Yes Any valid value Text prompt or multimodal parts. Multimodal-capable models accept `{type:"text"}`, `{type:"image_url"}`, and inline files supported by Omixa.
stream boolean No Any valid value When true, returns OpenAI-compatible Server-Sent Events until `data: [DONE]`.
max_tokens|max_completion_tokens integer No Any valid value Maximum answer tokens. Omixa maps the correct field for Google, Azure chat, and reasoning model families.
temperature number No Any valid value Sampling randomness for non-reasoning models. Reasoning-only models may ignore or reject it, so Omixa strips it where needed.
top_p number No Any valid value Nucleus sampling. Use either `temperature` or `top_p` for predictable tuning.
stop string|array No Any valid value Stop sequence or sequences. Not all provider routes support it.
response_format object No Any valid value Structured output hints such as `{ "type": "json_object" }` when the selected provider supports JSON mode.
tools array No Any valid value OpenAI-style tool/function declarations. Omixa forwards compatible declarations to tool-capable providers.
tool_choice string|object No Any valid value `auto`, `none`, `required`, or a specific tool choice when the provider supports it.
reasoning_effort string No minimal, low, medium, high Reasoning budget/effort. Omixa normalizes options per provider family.
show_thinking boolean No Any valid value Requests reasoning summaries or native `reasoning_content` chunks when supported by the selected route.
attachments inline parts No Any valid value Send image or document parts inside `messages[].content`. Use data URLs/base64 with MIME type; remote URLs are not fetched by Omixa for Google Vertex.
Ready to send

Payload and response

Start with this model-safe payload and expect the normalized Omixa response shape shown beside it.

Example JSON payload
{
    "model": "gemini-3-flash-preview",
    "messages": [
        {
            "role": "system",
            "content": "You are a concise API assistant."
        },
        {
            "role": "user",
            "content": "Explain this model in three practical bullet points."
        }
    ],
    "stream": true,
    "max_tokens": 2048,
    "reasoning_effort": "minimal",
    "show_thinking": true,
    "tools": [
        {
            "type": "function",
            "function": {
                "name": "lookup_order",
                "description": "Return order status by ID.",
                "parameters": {
                    "type": "object",
                    "properties": {
                        "order_id": {
                            "type": "string"
                        }
                    },
                    "required": [
                        "order_id"
                    ]
                }
            }
        }
    ],
    "tool_choice": "auto"
}
Response shape
{
    "object": "chat.completion",
    "choices": [
        {
            "message": {
                "role": "assistant",
                "content": "..."
            }
        }
    ],
    "usage": {
        "prompt_tokens": 123,
        "completion_tokens": 45,
        "total_tokens": 168
    }
}
Language examples

Copy-ready integration code

Replace the example API key with a workspace key and keep model-specific fields unchanged unless the table above marks them optional.

cURL
curl -X POST http://omixa.cloud/api/v1/chat/completions \
  -H "Authorization: Bearer omx_live_xxx" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemini-3-flash-preview",
    "messages": [
        {
            "role": "system",
            "content": "You are a concise API assistant."
        },
        {
            "role": "user",
            "content": "Explain this model in three practical bullet points."
        }
    ],
    "stream": true,
    "max_tokens": 2048,
    "reasoning_effort": "minimal",
    "show_thinking": true,
    "tools": [
        {
            "type": "function",
            "function": {
                "name": "lookup_order",
                "description": "Return order status by ID.",
                "parameters": {
                    "type": "object",
                    "properties": {
                        "order_id": {
                            "type": "string"
                        }
                    },
                    "required": [
                        "order_id"
                    ]
                }
            }
        }
    ],
    "tool_choice": "auto"
}'
JavaScript fetch
const response = await fetch('http://omixa.cloud/api/v1/chat/completions', {
  method: 'POST',
  headers: {
    'Authorization': 'Bearer omx_live_xxx',
    'Content-Type': 'application/json'
  },
  body: "{\n    \"model\": \"gemini-3-flash-preview\",\n    \"messages\": [\n        {\n            \"role\": \"system\",\n            \"content\": \"You are a concise API assistant.\"\n        },\n        {\n            \"role\": \"user\",\n            \"content\": \"Explain this model in three practical bullet points.\"\n        }\n    ],\n    \"stream\": true,\n    \"max_tokens\": 2048,\n    \"reasoning_effort\": \"minimal\",\n    \"show_thinking\": true,\n    \"tools\": [\n        {\n            \"type\": \"function\",\n            \"function\": {\n                \"name\": \"lookup_order\",\n                \"description\": \"Return order status by ID.\",\n                \"parameters\": {\n                    \"type\": \"object\",\n                    \"properties\": {\n                        \"order_id\": {\n                            \"type\": \"string\"\n                        }\n                    },\n                    \"required\": [\n                        \"order_id\"\n                    ]\n                }\n            }\n        }\n    ],\n    \"tool_choice\": \"auto\"\n}"
});
const data = await response.json();
Python requests
import requests

response = requests.post(
    'http://omixa.cloud/api/v1/chat/completions',
    headers={'Authorization': 'Bearer omx_live_xxx', 'Content-Type': 'application/json'},
    json={
    "model": "gemini-3-flash-preview",
    "messages": [
        {
            "role": "system",
            "content": "You are a concise API assistant."
        },
        {
            "role": "user",
            "content": "Explain this model in three practical bullet points."
        }
    ],
    "stream": True,
    "max_tokens": 2048,
    "reasoning_effort": "minimal",
    "show_thinking": True,
    "tools": [
        {
            "type": "function",
            "function": {
                "name": "lookup_order",
                "description": "Return order status by ID.",
                "parameters": {
                    "type": "object",
                    "properties": {
                        "order_id": {
                            "type": "string"
                        }
                    },
                    "required": [
                        "order_id"
                    ]
                }
            }
        }
    ],
    "tool_choice": "auto"
}
)
print(response.json())
PHP cURL
$ch = curl_init('http://omixa.cloud/api/v1/chat/completions');
curl_setopt_array($ch, [
    CURLOPT_POST => true,
    CURLOPT_HTTPHEADER => ['Authorization: Bearer omx_live_xxx', 'Content-Type: application/json'],
    CURLOPT_POSTFIELDS => '{
    "model": "gemini-3-flash-preview",
    "messages": [
        {
            "role": "system",
            "content": "You are a concise API assistant."
        },
        {
            "role": "user",
            "content": "Explain this model in three practical bullet points."
        }
    ],
    "stream": true,
    "max_tokens": 2048,
    "reasoning_effort": "minimal",
    "show_thinking": true,
    "tools": [
        {
            "type": "function",
            "function": {
                "name": "lookup_order",
                "description": "Return order status by ID.",
                "parameters": {
                    "type": "object",
                    "properties": {
                        "order_id": {
                            "type": "string"
                        }
                    },
                    "required": [
                        "order_id"
                    ]
                }
            }
        }
    ],
    "tool_choice": "auto"
}',
    CURLOPT_RETURNTRANSFER => true,
]);
$response = curl_exec($ch);
C# HttpClient
using var client = new HttpClient();
client.DefaultRequestHeaders.Authorization = new System.Net.Http.Headers.AuthenticationHeaderValue("Bearer", "omx_live_xxx");
var json = @"{
    ""model"": ""gemini-3-flash-preview"",
    ""messages"": [
        {
            ""role"": ""system"",
            ""content"": ""You are a concise API assistant.""
        },
        {
            ""role"": ""user"",
            ""content"": ""Explain this model in three practical bullet points.""
        }
    ],
    ""stream"": true,
    ""max_tokens"": 2048,
    ""reasoning_effort"": ""minimal"",
    ""show_thinking"": true,
    ""tools"": [
        {
            ""type"": ""function"",
            ""function"": {
                ""name"": ""lookup_order"",
                ""description"": ""Return order status by ID."",
                ""parameters"": {
                    ""type"": ""object"",
                    ""properties"": {
                        ""order_id"": {
                            ""type"": ""string""
                        }
                    },
                    ""required"": [
                        ""order_id""
                    ]
                }
            }
        }
    ],
    ""tool_choice"": ""auto""
}";
var response = await client.PostAsync("http://omixa.cloud/api/v1/chat/completions", new StringContent(json, System.Text.Encoding.UTF8, "application/json"));
var body = await response.Content.ReadAsStringAsync();
Go net/http
payload := []byte(`{
    "model": "gemini-3-flash-preview",
    "messages": [
        {
            "role": "system",
            "content": "You are a concise API assistant."
        },
        {
            "role": "user",
            "content": "Explain this model in three practical bullet points."
        }
    ],
    "stream": true,
    "max_tokens": 2048,
    "reasoning_effort": "minimal",
    "show_thinking": true,
    "tools": [
        {
            "type": "function",
            "function": {
                "name": "lookup_order",
                "description": "Return order status by ID.",
                "parameters": {
                    "type": "object",
                    "properties": {
                        "order_id": {
                            "type": "string"
                        }
                    },
                    "required": [
                        "order_id"
                    ]
                }
            }
        }
    ],
    "tool_choice": "auto"
}`)
req, _ := http.NewRequest("POST", "http://omixa.cloud/api/v1/chat/completions", bytes.NewReader(payload))
req.Header.Set("Authorization", "Bearer omx_live_xxx")
req.Header.Set("Content-Type", "application/json")
resp, err := http.DefaultClient.Do(req)
Streaming note
Set `"stream": true` and read Server-Sent Events line by line until `data: [DONE]`. Each event is OpenAI-compatible `chat.completion.chunk` JSON.
Production checklist

Operational notes

  • Authenticate with `Authorization: Bearer omx_live_xxx`.
  • Omixa handles provider keys, routing, billing, failover, and usage recording behind this endpoint.
  • Google Vertex requests use active Vertex accounts configured by the admin; Omixa retries the next healthy account when a route fails before output starts.
  • This model is routed through Vertex OpenAI-compatible chat. Omixa normalizes native reasoning chunks and provider-specific controls.
Copied Markdown