Whisper
whisperWhisper for speech, transcription, translation, or voice generation workflows.
Whisper for speech, transcription, translation, or voice generation workflows.
Text-to-speech, GPT audio chat, realtime sessions, transcription-oriented catalog rows, and voice settings. Endpoint: http://omixa.cloud/api/v1/audio
Whisper for speech, transcription, translation, or voice generation workflows.
Use Omixa's unified endpoint and your workspace API key. Provider routing, billing, failover, and usage records are handled by Omixa.
http://omixa.cloud/api/v1/audio
Only send options supported by this model. Required fields and accepted values are listed below.
| Field | Type | Required | Accepted values | Description |
|---|---|---|---|---|
model |
string | Yes | whisper | Use `whisper`. Omixa resolves the active provider route and failover key automatically. |
task |
string | No | auto, tts, audio_chat, realtime, transcription, translation | Audio workflow. Omixa maps it to speech, GPT audio chat, or realtime setup. |
input |
string | Yes | Any valid value | Text to speak, text prompt for audio chat, or realtime session instruction. |
voice |
string | No | alloy, ash, ballad, coral, echo, sage, shimmer, verse, marin, cedar | Voice name. Omixa maps common OpenAI voices to Google Gemini TTS voices when needed. |
response_format |
string | No | mp3, wav, opus, flac, pcm16, aac | Returned audio format. |
speed |
number | No | 0.25-4 | Speech speed for TTS routes. |
instructions |
string | No | Any valid value | Voice style or realtime/audio-chat behavior instructions. |
Start with this model-safe payload and expect the normalized Omixa response shape shown beside it.
{
"model": "whisper",
"task": "tts",
"input": "Welcome to Omixa, one API for every AI model.",
"voice": "alloy",
"response_format": "mp3",
"speed": 1
}
{
"object": "audio.speech|audio.chat|realtime.session",
"data": {
"content_type": "audio/mpeg",
"audio_base64": "<base64-audio>"
},
"usage": {
"estimated_audio_minutes": "0.100000"
}
}
Replace the example API key with a workspace key and keep model-specific fields unchanged unless the table above marks them optional.
curl -X POST http://omixa.cloud/api/v1/audio \
-H "Authorization: Bearer omx_live_xxx" \
-H "Content-Type: application/json" \
-d '{
"model": "whisper",
"task": "tts",
"input": "Welcome to Omixa, one API for every AI model.",
"voice": "alloy",
"response_format": "mp3",
"speed": 1
}'
const response = await fetch('http://omixa.cloud/api/v1/audio', {
method: 'POST',
headers: {
'Authorization': 'Bearer omx_live_xxx',
'Content-Type': 'application/json'
},
body: "{\n \"model\": \"whisper\",\n \"task\": \"tts\",\n \"input\": \"Welcome to Omixa, one API for every AI model.\",\n \"voice\": \"alloy\",\n \"response_format\": \"mp3\",\n \"speed\": 1\n}"
});
const data = await response.json();
import requests
response = requests.post(
'http://omixa.cloud/api/v1/audio',
headers={'Authorization': 'Bearer omx_live_xxx', 'Content-Type': 'application/json'},
json={
"model": "whisper",
"task": "tts",
"input": "Welcome to Omixa, one API for every AI model.",
"voice": "alloy",
"response_format": "mp3",
"speed": 1
}
)
print(response.json())
$ch = curl_init('http://omixa.cloud/api/v1/audio');
curl_setopt_array($ch, [
CURLOPT_POST => true,
CURLOPT_HTTPHEADER => ['Authorization: Bearer omx_live_xxx', 'Content-Type: application/json'],
CURLOPT_POSTFIELDS => '{
"model": "whisper",
"task": "tts",
"input": "Welcome to Omixa, one API for every AI model.",
"voice": "alloy",
"response_format": "mp3",
"speed": 1
}',
CURLOPT_RETURNTRANSFER => true,
]);
$response = curl_exec($ch);
using var client = new HttpClient();
client.DefaultRequestHeaders.Authorization = new System.Net.Http.Headers.AuthenticationHeaderValue("Bearer", "omx_live_xxx");
var json = @"{
""model"": ""whisper"",
""task"": ""tts"",
""input"": ""Welcome to Omixa, one API for every AI model."",
""voice"": ""alloy"",
""response_format"": ""mp3"",
""speed"": 1
}";
var response = await client.PostAsync("http://omixa.cloud/api/v1/audio", new StringContent(json, System.Text.Encoding.UTF8, "application/json"));
var body = await response.Content.ReadAsStringAsync();
payload := []byte(`{
"model": "whisper",
"task": "tts",
"input": "Welcome to Omixa, one API for every AI model.",
"voice": "alloy",
"response_format": "mp3",
"speed": 1
}`)
req, _ := http.NewRequest("POST", "http://omixa.cloud/api/v1/audio", bytes.NewReader(payload))
req.Header.Set("Authorization", "Bearer omx_live_xxx")
req.Header.Set("Content-Type", "application/json")
resp, err := http.DefaultClient.Do(req)