API-Kosten im Dispatcher-System — verifiziert 2026-05-26
| Modell | Input $/MTok | Output $/MTok | Notes |
|---|---|---|---|
| claude-opus-4-7 | $5.00 | $25.00 | — |
| claude-sonnet-4-6 | $3.00 | $15.00 | Flaneur primär |
| claude-haiku-4-5 | $1.00 | $5.00 | Pipecat Voice |
| Batch (alle) | 50% Rabatt | 50% Rabatt | async |
| Cache Write | 1.25× Input | — | Prompt-Cache |
| Cache Hit | 0.10× Input | — | Prompt-Cache |
| Modell | Input | Cached | Output |
|---|---|---|---|
| gpt-4o | $2.50/MTok | $1.25/MTok | $10.00/MTok |
| gpt-4o-mini | $0.15/MTok | — | $0.60/MTok |
| o1 | $15.00/MTok | — | $60.00/MTok |
| Speech-to-Text | Preis | Notes | |
|---|---|---|---|
| gpt-4o-transcribe | $0.006/Min | STT Standard | |
| gpt-realtime-whisper | $0.017/Min | Realtime-Modus | |
| Realtime Audio | Preis | ~Min-Äquivalent | |
|---|---|---|---|
| Audio Input | $32.00/MTok | ~$0.06/Min | |
| Audio Output | $64.00/MTok | ~$0.24/Min | |
| Service | Modell | Preis |
|---|---|---|
| TTS Standard | multilingual_v2 / v3 | $0.10/1.000 Zeichen |
| TTS Flash/Turbo | Flash / Turbo | $0.05/1.000 Zeichen |
| TTS Turbo v2.5 | turbo_v2_5 | $0.05/1.000 Zeichen |
| ConvAI | — | $0.08/Minute |
| STT Scribe | Scribe v1 / v2 | $0.22/Stunde |
| Service | Preis | Status |
|---|---|---|
| nova-2 / nova-3 STT | — | ⚠️ Preis nicht verifiziert |
| Streaming (WebSocket) | — | ⚠️ Preis nicht verifiziert |
| Modell | Input | Output | Notes |
|---|---|---|---|
| gemini-2.5-pro | $1.25 (≤200k) $2.50 (>200k) |
$10.00 (≤200k) $15.00 (>200k) |
Rat primär |
| gemini-2.5-flash | $0.30/MTok | $2.50/MTok | Healthcheck |
| gemini-2.0-flash | $0.10/MTok | $0.40/MTok | ⚠️ deprecated 01.06.2026 |
| Batch/Flex (alle) | 50% günstiger | 50% günstiger | async |
| Modell | Input $/MTok | Output $/MTok | Per-Request (medium) |
|---|---|---|---|
| sonar | $1.00 | $1.00 | $8.00/1k req |
| sonar-pro | $3.00 | $15.00 | $10.00/1k req |
| sonar-deep-research | $2.00 | $8.00 | $5.00/1k req |