Consistent Voice Across Languages
Emotionally Expressive Speech
Max Audio Duration
10 hours
Max Monthly Characters
7000000000
Multi-language Support
EnglishFrenchGermanChineseJapaneseKorean
Production Ready Infrastructure
Real-Time Speech Adaptation
Supported Bitrates
320k256k192k128k96k64k48k32k16k
Supported Language List
US EnglishUK EnglishMandarin ChineseHindiSpanishPortugueseJapaneseFrenchItalian
Text-to-Speech API
/stream/speech/synthesisTasks/streamWithTimestamps