Advanced Text Comprehension
Consistent Voice Style Across Languages
Max Audio Duration
10 hours
Max Monthly Characters
7,000,000,000
Mix-Language Comprehension
Multi-Speaker Voice Cloning
Production Ready Infrastructure
Real-Time System Interaction
Studio-Level Online Editing
Supported Bitrates
320k, 256k, 192k, 128k, 96k, 64k, 48k, 32k, 16k
Supported Language List
US English, UK English, Mandarin Chinese, Hindi, Spanish, Portuguese, Japanese, French, Italian
Text-to-Speech API
/stream, /speech, /synthesisTasks, /streamWithTimestamps
Ultra Low Latency
under 500 ms
Voice Customization Options
200
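
As a rough illustration of how the Supported Bitrates and Max Audio Duration figures relate, the sketch below estimates output file size at each listed bitrate for a 10-hour file. It assumes that "k" means kilobits per second and that encoding is constant bitrate with no container overhead; neither assumption is stated in the listing, and the function name is hypothetical.

```python
# Assumptions (not confirmed by the listing): "k" = kilobits per second
# (1k = 1000 bit/s), constant-bitrate encoding, container overhead ignored.
SUPPORTED_BITRATES_KBPS = [320, 256, 192, 128, 96, 64, 48, 32, 16]
MAX_DURATION_HOURS = 10


def estimated_size_bytes(bitrate_kbps: int, duration_seconds: float) -> int:
    """Approximate size in bytes of a constant-bitrate audio stream."""
    # bits per second * seconds = total bits; divide by 8 for bytes.
    return int(bitrate_kbps * 1000 * duration_seconds / 8)


if __name__ == "__main__":
    seconds = MAX_DURATION_HOURS * 3600
    for kbps in SUPPORTED_BITRATES_KBPS:
        size_mb = estimated_size_bytes(kbps, seconds) / 1_000_000
        print(f"{kbps:>3}k: about {size_mb:,.0f} MB for {MAX_DURATION_HOURS} hours")
```

Under these assumptions, file size scales linearly with bitrate, so halving the bitrate halves the storage and bandwidth needed for the same duration.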