Advanced Text Comprehension
Consistent Voice Style Across Languages
Mix-Language Comprehension
Multi-Speaker Voice Cloning
Multimodal Prompt Support
Music Sampling for Remixes
Real-Time System Interaction
Studio-Level Online Editing
Supported Use Cases
Video content (YouTube)PodcastsGamesShort films/TrailersAI ArtSocial MediaAudiobooksAdvertisementsLivestreams
Ultra Low Latency
under 500 ms
Voice Customization Options
200