All Voices Download (Basic+)
Basic Plan Audio Quality
High quality (44.1kHz)
Basic Plan Video Quality
Full HD (1080p)
Business Plan Voice Cloning Slots
2
Conversational Speech Generation
Dataset Size
1 million hours
Free Tier Video Download Quality
HD (720p)
HD Audio Downloads (Basic+)
44.1kHz
High Quality Speech Synthesis
Language Support (Flagship)
English, Spanish, Korean, Japanese, Chinese, Vietnamese
Model Sizes
Tiny: 1B backbone, 100M decoder; Small: 3B backbone, 250M decoder; Medium: 8B backbone, 300M decoder
Multiple Speaker Handling
Natural Variation in Speech
Objective Metrics
Word Error Rate, Speaker Similarity, Homograph Disambiguation, Pronunciation Consistency
Partial Multilingual Support Planned
Planned for 20+ languages
Pro Plan Video Quality
Ultra HD (4K)
Pro Plan Voice Cloning Slots
1
Subjective Metrics
Comparative Mean Opinion Score
Unlimited Voice Generation
Voice Customization Options
Voice Types
Kid, TikTok, Audiobooks, Voicemails, Anime, Rapper, News, Announcer, Podcasts, Ads, Video game, Cartoon, Narration
Watermark-Free Downloads (Basic+)