Cannot Model Conversation Structure
Coming Soon Features
Video VoiceoversVideo Localization
English Language Dominance
Memory Bottleneck in Training
No Pre-trained Language Model Use
Real-Time Generation Delay
RVQ time-to-first-audio scales poorly
Strict Data Deletion Policies
24 hours