Cannot Model Conversation Structure
English Language Dominance
Memory Bottleneck in Training
No Mention of Export Features
No Mention of Pricing Plans
No Pre-trained Language Model Use
Real-Time Generation Delay
RVQ Time-to-First-Audio Scales Poorly
Sign-in Required for In-Depth