Cannot Model Conversation Structure
English Language Dominance
Memory Bottleneck in Training
No Built-in Telephony Provider Mentioned
No Explicit Pricing Details
No Explicit Usage Quotas Listed
No File Format Support Listed
No Pre-trained Language Model Use
Real-Time Generation Delay
RVQ Time-to-First-Audio Scales Poorly