Cannot Model Conversation Structure
English Language Dominance
Free Plan Limitations
Basic integrations, 1 agent, 20 MB storage
Memory Bottleneck in Training
No Pre-trained Language Model Use
Pro Plan Minimum for Phone Connection
Real-Time Generation Delay
RVQ time-to-first-audio scales poorly