Cannot Model Conversation Structure
English Language Dominance
Languages Supported
Not specified
Memory Bottleneck in Training
Monthly Transcription Limit
2 hours (Basic)8 hours (Starter)24 hours (Pro)
No Pre-trained Language Model Use
Real-Time Generation Delay
RVQ time-to-first-audio scales poorly