Cannot Model Conversation Structure
English Language Dominance
Export or Download Feature
File Formats Supported
Unknown
Limits and Quotas
Not specified
Memory Bottleneck in Training
No Pre-trained Language Model Use
Real-Time Generation Delay
Time-to-first-audio with residual vector quantization (RVQ) scales poorly
Supported Language List
Unknown (likely English only)