Attribution Requirement
Free plan requires attribution
Cannot Model Conversation Structure
English Language Dominance
Max Characters /speech
3000
Max Characters /stream
1000
Max Characters /synthesisTasks
500000
Memory Bottleneck in Training
No Pre-trained Language Model Use
Real-Time Generation Delay
RVQ time-to-first-audio scales poorly