API or Plugin Integration
Cannot Model Conversation Structure
English Language Dominance
File Formats Supported
Not specified
Memory Bottleneck in Training
Minimum OS Requirements
MacOS Ventura 13.1 or higher
No Pre-trained Language Model Use
Platform Limitation
MacOS only
Real-Time Generation Delay
RVQ time-to-first-audio scales poorly