Cannot Model Conversation Structure
English Language Dominance
Internet Required for Voice Responses
Memory Bottleneck in Training
No Pre-trained Language Model Use
Real-Time Generation Delay: RVQ time-to-first-audio scales poorly
Version Requirement: 1.25051.10.0 or higher
Wake Word Only in English: English only
Windows Insider Availability: Insiders only