Microsoft Copilot for Windows vs OpenAI Realtime API

Comparing the features of Microsoft Copilot for Windows to OpenAI Realtime API

Feature
Microsoft Copilot for Windows
OpenAI Realtime API

Capability Features

Automatic Conversation End
Chime Notification
Copilot Voice Floating UI
Copilot Wake Word
Enterprise Privacy Commitment
Expanded Model Support Planned
Feedback Mechanism
Five New Voices
5
Function Calling
Hands-Free Voice Activation
Human and Automated Safety Monitoring
Interruption Handling
Manual Conversation End
Mic Usage Indicator
No Training on Data Without Permission
On-Device Wake Word Detection
Playground Access
Prompt Caching Planned
Public Beta
Reference Client Available
Six Preset Voices
6
Speech-to-Speech
Streaming Audio Inputs/Outputs
Supports Text and Audio Inputs
TextAudio
Ultra Low Latency
WebSocket Connection

Integration Features

Agora Integration
Chat Completions API Integration
LiveKit Integration
Microsoft Store Update
OpenAI Node.js SDK Planned
OpenAI Python SDK Planned
Supports GPT-4o
gpt-4o-realtime-preview
Twilio Voice API Integration

Limitation Features

AI Disclosure Requirement
Audio Only Modality (Initially)
Feature Not Default
Internet Required for Voice Responses
Lower Session Limits Tiers 1-4
Lower than 100
No Local Audio Storage
No Simultaneous Session Limit Anymore
Requires PC Unlocked
Simultaneous Sessions Limit Tier 5
100
Usage Policy Restriction
Version Requirement
1.25051.10.0 or higher
Wake Word Only in English
English only
Windows Insider Availability
Insiders only

Pricing Features

Approximate Audio Input Price
$0.06/minute
Approximate Audio Output Price
$0.24/minute
No Free Tier
Pricing Audio Input
$100/1M tokens
Pricing Audio Output
$200/1M tokens
Pricing Cached Audio Input
$20/1M tokens
Pricing Cached Text Input
$2.50/1M tokens
Pricing Text Input
$5/1M tokens
Pricing Text Output
$20/1M tokens