OpenAI Realtime API vs Text Reader

Comparing the features of OpenAI Realtime API to Text Reader

Feature
OpenAI Realtime API
Text Reader

Capability Features

Accessibility Features
Advanced AI Models
AI WaveNet Voices
Commercial Use Allowed
Continuous AI Improvement
Download as MP3
Educational Use
Enterprise Privacy Commitment
Expanded Model Support Planned
Five New Voices
5
Function Calling
Gender Selection
MaleFemale
Human and Automated Safety Monitoring
Instant Generation
Interruption Handling
Language and Accent Variety
50+ languages and variants
Manual Input Supported
Multi-language Support
AfrikaansArabicBengaliBulgarianCatalanChineseCzechDanishDutchEnglish (Australia)English (United Kingdom)English (United States)FilipinoFinnishFrench (Canada)French (France)GermanGujaratiHindiHungarianIcelandicIndonesianItalianJapaneseKannadaKoreanLatvianMalayalamMandarinNorwegianPolishPortuguese (Brazil)Portuguese (Portugal)RomanianRussianSerbianSlovakSpanish (Spain)Spanish (United States)SwedishTamilTeluguThaiTurkishUkrainianVietnamese
No Training on Data Without Permission
Playground Access
Prompt Caching Planned
Public Beta
Reference Client Available
Six Preset Voices
6
Speech-to-Speech
Streaming Audio Inputs/Outputs
Supports Text and Audio Inputs
TextAudio
Ultra Low Latency
Unlimited Downloads
User-Friendly Controls
Voice Options
WebSocket Connection

Integration Features

Agora Integration
API Integrations
Chat Completions API Integration
LiveKit Integration
OpenAI Node.js SDK Planned
OpenAI Python SDK Planned
Supported Output Formats
MP3
Supports GPT-4o
gpt-4o-realtime-preview
Twilio Voice API Integration
Upload TXT Files

Limitation Features

AI Disclosure Requirement
Audio Only Modality (Initially)
Export Format Limitation
.txt
Free Tier Character Limit
1000
Lower Session Limits Tiers 1-4
Lower than 100
No Mention of API
No Simultaneous Session Limit Anymore
Simultaneous Sessions Limit Tier 5
100
Usage Policy Restriction

Pricing Features

Approximate Audio Input Price
$0.06/minute
Approximate Audio Output Price
$0.24/minute
Free Tier
Monthly Voice Generation Limit
3 hrs/month
No Free Tier
Premium Plan Monthly
$18/mo
Pricing Audio Input
$100/1M tokens
Pricing Audio Output
$200/1M tokens
Pricing Cached Audio Input
$20/1M tokens
Pricing Cached Text Input
$2.50/1M tokens
Pricing Text Input
$5/1M tokens
Pricing Text Output
$20/1M tokens
Pro Annual Plan
$15/mo (billed annually)
Voice Generation Limit (Annual)
36 hrs/year