Whisper Notes vs OpenAI Realtime API

Comparing the features of Whisper Notes to OpenAI Realtime API

Feature
Whisper Notes
OpenAI Realtime API

Capability Features

Audio File Import
Custom Keyboard Shortcuts
Enterprise Privacy Commitment
Expanded Model Support Planned
Export SRT Subtitles
Export with Timestamps
Five New Voices
5
Function Calling
Human and Automated Safety Monitoring
Interruption Handling
Local Processing
Manual Re-transcribe Option
Multilingual Support
EnglishChineseSpanishGermanFrenchJapaneseKoreanRussianArabicHindiPortugueseItalianDutchTurkishIndonesianVietnameseThaiCzechPolishUkrainianSwedish
No Cloud Upload
No Internet Required
No Recording Length Limit
No Training on Data Without Permission
Offline Transcription
Playground Access
Prompt Caching Planned
Public Beta
Reference Client Available
Refund Available
Share Transcripts
Six Preset Voices
6
Speech-to-Speech
Streaming Audio Inputs/Outputs
Supports Text and Audio Inputs
TextAudio
Ultra Low Latency
Unlimited Transcriptions
WebSocket Connection
Whisper Large V3 Turbo Model
Whisper Large V3 Turbo

Integration Features

Agora Integration
Apple Silicon Support
Chat Completions API Integration
File Formats Supported
MP3M4AWAV
Intel Mac Support
LiveKit Integration
OpenAI Node.js SDK Planned
OpenAI Python SDK Planned
Supports GPT-4o
gpt-4o-realtime-preview
Twilio Voice API Integration
Works on iOS
Works on macOS

Limitation Features

AI Disclosure Requirement
Audio Only Modality (Initially)
Device Requirements
Not supported on iPhone SE 2nd gen; 8GB+ RAM recommended for macOS
Lower Session Limits Tiers 1-4
Lower than 100
No AI Summaries
No Background Transcription on iOS
No Free Trial
No Real-time Transcription
No Simultaneous Session Limit Anymore
Simultaneous Sessions Limit Tier 5
100
Slower on Old Devices
Usage Policy Restriction

Pricing Features

All Future Updates
Approximate Audio Input Price
$0.06/minute
Approximate Audio Output Price
$0.24/minute
Lifetime Access
No Ads
No Free Tier
No In-App Purchases
No Subscription Required
One-Time Payment
$4.99
Pricing Audio Input
$100/1M tokens
Pricing Audio Output
$200/1M tokens
Pricing Cached Audio Input
$20/1M tokens
Pricing Cached Text Input
$2.50/1M tokens
Pricing Text Input
$5/1M tokens
Pricing Text Output
$20/1M tokens