Vatis Speech-to-Text vs OpenAI Realtime API

Comparing the features of Vatis Speech-to-Text to OpenAI Realtime API

Feature
Vatis Speech-to-Text
OpenAI Realtime API

Capability Features

Audio and Video File Support
Content Repurposing
Custom AI Prompts
Custom Vocabulary Support
Dedicated Support
Drag and Drop Upload
Enterprise Grade Security
GDPR compliantISO 27001 certifiedSOC 2 Type II in progress
Enterprise Privacy Commitment
Enterprise SLAs
Expanded Model Support Planned
File Format Support
Five New Voices
5
Function Calling
GDPR Compliance
High Accuracy Model
Up to 99%
Human and Automated Safety Monitoring
Interruption Handling
Interview to Article
ISO 27001 Certification
Language Code-Switch
Multi-Language Support
No Training on Data Without Permission
Playground Access
Private Cloud Deployment
Prompt Caching Planned
Public Beta
Real-Time Insights
Real-Time Transcription
<700ms
Reference Client Available
Six Preset Voices
6
SOC 2 Type II In Progress
Speaker Separation
Speech-to-Speech
Streaming Audio Inputs/Outputs
Summarization and Sentiment Analysis
Supports Text and Audio Inputs
TextAudio
Transcription Speed
1 hour audio in 10 min
Ultra Low Latency
Unlimited Concurrency
Use-Case Coverage
Creators and MarketersJournalists and EditorsSales and MeetingsDevelopers and BuildersResearchers and AcademicsMedia Monitoring and Broadcasting
WebSocket Connection

Integration Features

Agora Integration
Audio Intelligence
Chat Completions API Integration
Integrations
APIReal-Time APIAudio Intelligence API
LiveKit Integration
OpenAI Node.js SDK Planned
OpenAI Python SDK Planned
Real-Time Speech-to-Text API
Speech to Text
Supports GPT-4o
gpt-4o-realtime-preview
Twilio Voice API Integration

Limitation Features

AI Disclosure Requirement
Audio Only Modality (Initially)
Lower Session Limits Tiers 1-4
Lower than 100
No Credit Card Required
No Sign-Up Required
No Simultaneous Session Limit Anymore
Simultaneous Sessions Limit Tier 5
100
Usage Policy Restriction

Pricing Features

Approximate Audio Input Price
$0.06/minute
Approximate Audio Output Price
$0.24/minute
Free Trial Package
No Free Tier
Pricing Audio Input
$100/1M tokens
Pricing Audio Output
$200/1M tokens
Pricing Cached Audio Input
$20/1M tokens
Pricing Cached Text Input
$2.50/1M tokens
Pricing Text Input
$5/1M tokens
Pricing Text Output
$20/1M tokens
Volume Discounts