Sesame vs Voice In

Comparing the features of Sesame to Voice In

Feature
Sesame
Voice In

Capability Features

Advanced Mode Sites Limit
1000+ sites
Automatic Formatting
Change Text Case
Consistent Personality
Context Awareness
Conversational Dynamics
Conversational Speech Generation
Custom Voice Commands
Dataset Size
1 million hours
Dictation Across Tabs
Dictation Box
Emotional Intelligence
Evaluation Suite
GDPR Compliance
Language Switching Shortcuts
Minimal Permissions
Model Sizes
Tiny: 1B backbone, 100M decoderSmall: 3B backbone, 250M decoderMedium: 8B backbone, 300M decoder
Multiple Speaker Handling
No Extra Hardware Needed
Objective Metrics
Word Error RateSpeaker SimilarityHomograph DisambiguationPronunciation Consistency
Page Pop-up Placement
Partial Multilingual Support Planned
Planned for 20+ languages
Popular Languages
EnglishJapaneseGermanPortugueseSpanishHebrewFrenchHindiArabicPolish
Premium Support
Privacy Protection
Pronunciation Correction
Sequence Length
2048
Single-Stage Model
Subjective Metrics
Comparative Mean Opinion Score
Supported Language List
100+ languages
Text and Audio Input
TextAudio
Training Epochs
5
Voice Commands
Works on Editable Textboxes

Integration Features

GitHub Release
LLama Architecture Backbone
Mimi Split-RVQ Tokenizer
Popular Site Integrations
GmailOutlookGoogle DocsFacebookLinkedInRedditSlackChatGPTWhatsAppSalesforce
Supported Platforms
WindowsMacLinuxChromebook
Website Compatibility
Over 10,000 websites
Works On Chrome

Limitation Features

Cannot Model Conversation Structure
English Language Dominance
Memory Bottleneck in Training
No Desktop App
No Free Trial
No Offline Mode
No Pre-trained Language Model Use
Real-Time Generation Delay
RVQ time-to-first-audio scales poorly
Web App Only

Pricing Features

Free Preview
Free Tier
Open Source
Apache 2.0
Paid Plans
Plus Plan Price
$60 per year