Whisper Notes vs Sesame

Comparing the features of Whisper Notes to Sesame

Feature
Whisper Notes
Sesame

Capability Features

Audio File Import
Consistent Personality
Context Awareness
Conversational Dynamics
Conversational Speech Generation
Custom Keyboard Shortcuts
Dataset Size
1 million hours
Emotional Intelligence
Evaluation Suite
Export SRT Subtitles
Export with Timestamps
Local Processing
Manual Re-transcribe Option
Model Sizes
Tiny: 1B backbone, 100M decoderSmall: 3B backbone, 250M decoderMedium: 8B backbone, 300M decoder
Multilingual Support
EnglishChineseSpanishGermanFrenchJapaneseKoreanRussianArabicHindiPortugueseItalianDutchTurkishIndonesianVietnameseThaiCzechPolishUkrainianSwedish
Multiple Speaker Handling
No Cloud Upload
No Internet Required
No Recording Length Limit
Objective Metrics
Word Error RateSpeaker SimilarityHomograph DisambiguationPronunciation Consistency
Offline Transcription
Partial Multilingual Support Planned
Planned for 20+ languages
Pronunciation Correction
Refund Available
Sequence Length
2048
Share Transcripts
Single-Stage Model
Subjective Metrics
Comparative Mean Opinion Score
Text and Audio Input
TextAudio
Training Epochs
5
Unlimited Transcriptions
Whisper Large V3 Turbo Model
Whisper Large V3 Turbo

Integration Features

Apple Silicon Support
File Formats Supported
MP3M4AWAV
GitHub Release
Intel Mac Support
LLama Architecture Backbone
Mimi Split-RVQ Tokenizer
Works on iOS
Works on macOS

Limitation Features

Cannot Model Conversation Structure
Device Requirements
Not supported on iPhone SE 2nd gen; 8GB+ RAM recommended for macOS
English Language Dominance
Memory Bottleneck in Training
No AI Summaries
No Background Transcription on iOS
No Free Trial
No Pre-trained Language Model Use
No Real-time Transcription
Real-Time Generation Delay
RVQ time-to-first-audio scales poorly
Slower on Old Devices

Pricing Features

All Future Updates
Free Preview
Lifetime Access
No Ads
No In-App Purchases
No Subscription Required
One-Time Payment
$4.99
Open Source
Apache 2.0