Canopy Labs vs Voice Dictation

Comparing the features of Canopy Labs to Voice Dictation

Feature
Canopy Labs
Voice Dictation

Capability Features

Demo Availability
Emotion Tags
normalslowcryingsleepysighchuckle
Guided Emotion and Intonation
Handles Disfluencies
Input Streaming for Lower Latency
Llama Architecture
Llama
LLM-based Customizability
Local Storage Only
Model Tokenizer Type
Non-streaming (CNN-based) tokenizer
Open Source Release Planned
Orpheus Speech Models
Medium (3B)Small (1B)Tiny (400M)Nano (150M)
Pretrained and Finetuned Models
Pretrained modelsFinetuned models
Real-time Transcription
Realtime Streaming
Sample Finetuning Scripts
Sliding Window Detokenizer
Speech to Text
Streaming Inference Speed
Faster than playback on A100 40GB for 3B model
Supported Language List
AfrikaansBahasa IndonesiaBahasa MelayuCatalàČeštinaDanskDeutschEnglishEspañolEuskaraFilipinoFrançaisGalegohrvatskiIsizuluÍslenskaItalianoLietuviųMagyarNederlandsNorsk (Bokmål)PolskiPortuguêsRomânăSlovenčinaSlovenščinaSuomiSvenskaTiếng ViệtTürkçeΕλληνικάБългарскиРусскийСрпскиУкраїнськаעבריתالعربيةفارسیहिन्दीاُردُوአማርኛAzərbaycancaবাংলাગુજરાતીಕನ್ನಡភាសាខ្មែរLatviešuമലയാളംमराठीລາວनेपाली भाषाසිංහලBasa SundaతెలుగుKiswahiliქართულიՀայերենதமிழ்ไทยசிங்கப்பூர்中文(中国)中文(台灣)中文(香港)日本語한국어
Text to Speech
Training Data Volume
100k+ hours of speech, billions of text tokens
Voice Commands
Zero-Shot Voice Cloning

Integration Features

Baseten 1-Click Deployment
GitHub Repository Access
Google Colab Notebook
Google Speech Recognition
Hugging Face Model Access
LLama Ecosystem Support
Python Package for Streaming
Supported Platforms
Google ChromeWindowsMacLinux

Limitation Features

Browser Compatibility
Google Chrome only
English Language Only
No API Access
No API Mentioned
No Explicit Pricing Details
No Export Formats Listed
No Mention of File Format Support
No Mobile App
No Team Collaboration
Requires Internet Connection

Pricing Features

Free Tier