Canopy Labs vs Voiser

Comparing the features of Canopy Labs to Voiser

Feature
Canopy Labs
Voiser

Capability Features

AR/VR Support
Automatic Punctuation
Available Voices
550
Batch Processing
Country Coverage
200
Demo Availability
Dialects Supported
135
Emotion Tags
normalslowcryingsleepysighchuckle
Emotion Voice Options
Export Formats
WordExcelTxtSrt
Guided Emotion and Intonation
Handles Disfluencies
Input Streaming for Lower Latency
Languages Supported
75
Llama Architecture
Llama
LLM-based Customizability
Minimum Accuracy Claim
%99.9 success rate
Model Tokenizer Type
Non-streaming (CNN-based) tokenizer
Online Dictation
Open Source Release Planned
Orpheus Speech Models
Medium (3B)Small (1B)Tiny (400M)Nano (150M)
Pretrained and Finetuned Models
Pretrained modelsFinetuned models
Profanity Filtering
Realtime Streaming
Sample Finetuning Scripts
Sliding Window Detokenizer
Smart Guide
Speaker Identification
Speech-to-Text
Streaming Inference Speed
Faster than playback on A100 40GB for 3B model
Subtitle Customization
Talking Avatar
Text to Speech
Text-to-Video
Training Data Volume
100k+ hours of speech, billions of text tokens
Voice Cloning
Voice Quality Levels
HDHQUHD
YouTube Dubbing
Zero-Shot Voice Cloning

Integration Features

API Access
Baseten 1-Click Deployment
ChatGPT Integration
Email Login
Facebook Login
File Format Support (Audio)
.mp3.wav.flac.aac.wma.ogg.aiff
File Format Support (Video)
.avi.mp4.mov.webm.mpeg.3gp
GitHub Repository Access
Google Colab Notebook
Google Login
Hugging Face Model Access
LLama Ecosystem Support
Python Package for Streaming
URL Import Support
Wordpress Integration
YouTube Import

Limitation Features

English Language Only
Maximum Usage Without Payment
50 characters for TTS, 5 minutes for STT
No API Mentioned
No Explicit Pricing Details
No Mention of File Format Support
Premium Voices (Enterprise)
Studio Free Limit
50
Transcription Limit Free Tier
5

Pricing Features

Free Tier
Quota Extension Via Purchase