AI Native Call Center Stack
Automatic Call Recording & Chat Display
Emotion Tags
normalslowcryingsleepysighchuckle
Enterprise-Grade Security
Guided Emotion and Intonation
Handle High Volume
Thousands of calls
Input Streaming for Lower Latency
LLM-based Customizability
Model Tokenizer Type
Non-streaming (CNN-based) tokenizer
Multilingual Support
Over 20 languages
Omnichannel Support
TextVoiceDigital Messaging
Open Source Release Planned
Orpheus Speech Models
Medium (3B)Small (1B)Tiny (400M)Nano (150M)
Pretrained and Finetuned Models
Pretrained modelsFinetuned models
Sample Finetuning Scripts
Sliding Window Detokenizer
Streaming Inference Speed
Faster than playback on A100 40GB for 3B model
Tier 1-2 Support Automation
Training Data Volume
100k+ hours of speech, billions of text tokens