Business Use Case Support
Customer supportSalesTrainingExecutive assistanceAI podcast hosts
Emotion Tags
normalslowcryingsleepysighchuckle
Feedback-Based Improvement
Guided Emotion and Intonation
Input Streaming for Lower Latency
LLM-based Customizability
Model Tokenizer Type
Non-streaming (CNN-based) tokenizer
Multiple AI Personas
Sophie (Therapist)Alex (Daily planner)Michelle (Parenting coach)Marvin (Tech support)Page (Idea partner)Ryan (Journaling pal)Custom AI bot
Open Source Release Planned
Orpheus Speech Models
Medium (3B)Small (1B)Tiny (400M)Nano (150M)
Personal Memory (In Development)
Pretrained and Finetuned Models
Pretrained modelsFinetuned models
Sample Finetuning Scripts
Sliding Window Detokenizer
Streaming Inference Speed
Faster than playback on A100 40GB for 3B model
Training Data Volume
100k+ hours of speech, billions of text tokens