AI Model Choice
OpenAI WhisperOpenAI GPTsGoogle GeminiAnthropic ClaudeMeta LlamaxAI Grok
Content Creation
Painpoints and SolutionsMindmapsAction ItemsQuiz7 Key ThemesBlog PostTopicsLinkedIn Post
Editing Tools
Find & ReplaceSpeaker AssignmentRich Text FormatsHighlighting
Emotion Tags
normalslowcryingsleepysighchuckle
Guided Emotion and Intonation
Input Streaming for Lower Latency
LLM-based Customizability
Model Tokenizer Type
Non-streaming (CNN-based) tokenizer
Open Source Release Planned
Orpheus Speech Models
Medium (3B)Small (1B)Tiny (400M)Nano (150M)
Pretrained and Finetuned Models
Pretrained modelsFinetuned models
Priority Processing
High Priority for paid plans
Privacy Policy
No data used for AI training
Sample Finetuning Scripts
Sliding Window Detokenizer
Streaming Inference Speed
Faster than playback on A100 40GB for 3B model
Training Data Volume
100k+ hours of speech, billions of text tokens
Unlimited Simultaneous Uploads
5