Audio and Video Processing
Audio Metadata Extraction
Daily Audio Processing
2500000
Energy Classification
high
Genre Classification
rock
Higher SDR Performance
15.8% higher average SDR
Instrument Types Supported
bassGuitarpercussionelectricGuitar
Large Scale Audio Processing
1000000000
Lyric & Speech Transcription
Max Audio Duration
10 hours
Max Monthly Characters
7000000000
Mood Classification
energetic
Production Ready Infrastructure
Supported Bitrates
320k256k192k128k96k64k48k32k16k
Supported Language List
US EnglishUK EnglishMandarin ChineseHindiSpanishPortugueseJapaneseFrenchItalian
Text-to-Speech API
/stream/speech/synthesisTasks/streamWithTimestamps
Translation & Localization