Speech To Text
Converts speech to text using the turian/insanely-fast-whisper-with-video AI model.
POST /v1/replicate/stt
Query
model
string
default: "turian/insanely-fast-whisper-with-video"
required
The model to be used.
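As a rough illustration of how the model query parameter attaches to the endpoint path, the sketch below builds the request URL with the Python standard library. The host `https://api.example.com` is a placeholder assumption, since this page gives only the path; the helper name `build_stt_url` is likewise hypothetical.

```python
from urllib.parse import urlencode

# Assumption: placeholder host. This page documents only the path.
BASE_URL = "https://api.example.com"
STT_PATH = "/v1/replicate/stt"
DEFAULT_MODEL = "turian/insanely-fast-whisper-with-video"


def build_stt_url(model: str = DEFAULT_MODEL, base_url: str = BASE_URL) -> str:
    """Return the endpoint URL with `model` passed as a query parameter."""
    # urlencode percent-encodes the "/" inside the model name as %2F.
    query = urlencode({"model": model})
    return f"{base_url.rstrip('/')}{STT_PATH}?{query}"


if __name__ == "__main__":
    print(build_stt_url())
```

Percent-encoding the model name keeps the slash in it from being read as a path separator by intermediaries that inspect the query string.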
Body
audio
file
required
Audio file. Either this or url must be provided.
url
string
Video URL for yt-dlp to download the audio from. Either this or audio must be provided.
task
string
default: "transcribe"
Task to perform: transcribe, or translate to another language.
batch_size
int
default: 64
Number of parallel batches to compute. Reduce this if you run into out-of-memory (OOM) errors.
timestamp
string
default: "chunk"
Timestamp granularity. Whisper supports both chunk-level and word-level timestamps.
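The body parameters above can be checked on the client side before a request is sent. The following is a minimal sketch, not the service's own validation: the function name `build_stt_fields` is hypothetical, the accepted timestamp value "word" is inferred from the description rather than stated on this page, and the "either audio or url" rule is read as "at least one of the two".

```python
from typing import Dict, Optional

VALID_TASKS = {"transcribe", "translate"}
# Assumption: "word" is inferred from the parameter description.
VALID_TIMESTAMPS = {"chunk", "word"}


def build_stt_fields(
    audio_path: Optional[str] = None,
    url: Optional[str] = None,
    task: str = "transcribe",
    batch_size: int = 64,
    timestamp: str = "chunk",
) -> Dict[str, str]:
    """Validate the body parameters and return the non-file form fields."""
    if audio_path is None and url is None:
        raise ValueError("provide either an audio file or a url")
    if task not in VALID_TASKS:
        raise ValueError(f"task must be one of {sorted(VALID_TASKS)}")
    if timestamp not in VALID_TIMESTAMPS:
        raise ValueError(f"timestamp must be one of {sorted(VALID_TIMESTAMPS)}")
    if batch_size < 1:
        raise ValueError("batch_size must be a positive integer")

    # Form fields are sent as strings; the audio file itself is attached separately.
    fields = {"task": task, "batch_size": str(batch_size), "timestamp": timestamp}
    if url is not None:
        fields["url"] = url
    return fields


if __name__ == "__main__":
    print(build_stt_fields(url="https://example.com/video.mp4", batch_size=16))
```

The returned dictionary would typically be sent as form data, with the audio file (when used) attached as a multipart upload under the field name audio. Multipart encoding is an assumption based on the file type of that parameter, and any authentication the service requires is not covered here.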