POST /v1/replicate/stt
Speech To Text
curl --request POST \
  --url 'http://localhost:9000/api/v1/replicate/stt?model=openai/whisper' \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: multipart/form-data' \
  --form 'audio=@<path-to-audio-file>' \
  --form temperature=0.0 \
  --form translate=true \
  --form 'initial_prompt=<string>' \
  --form condition_on_previous_text=true

Query

model
string
default:"openai/whisper"
required
The model to use.

Body

audio
file
required
The audio file to transcribe.
temperature
float
default:"0.0"
Temperature to use for sampling.
translate
boolean
default:"False"
Translate the text to English when set to True.
initial_prompt
string
Optional text to provide as a prompt for the first window.
condition_on_previous_text
boolean
default:"True"
If True, the model's previous output is provided as the prompt for the next window. Disabling this may make the text inconsistent across windows, but makes the model less prone to getting stuck in a failure loop.
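
The parameters above can be assembled into a request from Python as well. The following is a minimal sketch using only the standard library, not an official client. It assumes the server accepts lowercase `true`/`false` for boolean form fields (as in the curl example) and that any file content type is acceptable for `audio`. The function names `build_stt_request` and `transcribe` are hypothetical, and because the response format is not documented here, the body is returned as raw text.

```python
import os
import urllib.parse
import urllib.request
import uuid

# Endpoint taken from the curl example; adjust host and port for your deployment.
BASE_URL = "http://localhost:9000/api/v1/replicate/stt"


def build_stt_request(audio_bytes, filename, token, model="openai/whisper",
                      temperature=0.0, translate=False, initial_prompt=None,
                      condition_on_previous_text=True):
    """Build (but do not send) a multipart POST with the documented fields."""
    # Assumption: booleans are sent as lowercase strings, as in the curl example.
    fields = {
        "temperature": str(temperature),
        "translate": str(translate).lower(),
        "condition_on_previous_text": str(condition_on_previous_text).lower(),
    }
    if initial_prompt is not None:
        fields["initial_prompt"] = initial_prompt

    boundary = uuid.uuid4().hex
    parts = []
    for name, value in fields.items():
        parts.append(
            (f"--{boundary}\r\n"
             f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
             f"{value}\r\n").encode("utf-8")
        )
    # The required audio file part; the content type is a generic assumption.
    parts.append(
        (f"--{boundary}\r\n"
         f'Content-Disposition: form-data; name="audio"; filename="{filename}"\r\n'
         "Content-Type: application/octet-stream\r\n\r\n").encode("utf-8")
        + audio_bytes + b"\r\n"
    )
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))

    # `model` is a query parameter, not a form field.
    url = BASE_URL + "?" + urllib.parse.urlencode({"model": model})
    return urllib.request.Request(
        url,
        data=b"".join(parts),
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )


def transcribe(path, token, **options):
    """Send the audio file at `path` and return the raw response body."""
    with open(path, "rb") as f:
        request = build_stt_request(
            f.read(), os.path.basename(path), token, **options
        )
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")
```

A call such as `transcribe("clip.wav", "<token>", translate=True, condition_on_previous_text=False)` would request an English translation while trading cross-window consistency for robustness against failure loops. Building the request separately from sending it makes it possible to inspect the URL, headers, and body without a running server.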