Query
model
string
default:"lucataco/xtts-v2"
required
The voice model to be used;
Body
Original speaker audio (wav, mp3, m4a, ogg, or flv)
Output language for the synthesised speech
Whether to apply denoising to the speaker audio (microphone recordings)