Query
The voice model to be used;
Body
Text to synthesize
Original speaker audio (wav, mp3, m4a, ogg, or flv)
Output language for the synthesised speech
Whether to apply denoising to the speaker audio (microphone recordings)
Text to speech using the lucataco/xtts-v2 AI Model.