🎉 New Year Sale: Get 20% OFF on all plans — Use code NEWYEAR2026.

Upgrade now
Voice cloning thumbnail

ModelsLab/Voice Cloning

voice-cloning
The Text-to-Audio endpoint generates audio from text using either a provided audio URL or a voice_id, producing output that mimics the selected voice
API PlaygroundAPI Documentation

API Endpoint URL

Base URL for all API requests to this endpoint.

https://modelslab.com/api/v6/voice/text_to_audio

API Authentication

Authentication requires a valid API key included in the request. Generate and manage your API keys from your developer dashboard. Include the key in the key parameter for all API requests.

Integration Examples

Production-ready code samples for API integration

{
"init_audio": "https://pub-f3505056e06f40d6990886c8e14102b2.r2.dev/audio/tom_hanks_1.wav",
"prompt": "Narrative voices capable of pronouncing terminologies & acronyms in training and ai learning materials.",
"language": "english",
"base64": "false",
"speed": "1",
"stream": false,
"track_id": null,
"webhook": null,
"key": "YOUR_API_KEY"
}

SDKs

Official SDKs

Production-ready SDKs and client libraries for all major programming languages

API Parameters

Technical specifications for API request parameters.

Field NameParameterTechnical Description
Init Audioinit_audioinit_audio
PromptpromptPrompt
languagelanguagelanguage
base64base64Get response as base64 string, default: false, options: true or false
speedspeedPlayback speed of generated audio
streamstreamStream response in base64 format
track_idtrack_idID for webhook identification
webhookwebhookURL to receive POST API call when complete