Deploy Dedicated GPU server to run AI models

Deploy Model
Skip to main content
Imagen

Voice Cloning API - Clone Any Voice

Clone any voice with ModelsLab's voice cloning API. Create realistic AI voice replicas from short audio samples and generate speech in the cloned voice programmatically.

What Is the ModelsLab Voice Cloning API?

Clone Any Voice from a Short Audio Sample

The ModelsLab Voice Cloning API lets developers create realistic AI voice replicas from as little as 10 seconds of audio. The API analyzes vocal characteristics — pitch, tone, accent, rhythm, and timbre — and creates a reusable voice model that can generate speech in the cloned voice from any text input.

Voice cloning works in two modes: instant cloning (10-60 seconds of audio, ready in seconds) and deep training (longer samples, higher fidelity). Both modes produce voice models that can be reused indefinitely across text-to-speech requests without re-uploading the original audio.

  • Clone from as little as 10 seconds of audio (WAV, MP3, OGG)
  • Instant cloning ready in seconds, deep training for higher fidelity
  • Generate speech in 40+ languages while preserving voice characteristics
  • Emotional control — adjust tone, pace, emphasis, and warmth
  • Broadcast-quality output at 24kHz and 48kHz sample rates
  • Pay-per-character pricing starting at $0.006 per second of audio

How Does ModelsLab Compare to ElevenLabs and Other Voice APIs?

ModelsLab offers voice cloning as part of a larger AI API platform that includes image generation, video creation, and LLM access. This means developers building multimodal applications can use a single API key for voice, image, and text. Compared to ElevenLabs, ModelsLab provides more flexible pricing — pay-per-character with no monthly minimums — and includes access to hundreds of pre-built voices alongside custom cloning.

The voice cloning API supports 40+ languages including English, Spanish, French, German, Hindi, Japanese, Chinese, and Arabic. Cloned voices maintain their unique characteristics when generating speech in different languages, enabling global content localization without multiple voice actors.

When Should You Use Voice Cloning?

Voice cloning is suited for audiobook production, video dubbing and localization, e-learning course narration, podcast creation, virtual assistant branding, and accessibility applications. The API handles both real-time synthesis (for interactive applications) and batch processing (for bulk content generation).

Trusted by

Google logo
Salesforce logo
Amazon logo
IBM logo
Adobe logo
Sony logo
Google logo
Salesforce logo
Amazon logo
IBM logo
Adobe logo
Sony logo
Google logo
Salesforce logo
Amazon logo
IBM logo
Adobe logo
Sony logo
Google logo
Salesforce logo
Amazon logo
IBM logo
Adobe logo
Sony logo
1B+

Images Processed Monthly

500K+

Active Developers

5K+

Discord Community Members

300+

Available AI APIs

How Do You Clone a Voice via API?

Clone a voice and generate speech in three API calls. Upload audio, create a voice model, then synthesize speech.

cURL — Clone a Voice

bash
1curl -X POST https://modelslab.com/api/v6/voice/clone \
2 -H "Content-Type: application/json" \
3 -d '{
4 "key": "your_api_key",
5 "init_audio": "https://example.com/sample-voice.wav",
6 "language": "english",
7 "title": "my-custom-voice"
8 }'

Python — Generate Speech with Cloned Voice

python
1import requests
2
3# Step 1: Clone a voice
4clone_response = requests.post(
5 "https://modelslab.com/api/v6/voice/clone",
6 json={
7 "key": "your_api_key",
8 "init_audio": "https://example.com/sample.wav",
9 "language": "english",
10 "title": "narrator-voice"
11 }
12)
13voice_id = clone_response.json()["voice_id"]
14
15# Step 2: Generate speech in the cloned voice
16speech_response = requests.post(
17 "https://modelslab.com/api/v6/voice/text_to_speech",
18 json={
19 "key": "your_api_key",
20 "voice_id": voice_id,
21 "text": "Welcome to our platform. Let me show you around.",
22 "language": "english",
23 "speed": 1.0,
24 "emotion": "friendly"
25 }
26)
27audio_url = speech_response.json()["output"][0]
28print(f"Audio: {audio_url}")

JavaScript — Multi-Language Voice Generation

javascript
1// Generate speech in multiple languages with the same cloned voice
2const languages = ["english", "spanish", "french", "german"];
3
4for (const lang of languages) {
5 const response = await fetch("https://modelslab.com/api/v6/voice/text_to_speech", {
6 method: "POST",
7 headers: { "Content-Type": "application/json" },
8 body: JSON.stringify({
9 key: "your_api_key",
10 voice_id: "your_cloned_voice_id",
11 text: "This product is now available in your region.",
12 language: lang,
13 speed: 1.0
14 })
15 });
16 const data = await response.json();
17 console.log(`${lang}: ${data.output[0]}`);
18}

What Voice Cloning Capabilities Are Available?

Train custom voice models from audio samples and generate natural speech in any cloned voice. Multi-language support with emotional control.

Instant Voice Cloning

Clone a voice from as little as 10 seconds of audio. The API analyzes vocal characteristics — pitch, tone, accent, pace — and creates a reusable voice model for text-to-speech generation.

Multi-Language Voice Generation

Generate speech in cloned voices across 40+ languages. The cloned voice maintains its unique characteristics while speaking in English, Spanish, French, German, Hindi, Japanese, and more.

Emotional and Expressive Speech

Control the emotional tone of generated speech. Add emphasis, adjust pacing, and convey emotions like excitement, calm, urgency, or warmth through API parameters.

High-Fidelity Output

Generate broadcast-quality audio at 24kHz or 48kHz sample rates. Output in WAV, MP3, or OGG format for direct use in production applications.

Instant Voice Cloning
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result
Example of AI outpainting result

Enterprise-grade voice cloning with natural prosody, emotional expression, and multi-language support.

How to Clone a Voice with Our API

Create a custom AI voice in three steps.

STEP 01
STEP 01

Step 1: Upload Voice Sample

Upload a 10-60 second audio clip of the voice you want to clone. The API accepts WAV, MP3, and OGG formats. Cleaner audio produces higher quality clones.

STEP 02
STEP 02

Step 2: Train the Voice Model

The API processes the audio sample and creates a unique voice model. Instant cloning takes seconds. For higher fidelity, you can train with longer samples for a custom model.

STEP 03
STEP 03

Step 3: Generate Speech

Send text to the API with your voice model ID to generate speech in the cloned voice. Control speed, pitch, emotion, and language through request parameters.

Why Choose ModelsLab for Voice Cloning?

Key advantages that set us apart

Clone voices from as little as 10 seconds of audio
Multi-language speech generation in 40+ languages
Emotional control — adjust tone, pace, and emphasis
Broadcast-quality audio at 24kHz and 48kHz
Instant cloning with optional deep training
WAV, MP3, and OGG output formats
Reusable voice models for repeated generation
Hundreds of pre-built voices available
RESTful API with Python and JavaScript SDKs
GDPR-compliant voice data handling
Pay-per-character pricing with free tier
24/7 developer support
Part of a full AI platform (image, video, audio, LLMs)
No questions asked refund policy

Our Popular Use Cases

Applications for voice cloning API:

Generate narration in consistent voices across entire audiobooks. Clone author voices or create distinct character voices for multi-narrator productions.

Audiobook Production

Your Data is Secure: GDPR Compliant AI Services

ModelsLab GDPR Compliance Certification Badge

GDPR Compliant

Pricing That's Perfect

Choose plan as per your needs, cancel anytime.

Coming Soon

We are making some changes to our pricing, please check back later.

Voice Cloning API — Frequently Asked Questions

Instant voice cloning requires as little as 10 seconds of clear audio. For higher fidelity, 30-60 seconds produces better results. Deep training mode can use longer samples for broadcast-quality output. The API accepts WAV, MP3, and OGG formats.

Voice cloning and text-to-speech are priced per character of generated audio, starting at approximately $0.006 per second of output. A free tier is available for testing. Subscription plans (Standard at $47/month, Unlimited at $199/month) include bundled usage with lower per-unit costs.

Yes. Cloned voice models retain their unique vocal characteristics (pitch, tone, timbre) when generating speech in any of the 40+ supported languages. This enables content localization without hiring voice actors for each language.

Voice cloning technology itself is legal in most jurisdictions. However, you must have consent from the person whose voice you clone. ModelsLab requires users to comply with local laws and obtain appropriate permissions before cloning voices.

The API generates broadcast-quality audio at 24kHz or 48kHz sample rates. Output formats include WAV (uncompressed), MP3 (compressed), and OGG. The quality is suitable for audiobooks, podcasts, video narration, and IVR systems.

ModelsLab offers voice cloning as part of a comprehensive AI API platform that includes image generation, video creation, and LLM access — all through one API key and billing account. Pricing is pay-per-character with no monthly minimums, and hundreds of pre-built voices are included at no extra cost.

Get Expert Support in Seconds

We're Here to Help.

Want to know more? You can email us anytime at support@modelslab.com

View Docs
Plugins

Explore Plugins for Pro

Our plugins are designed to work with the most popular content creation software.

API

Build Apps with
ML
API

Use our API to build apps, generate AI art, create videos, and produce audio with ease.