Inworld Text To Speech

by Inworld

Ultra-realistic, low-latency voice cloning with support for 11 languages, instant and professional cloning, 48 kHz audio, fine emotional control, and API access. Ideal for dynamic, expressive AI interactions.

inworld-tts-1

Technical Specifications

Model ID: inworld-tts-1
Provider: Inworld
Category: Audio Models
Task: Voice Cloning
Price: $6 per million characters
Added: August 7, 2025

Key Features

  • AI voice synthesis and text-to-speech
  • Multiple language and accent support
  • Voice cloning from short audio samples
  • Real-time audio processing via API
  • Customizable speech parameters

Quick Start

Integrate Inworld Text To Speech into your application with a single API call. To get started, obtain your API key from the pricing page.

import json

import requests

url = "https://modelslab.com/api/v7/voice/text-to-speech"
headers = {"Content-Type": "application/json"}
data = {
    "model_id": "inworld-tts-1",
    "prompt": "your prompt here",
    "key": "YOUR_API_KEY",
}

try:
    # A timeout keeps the call from hanging indefinitely on network problems.
    response = requests.post(url, headers=headers, json=data, timeout=60)
    response.raise_for_status()  # Raises an HTTPError for bad responses (4XX or 5XX)
    result = response.json()
    print("API Response:")
    print(json.dumps(result, indent=2))
except requests.exceptions.HTTPError as http_err:
    # Read the body from the error's own response object.
    print(f"HTTP error occurred: {http_err} - {http_err.response.text}")
except Exception as err:
    print(f"Other error occurred: {err}")

View the full API documentation for SDKs, code examples in Python, JavaScript, and more.

Pricing

Inworld Text To Speech API costs $6 per million characters. Pay only for what you use, with no minimum commitments.
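As a rough illustration of the per-character pricing, the sketch below estimates the cost of a piece of text at the listed rate of $6 per million characters. The function name estimate_cost_usd is hypothetical, and the sketch assumes every character (including spaces and punctuation) is billed, which this page does not confirm.

```python
from decimal import Decimal

# Listed price: 6 USD per million characters.
PRICE_PER_MILLION_CHARS = Decimal("6")


def estimate_cost_usd(text: str) -> Decimal:
    """Estimate the cost of synthesizing `text` at the listed rate.

    Assumption: billing counts every character in the string,
    whitespace and punctuation included.
    """
    return PRICE_PER_MILLION_CHARS * len(text) / Decimal(1_000_000)


if __name__ == "__main__":
    script = "Welcome back! Your order has shipped."
    print(f"{len(script)} characters, about ${estimate_cost_usd(script)}")
```

Decimal is used instead of float so that very small per-request amounts are not distorted by binary rounding.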

Use Cases

  • Voice-over production for video content
  • Podcast and audiobook narration
  • Multilingual customer support automation
  • Interactive voice response (IVR) systems

Inworld Text To Speech FAQ

Inworld Text To Speech is an ultra-realistic, low-latency voice cloning model. It supports 11 languages, instant and professional cloning, 48 kHz audio, fine emotional control, and API access, making it well suited to dynamic, expressive AI interactions.

You can integrate Inworld Text To Speech into your application with a single API call. Sign up on ModelsLab to get your API key, then use the model ID "inworld-tts-1" in your API requests. The API documentation provides SDKs for Python and JavaScript, along with cURL examples.

Inworld Text To Speech costs $6 per million characters. ModelsLab uses pay-per-use pricing with no minimum commitments. A free tier is available to get started.

The model ID for Inworld Text To Speech is "inworld-tts-1". Use this ID in your API requests to specify this model.
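A minimal sketch of how the model ID might be wired into a request body is shown here. The field names (model_id, prompt, key) are the ones used in the Quick Start request; the helper name build_tts_payload and the empty-text check are illustrative assumptions, not part of any ModelsLab SDK. The sketch also assumes the prompt field carries the text to be spoken.

```python
DEFAULT_MODEL_ID = "inworld-tts-1"


def build_tts_payload(text: str, api_key: str, model_id: str = DEFAULT_MODEL_ID) -> dict:
    """Build a JSON-serializable request body for the text-to-speech endpoint.

    Hypothetical helper: it only assembles the three fields shown in the
    Quick Start and rejects empty text before any request is sent.
    """
    if not text.strip():
        raise ValueError("text must not be empty")
    return {
        "model_id": model_id,  # selects the model for this request
        "prompt": text,        # assumed to be the text to synthesize
        "key": api_key,
    }


if __name__ == "__main__":
    print(build_tts_payload("Hello there.", "YOUR_API_KEY"))
```

The resulting dictionary can be passed as the json argument of requests.post.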

Yes, ModelsLab offers a free tier that lets you try Inworld Text To Speech and other AI models. Sign up to get free API credits and start building immediately.