AI Model APIs

Discover and integrate with powerful AI model APIs for your applications

Free for Premium Users
Qwen Voice Design

ModelsLab

Create and customize any AI-generated voice you can imagine using a simple text prompt: choose the tone, style, accent, emotion, age, or personality, and instantly turn your words into natural-sounding speech.

Voice Designer | Ultra Natural | Newly Added
Free for Premium Users
Qwen Voice cloning

ModelsLab

The Qwen Text-to-Speech endpoint generates audio from text using a provided audio URL, producing output that mimics the voice in the uploaded audio.

3-Sec Voice Clone | Supports 10 Languages
Song Inpaint

Sonauto

Audio Inpaint intelligently reconstructs missing or corrupted portions of an audio clip. Whether you need to remove unwanted noises, repair damaged recordings, or fill silent gaps, the model analyzes the surrounding context to generate a smooth reconstruction.

Newly Added
Song Extender

Sonauto

This endpoint allows clients to extend an existing song or vocal audio track by generating additional material.

Newly Added
Text to Music

Sonauto

Generate full songs from text, lyrics, or melodies with a latent diffusion-powered AI music model offering tracks of up to 4:45 minutes, voice control, and seamless editing.

Newly Added
Elevenlabs/Text to Music

Eleven Labs

Eleven Music is cleared for nearly all commercial uses, from film and television to podcasts and social media videos, and from advertisements to gaming.

Music Production | Best Song Generation
Inworld/Text To Speech

Inworld

The Text-to-Audio endpoint enables you to generate audio by providing a text input along with a valid audio URL or a pre-created voice using a voice_id. The output is an audio file that mimics the sound of the provided audio URL or the selected voice.

High-Quality Output | Supports 30+ Languages
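
As a rough illustration of the input rule in the description above (a text input plus either an audio URL or a voice_id), a request body might be assembled as in the sketch below. The field names `text`, `audio_url`, and `voice_id`, the helper `build_tts_payload`, and the choice to reject requests that supply both voice sources are assumptions made for this example, not the documented schema; check the endpoint's own reference for the real parameter names.

```python
from typing import Optional
from urllib.parse import urlparse


def build_tts_payload(text: str,
                      audio_url: Optional[str] = None,
                      voice_id: Optional[str] = None) -> dict:
    """Assemble a hypothetical text-to-speech request body.

    All field names here are illustrative assumptions, not the provider's schema.
    """
    if not text.strip():
        raise ValueError("text must not be empty")

    # Assumption: exactly one voice source is given, either a reference
    # audio URL to mimic or the id of a pre-created voice.
    if (audio_url is None) == (voice_id is None):
        raise ValueError("provide exactly one of audio_url or voice_id")

    payload = {"text": text}
    if audio_url is not None:
        # "Valid" is interpreted here as an absolute http(s) URL.
        parsed = urlparse(audio_url)
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            raise ValueError("audio_url must be an absolute http(s) URL")
        payload["audio_url"] = audio_url
    else:
        payload["voice_id"] = voice_id
    return payload
```

Validating the payload locally before sending it keeps malformed requests from consuming API quota; the resulting dictionary could then be posted as JSON with any HTTP client.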
Elevenlabs/Sound-Effect

Eleven Labs

Generate up to 30 seconds of professional, royalty-free sound effects from text prompts with customizable duration, looping, and multiple MP3 output formats at 44.1 kHz.

Cheapest Price | Best SFX
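
To put the stated limits in concrete terms, the sketch below checks a requested duration against the 30-second ceiling and works out how many samples per channel that duration represents at 44.1 kHz. The constant and function names are hypothetical and chosen only for this example; the listing does not say how the duration parameter is named or validated on the server.

```python
SAMPLE_RATE_HZ = 44_100   # 44.1 kHz, as stated in the listing
MAX_DURATION_S = 30.0     # upper limit stated in the listing


def samples_for_duration(duration_s: float) -> int:
    """Return the per-channel sample count for a clip of the given length.

    Hypothetical helper: rejects durations outside (0, 30] seconds.
    """
    if not 0 < duration_s <= MAX_DURATION_S:
        raise ValueError(f"duration must be in (0, {MAX_DURATION_S}] seconds")
    return round(duration_s * SAMPLE_RATE_HZ)


print(samples_for_duration(30))   # 1323000 samples for the longest clip
print(samples_for_duration(2.5))  # 110250 samples for a 2.5-second effect
```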
Elevenlabs/Speech To Speech

Eleven Labs

Transform one voice into another using advanced speech-to-speech technology. Perfect for dubbing, content creation, and voice customization without altering the original message.

Best for Creators
Elevenlabs/Text to Speech

Eleven Labs

The Text-to-Audio endpoint enables you to generate audio by providing a text input along with a valid audio URL or a pre-created voice using a voice_id. The output is an audio file that mimics the sound of the provided audio URL or the selected voice.

Supports 30+ Languages | Trending
Free for Premium Users
CreateDubbing

ModelsLab

This endpoint automatically translates the voice track of a video from one language to another. It accepts a link to a video file and several parameters that control the dubbing process.

Free for Premium Users
SoundEffect(SFX)

ModelsLab

The SFX endpoint generates sound effects (SFX) from text prompts. The user's text prompt serves as the conditioning input for the generated audio.