Deploy Dedicated GPU server to run AI models

Deploy Model
Skip to main content

AI Model APIs

Discover and integrate with powerful AI model APIs for your applications

Qwen Voice Design

ModelsLab

Qwen Voice Design

Create and customize any AI-generated voice you can imagine using a simple text prompt - choose the tone, style, accent, emotion, age, or personality, and instantly turn your words into natural-sounding speech.

Open SourceVoice DesignerUltra NaturalNew Added
Qwen Voice cloning

ModelsLab

Qwen Voice cloning

The Qwen Text-to-Speech endpoint generates audio from text using a provided audio URL, producing output that mimics the uploaded voice

Open Source3-Sec Voice CloneSupport 10 Languages
Song Inpaint

Sonauto

Song Inpaint

Audio Inpaint intelligently reconstructs missing or corrupted portions of an audio clip. Whether you need to remove unwanted noises, repair damaged recordings, or fill silent gaps, the model analyzes the surrounding context to generate smooth.

Closed SourceNew Added
Song Extender

Sonauto

Song Extender

This endpoint allows clients to extend an existing song / vocal audio track by generating additional material

Closed SourceNew Added
Text to Music

Sonauto

Text to Music

Generate full songs from text, lyrics, or melodies with a latent diffusion-powered AI music model offering up to 4:45 min tracks, voice control, and seamless editing.

Closed SourceNew Added
Elevenlabs/Text to Music

Eleven Labs

Elevenlabs/Text to Music

Eleven Music is cleared for nearly all commercial uses, from film and television to podcasts and social media videos, and from advertisements to gaming

Closed SourceMusic ProductionBest song generation
Inworld/Text To Speech

Inworld

Inworld/Text To Speech

The Text-to-Audio endpoint enables you to generate audio by providing a text input along with a valid audio URL or a pre-created voice using a voice_id. The output is an audio file that mimics the sound of the provided audio URL or the selected voice.

Closed SourceHigh Quality OutputSupport 30+ Languages
Elevenlabs/Sound-Effect

Eleven Labs

Elevenlabs/Sound-Effect

Generate up to 30 seconds of professional, royalty-free sound effects from text prompts with customizable duration, looping, and multiple MP3 output formats at 44.1 kHz.

Closed SourceCheapest PriceBest SFX
Elevenlabs/Speech To Speech

Eleven Labs

Elevenlabs/Speech To Speech

Transform one voice into another in using advanced speech-to-speech technology. Perfect for dubbing, content creation, and voice customization without altering the original message.

Closed SourceBest for Creators
Elevenlabs/Text to Speech

Eleven Labs

Elevenlabs/Text to Speech

The Text-to-Audio endpoint enables you to generate audio by providing a text input along with a valid audio URL or a pre-created voice using a voice_id. The output is an audio file that mimics the sound of the provided audio URL or the selected voice.

Closed SourceSupport 30+ LanguagesTrending
CreateDubbing

ModelsLab

CreateDubbing

The endpoint enables automatic voice translation of videos from one language to another. It accepts a video file link and various parameters to control the dubbing process.

Open Source
SoundEffect(SFX)

ModelsLab

SoundEffect(SFX)

The SFX endpoint allows you to generate sound effects (SFX) from text prompts. It takes user input in the form of a text prompt to conditionally generate audio effects.

Open Source