AI APIs for Developers
AI Model APIs
Discover and integrate with powerful AI model APIs for your applications
ModelsLab
Create and customize any AI-generated voice you can imagine using a simple text prompt - choose the tone, style, accent, emotion, age, or personality, and instantly turn your words into natural-sounding speech.
ModelsLab
The Qwen Text-to-Speech endpoint generates audio from text using a provided audio URL, producing output that mimics the uploaded voice
Sonauto
Audio Inpaint intelligently reconstructs missing or corrupted portions of an audio clip. Whether you need to remove unwanted noises, repair damaged recordings, or fill silent gaps, the model analyzes the surrounding context to generate smooth.
Sonauto
This endpoint allows clients to extend an existing song / vocal audio track by generating additional material
Sonauto
Generate full songs from text, lyrics, or melodies with a latent diffusion-powered AI music model offering up to 4:45 min tracks, voice control, and seamless editing.
Eleven Labs
Eleven Music is cleared for nearly all commercial uses, from film and television to podcasts and social media videos, and from advertisements to gaming
Inworld
The Text-to-Audio endpoint enables you to generate audio by providing a text input along with a valid audio URL or a pre-created voice using a voice_id. The output is an audio file that mimics the sound of the provided audio URL or the selected voice.
Eleven Labs
Generate up to 30 seconds of professional, royalty-free sound effects from text prompts with customizable duration, looping, and multiple MP3 output formats at 44.1 kHz.
Eleven Labs
Transform one voice into another in using advanced speech-to-speech technology. Perfect for dubbing, content creation, and voice customization without altering the original message.
Eleven Labs
The Text-to-Audio endpoint enables you to generate audio by providing a text input along with a valid audio URL or a pre-created voice using a voice_id. The output is an audio file that mimics the sound of the provided audio URL or the selected voice.
ModelsLab
The endpoint enables automatic voice translation of videos from one language to another. It accepts a video file link and various parameters to control the dubbing process.
ModelsLab
The SFX endpoint allows you to generate sound effects (SFX) from text prompts. It takes user input in the form of a text prompt to conditionally generate audio effects.














