🚀 Z-Image Turbo Image-to-Image is Live — Unlimited Generations for Premium Users

Try Now
Skip to main content
diffrhythm-short thumbnail

ModelsLab/Diffrhythm-Short

diffrhythm-short
Generate full-length songs with vocals and instrumentals in 10 seconds using latent diffusion—supports English and Chinese, 44.1kHz stereo, up to 4m45s tracks, requires only lyrics and style prompts.
API PlaygroundAPI Documentation

Input

Customize your API request

File preview

or record audio
Audio recording is not supported in your browser

Per Audio generation will cost 0.0047$
For premium plan Audio generation will cost 0.00$ i.e Free.

Output

Generation complete

Idle

Unknown content type

Related Models

Discover similar models you might be interested in

scribe_v1

scribe_v1

eleven_sound_effect

eleven_sound_effect

Inworld Text to Speech
Popular

Inworld Text to Speech

Kanye West
Popular

Kanye West