Available now on ModelsLab · Video Generation

Wan 2.5 Text To Video

Text Sparks Cinematic Video

Generate Precise Video Fast

1080p Output

High-Res Video Generation

Produce 480p to 1080p videos at 24 fps from text prompts using the Wan 2.5 Text to Video model.

Audio Sync

Native Audio Integration

Sync voiceovers, music, and effects in one pass using the audio file support in the Wan 2.5 Text to Video API.

Multi-Input

Text and Image Modes

Animate images or build scenes from prompts with optional audio guidance in Wan 2.5 Text to Video.

Examples

See what Wan 2.5 Text To Video can create

Copy any prompt below and try it yourself in the playground.

City Flyover

Aerial drone shot over futuristic city skyline at dusk, neon lights reflecting on glass towers, smooth camera pan right, ambient city hum and distant traffic sounds, 1080p, 10 seconds.

Ocean Waves

Calm ocean waves crashing on rocky shore at sunrise, golden light on water, slow zoom in on foam, gentle wave sounds and seagulls calling, 720p, 5 seconds.

Mountain Timelapse

Timelapse of clouds rolling over snow-capped mountains, shifting sunlight, steady upward pan, wind whispers and eagle cries, 1080p, 10 seconds.

Urban Street

Busy urban street with vintage cars driving past brick buildings, tracking shot forward, jazz music and footsteps echoing, 720p, 5 seconds.

For Developers

A few lines of code.
Video from text. One call.

ModelsLab handles the infrastructure: fast inference, auto-scaling, and a developer-friendly API. No GPU management needed.

  • Serverless: scales to zero, scales to millions
  • Pay per second, no minimums
  • Python and JavaScript SDKs, plus REST API

import requests

# Submit one text-to-video request; replace YOUR_API_KEY with your own key.
response = requests.post(
    "https://modelslab.com/api/v7/video-fusion/text-to-video",
    json={
        "key": "YOUR_API_KEY",
        "prompt": "Shot from a low angle, in a medium close-up, with warm tones, mixed lighting (the practical light from the desk lamp blends with the overcast light from the window), side lighting, and a central composition. In a classic detective office, wooden bookshelves are filled with old case files and ashtrays. A green desk lamp illuminates a case file spread out in the center of the desk. A fox, wearing a dark brown trench coat and a light gray fedora, sits in a leather chair, its fur crimson, its tail resting lightly on the edge, its fingers slowly turning yellowed pages. Outside, a steady drizzle falls beneath a blue sky, streaking the glass with meandering streaks. It slowly raises its head, its ears twitching slightly, its amber eyes gazing directly at the camera, its mouth clearly moving as it speaks in a smooth, cynical voice: 'The case was cold, colder than a fish in winter. But every chicken has its secrets, and I, for one, intended to find them'.",
        "duration": "5",  # clip length in seconds
        "init_audio": "https://assets.modelslab.ai/generations/2f7dfdcb-2295-4c6f-966d-4e673baee8e3.mp3",  # audio file to sync with the video
        "resolution": "720",
    },
)
# Print the parsed JSON body returned by the API.
print(response.json())
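
If you plan to send many prompts, it can help to assemble the request body in one place and catch obvious mistakes before making the call. The sketch below is illustrative only: `build_payload` is a hypothetical helper, not part of any ModelsLab SDK, and the allowed resolutions and durations are assumptions inferred from this page (480p to 1080p output, 5- and 10-second example prompts). The real API may accept other values, so check the docs before relying on these checks.

```python
# Assumed limits, inferred from this page only; the real API may differ.
ASSUMED_RESOLUTIONS = {"480", "720", "1080"}
ASSUMED_DURATIONS = {"5", "10"}


def build_payload(prompt, api_key, duration="5", resolution="720", init_audio=None):
    """Hypothetical helper: build the JSON body used in the example above."""
    prompt = prompt.strip()  # drop stray leading/trailing whitespace
    if not prompt:
        raise ValueError("prompt must not be empty")

    # The example above sends these fields as strings, so normalise to str.
    duration = str(duration)
    resolution = str(resolution)
    if resolution not in ASSUMED_RESOLUTIONS:
        raise ValueError(f"unsupported resolution: {resolution}")
    if duration not in ASSUMED_DURATIONS:
        raise ValueError(f"unsupported duration: {duration}")

    payload = {
        "key": api_key,
        "prompt": prompt,
        "duration": duration,
        "resolution": resolution,
    }
    # Audio guidance is optional, so only include it when a URL is given.
    if init_audio:
        payload["init_audio"] = init_audio
    return payload


payload = build_payload(
    "Calm ocean waves crashing on rocky shore at sunrise",
    api_key="YOUR_API_KEY",
    duration=5,
    resolution=720,
)
print(payload["resolution"], payload["duration"])
```

The returned dictionary can be passed as the `json=` argument of the `requests.post` call shown above.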

Ready to create?

Start generating with Wan 2.5 Text To Video on ModelsLab.