Alibaba/Wan2.6 Image To Video

wan2.6-i2v
Wan 2.6 Image to Video
Generate cinematic 1080p, 24fps videos from a single image with multi-shot storytelling, native lip-sync, 15s length, and consumer-GPU support.

API Endpoint URL

Base URL for all API requests to this endpoint.

https://modelslab.com/api/v7/video-fusion/image-to-video

API Authentication

Every request must include a valid API key. Generate and manage your API keys from your developer dashboard, and pass the key in the "key" parameter of each request.
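As a rough illustration of the authentication rule above, the sketch below adds the key to a request payload under the "key" field. The helper name with_api_key and the environment variable MODELSLAB_API_KEY are hypothetical, chosen only for this example; the documentation specifies only that the key travels in the key parameter.

```python
import os
from typing import Optional


def with_api_key(payload: dict, api_key: Optional[str] = None) -> dict:
    """Return a copy of `payload` with the API key stored under "key".

    The explicit `api_key` argument wins; otherwise the key is read from
    the MODELSLAB_API_KEY environment variable (a hypothetical name used
    only in this sketch).
    """
    key = api_key or os.environ.get("MODELSLAB_API_KEY")
    if not key:
        raise ValueError("No API key supplied")
    # Copy rather than mutate, so the caller's dict never holds the secret.
    return {**payload, "key": key}


unsigned = {"model_id": "wan2.6-i2v", "duration": "5"}
signed = with_api_key(unsigned, api_key="YOUR_API_KEY")
```

Reading the key from the environment rather than hard-coding it keeps the secret out of source control; the original dictionary is left untouched.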

Integration Examples

Production-ready code samples for API integration. Example request body:

{
"init_image": "https://assets.modelslab.com/generations/a2dd96c6-b148-4bdc-aefc-453157d5fd0c.png",
"init_audio": "https://assets.modelslab.com/generations/ba1837f2-a8a1-49ac-ac0f-2809818867c0.mp3",
"prompt": "The person from the reference image is a travel vlogger standing on the Great Wall of China, speaking directly to the camera in a natural vlog style. Multishot cinematic sequence starting with a medium close-up selfie shot, the vlogger holding the camera, relaxed expression, light wind, then a smooth pan revealing the Great Wall stretching across the mountains with tourists clearly visible in the background, followed by an over-the-shoulder shot of the vlogger pointing toward the scenic views. The vlogger says clearly and naturally for about 5 seconds: “Right now, I’m standing on the Great Wall of China… and the view here is absolutely unreal.” Add realistic outdoor ambience with soft wind sounds, distant crowd murmurs, footsteps on stone, and clean vlog-style voice audio. Ultra-realistic visuals, perfect face consistency with the reference image, sharp background details, natural daylight, cinematic color grading, stable camera motion, authentic travel vlog mood, immersive and inspiring atmosphere, no distortions, no extra people, duration approximately 5 seconds.",
"model_id": "wan2.6-i2v",
"duration": "5",
"resolution": "720p",
"key": "YOUR_API_KEY"
}
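A minimal sketch of how a request body like the one above might be sent, using only the Python standard library. It assumes the endpoint accepts an HTTP POST with a JSON body and returns JSON; neither is stated explicitly on this page. The helper names build_request and send are hypothetical, and the example.com URLs are placeholders.

```python
import json
import urllib.request

ENDPOINT = "https://modelslab.com/api/v7/video-fusion/image-to-video"


def build_request(payload: dict) -> urllib.request.Request:
    """Wrap the payload in a JSON request (POST is an assumption)."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        ENDPOINT,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(payload: dict, timeout: float = 60.0) -> dict:
    """Send the request and decode the response, assumed to be JSON."""
    with urllib.request.urlopen(build_request(payload), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


payload = {
    "init_image": "https://example.com/portrait.png",  # placeholder URL
    "init_audio": "https://example.com/voice.mp3",     # placeholder URL
    "prompt": "The person in the image speaks to the camera.",
    "model_id": "wan2.6-i2v",
    "duration": "5",
    "resolution": "720p",
    "key": "YOUR_API_KEY",
}
request = build_request(payload)  # built only; nothing is sent here
```

Calling send(payload) would perform the actual network request. Keeping construction separate from sending makes it possible to inspect or log the request before spending credits on a generation.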

SDKs

Official SDKs

Production-ready SDKs and client libraries for all major programming languages

API Parameters

Technical specifications for API request parameters.

Field Name | Parameter | Technical Description
Image URL | init_image | URL of the source image to convert to video; the prompt guides the motion.
Audio URL | init_audio | URL of an audio file (WAV or MP3). The video attempts to align with the audio, for example in lip movements and rhythm. If the audio is longer than the duration value (5 or 10 seconds), only the first 5 or 10 seconds are used and the rest is discarded. If the audio is shorter than the video duration, the part of the video beyond the audio length is silent.
Prompt | prompt | Text prompt describing the actions you want the image to perform.
Model | model_id | ID of the model to use, selected from multiple available models (the example request uses wan2.6-i2v).
Duration | duration | Duration of the generated video, in seconds.
Resolution | resolution | Resolution of the generated video (the example request uses 720p).
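The init_audio rule in the parameter table comes down to simple arithmetic: the audio is cut to the video duration when it is longer, and the remainder of the video is silent when it is shorter. The sketch below works through that rule; the function name audio_alignment is hypothetical and illustrates the documented behavior rather than the service's actual implementation.

```python
from typing import Tuple


def audio_alignment(audio_seconds: float, video_seconds: float) -> Tuple[float, float]:
    """Return (seconds of audio used, seconds of silent video at the end)."""
    if audio_seconds < 0 or video_seconds <= 0:
        raise ValueError("durations must be positive")
    # Audio beyond the video duration is discarded.
    used = min(audio_seconds, video_seconds)
    # Video beyond the end of the audio plays without sound.
    silent_tail = video_seconds - used
    return float(used), float(silent_tail)


long_audio = audio_alignment(12.0, 5)   # 12 s clip, 5 s video
short_audio = audio_alignment(3.0, 5)   # 3 s clip, 5 s video
```

With a 12-second clip and a 5-second video, only the first 5 seconds of audio are used; with a 3-second clip, the last 2 seconds of the video are silent. Trimming audio locally to match the requested duration avoids surprises about which part of the clip is kept.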