Available now on ModelsLab · Video Generation

Grok Imagine Image To Video: Images Animate Instantly

Unlock Video Generation Power

Fast Inference

6-Second Videos

Turns an image into a 6-second 480p video with audio in 15-20 seconds.

Simple API

Image To Video

Send an image URL or file to the Grok Imagine Image To Video API for instant motion.

Cost Efficient

Predictable Pricing

480p video costs $0.05 per second, plus $0.002 per input image.

Examples

See what Grok Imagine Image To Video can create

Copy any prompt below and try it yourself in the playground.

Cityscape Motion

Animate a bustling city skyline at dusk with smooth camera pan over neon lights and traffic flow, 480p, add ambient urban audio.

Ocean Waves

Bring to life a serene beach scene with rolling waves crashing on shore, gentle wind in palm trees, realistic water physics, include wave sounds.

Product Reveal

Animate a sleek smartphone rotating on reflective surface under studio lights, slow 360 spin with subtle glow effects, neutral background audio.

Forest Path

Create motion in a misty forest trail with leaves rustling and sunlight filtering through canopy, walking camera forward, natural ambient sounds.

For Developers

A few lines of code.
Video from image. One call.

ModelsLab handles the infrastructure: fast inference, auto-scaling, and a developer-friendly API. No GPU management needed.

  • Serverless: scales to zero, scales to millions
  • Pay per second, no minimums
  • Python and JavaScript SDKs, plus REST API
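
import requests

# Submit an image-to-video request to the ModelsLab endpoint.
response = requests.post(
    "https://modelslab.com/api/v7/video-fusion/image-to-video",
    json={
        "key": "YOUR_API_KEY",  # replace with your ModelsLab API key
        "prompt": "A cinematic flight of Iron Man, wind blowing, aesthetic view, blazing fast speed, leaving a trail of fire and smoke while flying",
        "duration": "6",  # clip length in seconds
        "init_image": "https://assets.modelslab.ai/generations/59935966-289c-455e-80e2-06b939a8bcdb.jpg",
    },
    timeout=60,  # avoid hanging indefinitely on a stalled connection
)
response.raise_for_status()  # surface HTTP errors instead of printing an error body
print(response.json())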

FAQ

Common questions about Grok Imagine Image To Video

Grok Imagine Image To Video is an xAI model that generates 6-second videos with audio from images. It produces a 480p clip in 15-20 seconds and accepts JPG, PNG, and WebP inputs.
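As a rough illustration of the accepted input formats, the sketch below checks an image URL's file extension before a request is sent. The helper name `looks_like_accepted_image` is hypothetical and not part of any ModelsLab SDK, treating `.jpeg` as equivalent to JPG is an assumption, and an extension check is only a heuristic: the API may validate the actual file contents differently.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Formats named on this page (JPG, PNG, WebP); ".jpeg" is an assumed alias for JPG.
ACCEPTED_EXTENSIONS = {".jpg", ".jpeg", ".png", ".webp"}


def looks_like_accepted_image(url: str) -> bool:
    """Return True if the URL path ends in one of the accepted extensions.

    Hypothetical pre-flight helper: it inspects only the URL, not the file.
    """
    path = urlparse(url).path  # drops any query string or fragment
    return PurePosixPath(path).suffix.lower() in ACCEPTED_EXTENSIONS


if __name__ == "__main__":
    sample = "https://assets.modelslab.ai/generations/59935966-289c-455e-80e2-06b939a8bcdb.jpg"
    print(looks_like_accepted_image(sample))
```

A check like this can catch an obviously unsupported file (for example a GIF) locally, before it costs a request.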

Videos generate in under 20 seconds. A loading indicator shows real-time progress from 15% to 100%. Output clips are 6 seconds long.

480p video costs $0.05 per second, plus $0.002 per input image. 720p video costs $0.07 per second. Pricing scales with resolution.
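To make the pricing concrete, here is a minimal sketch that estimates the cost of one clip from the rates quoted on this page. The function name `estimate_cost` is hypothetical, and applying the $0.002 image fee to 720p clips is an assumption, since the page states that fee only alongside the 480p rate.

```python
from decimal import Decimal

# Per-second rates as quoted on this page.
RATE_PER_SECOND = {"480p": Decimal("0.05"), "720p": Decimal("0.07")}
# Per-image input fee; assumed to apply at every resolution.
IMAGE_INPUT_FEE = Decimal("0.002")


def estimate_cost(seconds: int, resolution: str = "480p", images: int = 1) -> Decimal:
    """Estimate the USD cost of one generation (hypothetical helper)."""
    if resolution not in RATE_PER_SECOND:
        raise ValueError(f"unknown resolution: {resolution!r}")
    return RATE_PER_SECOND[resolution] * seconds + IMAGE_INPUT_FEE * images


if __name__ == "__main__":
    print(estimate_cost(6))          # 6 s at 480p, one image: 0.302
    print(estimate_cost(6, "720p"))  # 6 s at 720p, one image: 0.422
```

`Decimal` is used instead of floats so that small per-image fees add up exactly when many clips are summed.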

Best for quick animations without prompts. Lacks Veo keyframing but excels in speed and fluidity. Ideal for social feeds.

Works with photos, AI-generated images, and images with multiple subjects. Output quality matches the input. Handles low-resolution images and family photos.

Outputs include natively synced audio. The model generates sound that matches the motion, such as waves or urban noise. Downloads come with commercial rights.

Ready to create?

Start generating with Grok Imagine Image To Video on ModelsLab.