Skip to main content
Available now on ModelsLab · Video Generation

Hailuo 2.3 Text To VideoText Sparks Cinematic Motion

Generate Precise Video Sequences

Realistic Motion

Lifelike Expressions Actions

Hailuo 2.3 Text To Video model captures nuanced facial gestures and complex choreography from text prompts.

Cinematic Controls

Camera Movement Precision

Direct controllable camera paths and dynamic transitions in Hailuo 2.3 Text To Video API outputs.

Dual Input Support

Text Image To Video

Process text prompts or images into 1080p videos with temporal consistency via Hailuo 2.3 endpoint.

Examples

See what Hailuo 2.3 Text To Video can create

Copy any prompt below and try it yourself in the playground.

Urban Timelapse

Aerial drone shot over futuristic city at dusk, skyscrapers with neon lights reflecting on rain-slick streets, smooth forward pullback camera motion, cinematic lighting, high detail, 1080p.

Ocean Waves

Powerful waves crashing on rocky cliffside at sunrise, foam spraying upward, slow orbiting camera around the impact zone, realistic water physics, golden hour glow, professional quality.

Product Reveal

Sleek metallic watch rotating on glass pedestal in modern studio, soft spotlight highlights engravings, gentle pan camera from side to top view, sharp focus, commercial style.

Forest Path

Serene forest trail winding through ancient trees in autumn, falling leaves drift in wind, steady tracking shot forward along path, volumetric fog, natural depth of field.

For Developers

A few lines of code.
Videos From Text. One Call.

ModelsLab handles the infrastructure: fast inference, auto-scaling, and a developer-friendly API. No GPU management needed.

  • Serverless: scales to zero, scales to millions
  • Pay per second, no minimums
  • Python and JavaScript SDKs, plus REST API
import requests
response = requests.post(
"https://modelslab.com/api/v7/video-fusion/text-to-video",
json={
"key": "YOUR_API_KEY",
"prompt": "A sunset aerial shot of a lone rider galloping across a snow-covered plain, swirling snow kicked by the hooves, warm orange backlight casting long shadows, steady tracking shot from behind, cinematic photorealism with film grain."
}
)
print(response.json())

FAQ

Common questions about Hailuo 2.3 Text To Video

Read the docs

Hailuo 2.3 Text To Video generates 1080p cinematic videos from text or images. It excels in realistic motion, expressions, and camera controls. Access via MiniMax Hailuo 2.3 Text To Video API.

Send text prompts or image URLs to the Hailuo 2.3 Text To Video API endpoint. It outputs coherent 10-second clips with specified styles. Average runtime is 130 seconds.

Supports 768p standard and 1080p pro outputs. Choose tiers for quality and cost balance. Ideal for professional video generation.

Hailuo 2.3 Text To Video offers superior motion realism and prompt accuracy over predecessors. Use Hailuo 2.3 Text To Video API for text-to-video and image-to-video tasks.

Yes, Hailuo 2.3 Text To Video handles image-to-video alongside text inputs. Animates static images with natural motion. Fast variant optimizes for speed.

Generates ~10-second clips without first-last frame control. Single first-frame input only. Supports diverse styles from photorealistic to animated.

Ready to create?

Start generating with Hailuo 2.3 Text To Video on ModelsLab.