Create & Edit Images Instantly with Grok Imagine

Try Grok Imagine
Skip to main content
Omnihuman thumbnail

Omnihuman

by Bytedance
omni-human
Omnihuman Image + Audio to VideoLLMs.txt
Generates hyper-realistic full-body videos from a single image and audio/video input using advanced diffusion transformers, supporting diverse styles, natural motion, and HD outputs up to 1024×1024 at 30fps.
API PlaygroundAPI Documentation

Input

File preview

File preview

or record audio
Audio recording is not supported in your browser

Per sec generation will cost 0.168$

Output

Idle

Unknown content type

Related Models

Discover similar models you might be interested in

Veo 3 Fast
Popular
Google

Veo 3 Fast

Kling V2.5 Turbo Image To Video
KlingAI

Kling V2.5 Turbo Image To Video

Wan 2.5 Image to Video
Alibaba

Wan 2.5 Image to Video

Kling V2 Master Image To Video
KlingAI

Kling V2 Master Image To Video

Kling V2 Master Text To Video
KlingAI

Kling V2 Master Text To Video

Seedance 1.0 Pro Fast Text to Video
Bytedance

Seedance 1.0 Pro Fast Text to Video

Gen4 Aleph (Video Edit)
Runway ML

Gen4 Aleph (Video Edit)

Kling V2.1 Master Image To Video
KlingAI

Kling V2.1 Master Image To Video

Kling V2.5 Turbo Text To Video
KlingAI

Kling V2.5 Turbo Text To Video

Kling Motion Control
KlingAI

Kling Motion Control

Seedance 1.5 Pro
Popular
Bytedance

Seedance 1.5 Pro

Hailuo 02 Image To Video
Minmax

Hailuo 02 Image To Video