Omnihuman

by Bytedance
omni-human
Omnihuman Image + Audio to Video
Generates hyper-realistic full-body videos from a single image plus an audio or video input, using diffusion transformers. It supports diverse styles and natural motion, and produces HD output at up to 1024×1024 and 30 fps.

Generation costs $0.168 per second.
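
As a rough illustration of the per-second rate above, the sketch below estimates the cost of a clip from its duration. The function name `estimate_cost` is hypothetical, and the code assumes billing is strictly linear in duration with no minimum charge, rounding, or extra fees; the page does not confirm any of these.

```python
from decimal import Decimal

# Per-second rate quoted on this page (USD).
RATE_USD_PER_SECOND = Decimal("0.168")


def estimate_cost(duration_seconds):
    """Return the estimated cost in USD for a clip of the given length.

    Assumption: billing is linear in duration, with no minimum charge
    and no rounding of partial seconds.
    """
    seconds = Decimal(str(duration_seconds))
    if seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    return RATE_USD_PER_SECOND * seconds


if __name__ == "__main__":
    # Print estimates for a few example durations.
    for seconds in (5, 10, 30):
        print(f"{seconds:>3} s -> ${estimate_cost(seconds):.2f}")
```

Under these assumptions, a 10-second clip comes to $1.68. `Decimal` is used rather than `float` so that currency amounts are not affected by binary floating-point rounding.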

Related Models

Discover similar models you might be interested in

Seedance 1.0 Pro Fast Image To Video
Bytedance (Popular)

Kling V2.5 Turbo Image To Video
KlingAI

Hailuo 2.3 Image To Video
MiniMax

Kling V2.1 Master Text To Video
KlingAI

Kling Motion Control
KlingAI

Hailuo 02 Image To Video
MiniMax

Wan2.6 Image To Video
Alibaba

Hailuo 02 Start/End Frame Image To Video
MiniMax

Veo 3.1 Fast
Google

Hailuo 2.3 Fast Image To Video
MiniMax

Veo 3 Fast preview
Google

Hailuo 2.3 Text To Video
MiniMax