🎉 New Year Sale: Get 20% OFF on all plans — Use code NEWYEAR2026.

Upgrade now
Omnihuman thumbnail

Bytedance/Omnihuman

omni-human
Omnihuman Image + Audio to VideoLLMs.txt
Generates hyper-realistic full-body videos from a single image and audio/video input using advanced diffusion transformers, supporting diverse styles, natural motion, and HD outputs up to 1024×1024 at 30fps.
API PlaygroundAPI Documentation

Input

Customize your API request

File preview

File preview

or record audio
Audio recording is not supported in your browser

Per sec generation will cost 0.168$

Output

Generation complete

Idle

Unknown content type

Related Models

Discover similar models you might be interested in

Wan2.6 Text To Video

Hailuo 2.3 Image To Video

Hailuo 2.3 Image To Video

Kling V2.1 Master Image To Video

Kling V2.1 Master Image To Video

kling V2.1 Master Text To Video

kling V2.1 Master Text To Video