Deploy Dedicated GPU server to run AI models

Omnihuman

Generates hyper-realistic full-body videos from a single image and audio/video input using advanced diffusion transformers, supporting diverse styles, natural motion, and HD outputs up to 1024×1024 at 30fps.

omni-human

Closed Source ModelLLMs.txt

API Playground API Documentation

Input

Per sec generation will cost 0.168$

Output

Idle

Unknown content type

Open Source Alternatives

Explore open-source models that offer similar capabilities with full transparency and flexibility

View all open source models

SVD

CogVideoX

wan2.1

Omnihuman

Input

Output

Open Source Alternatives

Omnihuman Readme