
Omnihuman
by BytedanceGenerates hyper-realistic full-body videos from a single image and audio/video input using advanced diffusion transformers, supporting diverse styles, natural motion, and HD outputs up to 1024×1024 at 30fps.
omni-humanClosed Source ModelLLMs.txt
Input
Per sec generation will cost 0.168$
Output
Idle
Unknown content type
Open Source Alternatives
Explore open-source models that offer similar capabilities with full transparency and flexibility
Omnihuman Readme
Generates hyper-realistic full-body videos from a single image and audio/video input using advanced diffusion transformers, supporting diverse styles, natural motion, and HD outputs up to 1024×1024 at 30fps.


