🔥 20% OFF All Kling Models

Omnihuman thumbnail

Omnihuman

Generates hyper-realistic full-body videos from a single image and audio/video input using advanced diffusion transformers, supporting diverse styles, natural motion, and HD outputs up to 1024×1024 at 30fps.

omni-human

Closed Source ModelLLMs.txt Learn more

Input

Per sec generation will cost 0.168$

Output

Idle

Unknown content type

Open Source Alternatives

Similar models we host ourselves — unlimited on the $149/month Open Source Unlimited plan, where this model is billed per call from your wallet

View all open source models

SVD

SVD

Open Source Model

CogVideoX

CogVideoX

Open Source Model

wan2.1

wan2.1

Open Source Model

Looking for something different?

Models that trade off differently against the one you are viewing

Better quality

Newer flagship models that generally produce stronger results.

Seedance 2.0 Multi Reference To Video

Seedance 2.0 Multi Reference To Video

Closed Source Model

Seedance 1.5 Pro

Seedance 1.5 Pro

Closed Source Model

Seedance 2.0 Start/End Frame To Video

Seedance 2.0 Start/End Frame To Video

Closed Source Model

LTX 2 PRO Image To Video

LTX 2 PRO Image To Video

Closed Source Model

WAN 2.7 Image To Video

WAN 2.7 Image To Video

Closed Source Model

happyhorse-1.0-i2v

happyhorse-1.0-i2v

Closed Source Model

Faster turnaround

Built for speed when you are iterating on a prompt.

Happyhorse 1.1 Image To Video

Happyhorse 1.1 Image To Video

Closed Source Model

Happyhorse 1.1 Text To Video

Happyhorse 1.1 Text To Video

Closed Source Model

Happyhorse 1.1 Reference To Video

Happyhorse 1.1 Reference To Video

Closed Source Model

LTX 2.3

LTX 2.3

Open Source Model

Lower cost per run

Open-source models we host, unlimited on the $149/month plan.

wan 2.2

wan 2.2

Open Source Model

Hailuo 02 Start/End Frame Image To Video

Hailuo 02 Start/End Frame Image To Video

Closed Source Model

Hailuo 02 Text To Video

Hailuo 02 Text To Video

Closed Source Model

Sora-2

Sora-2

Closed Source Model

Hailuo 2.3 Image To Video

Hailuo 2.3 Image To Video

Closed Source Model

Kling V2.5 Turbo Text To Video

Kling V2.5 Turbo Text To Video

Closed Source Model

Related Models

Discover similar models you might be interested in

View all Video Models

Seedance 1.0 Pro Fast Text to Video

Seedance 1.0 Pro Fast Text to Video

Closed Source Model

Grok Imagine Text To Video

Grok Imagine Text To Video

Closed Source Model

Wan 2.5 Text to Video

Wan 2.5 Text to Video

Closed Source Model

Veo 3.1 Fast

Veo 3.1 Fast

Closed Source Model

Hailuo 02 Image To Video

Hailuo 02 Image To Video

Closed Source Model

Kling Motion Control

Kling Motion Control

Closed Source Model

kling V2.1 Master Text To Video

kling V2.1 Master Text To Video

Closed Source Model

Kling V2 Master Text To Video

Kling V2 Master Text To Video

Closed Source Model

Gen4 Image To Video Turbo

Gen4 Image To Video Turbo

Closed Source Model

Kling V2.5 Turbo Image To Video

Kling V2.5 Turbo Image To Video

Closed Source Model

Kling V2 Master Image To Video

Kling V2 Master Image To Video

Closed Source Model

Hailuo 2.3 Text To Video

Hailuo 2.3 Text To Video

Closed Source Model

wan2.6 Image To Video (Flash)

wan2.6 Image To Video (Flash)

Closed Source Model

Gen4 Aleph (Video Edit)

Gen4 Aleph (Video Edit)

Closed Source Model

lipsync-2

lipsync-2

Closed Source Model

Wan2.6 Image To Video

Wan2.6 Image To Video

Closed Source Model

Kling V2.1 Image To Video

Kling V2.1 Image To Video

Closed Source Model

Grok Imagine Image To Video

Grok Imagine Image To Video

Closed Source Model

About Omnihuman

Generates hyper-realistic full-body videos from a single image and audio/video input using advanced diffusion transformers, supporting diverse styles, natural motion, and HD outputs up to 1024×1024 at 30fps.

Technical Specifications

Model ID: omni-human
Provider: BytePlus
Category: Video Models
Task: Video Generation
Price: $0.168 per second
Added: September 4, 2025

Key Features

AI video generation from text or image input
Motion control and camera movement parameters
Adjustable frame rate and video duration
High-quality cinematic output up to 1080p
Native audio generation support

Quick Start

Integrate Omnihuman into your application with a single API call. Get your API key from the pricing page to get started.

import requests
import json

url = "https://modelslab.com/api/v7/video-fusion/image-to-video"

headers = {
    "Content-Type": "application/json"
}

data = {
        "model_id": "omni-human",
        "prompt": "your prompt here",
        "key": "YOUR_API_KEY"
    }

try:
    response = requests.post(url, headers=headers, json=data)
    response.raise_for_status()  # Raises an HTTPError for bad responses (4XX or 5XX)
    result = response.json()
    print("API Response:")
    print(json.dumps(result, indent=2))
except requests.exceptions.HTTPError as http_err:
    print(f"HTTP error occurred: {http_err} - {response.text}")
except Exception as err:
    print(f"Other error occurred: {err}")

Pricing

Omnihuman API costs $0.168000 per second. Pay only for what you use with no minimum commitments. View pricing plans

Use Cases

Marketing and promotional video creation
Social media short-form video content
Product demos and explainer videos
Creative storytelling and animation

Learn more about Omnihuman Browse Video Models More from BytePlus View Pricing

Omnihuman FAQ