---
title: Gemini Omni — Text To Video API | ModelsLab
description: Generate cinematic AI videos with audio from text prompts using the Gemini Omni API on ModelsLab. 720p output, world knowledge grounding. Try it now.
url: https://modelslab.com/models/google/gemini-omni-text-to-video.md
canonical: https://modelslab.com/models/google/gemini-omni-text-to-video.md
type: product
component: Playground/Endpoint/Index
generated_at: 2026-07-02T19:52:06.709984Z
---

[![Gemini Omni — Text To Video thumbnail](https://assets.modelslab.ai/api-logos/01KMSWZ55N192HV90FQ8GTR7CQ.webp)](https://modelslab.com/models/google)Gemini Omni — Text To Video
---

[by Google](https://modelslab.com/models/google)Google's Gemini Omni is live on ModelsLab. Describe any scene in plain text and get a cinematic video clip with synchronized audio — powered by Gemini's world knowledge and physics understanding.

`gemini-omni-text-to-video`

Closed Source Model [LLMs.txt](https://modelslab.com/models/google/gemini-omni-text-to-video/llms.txt)

[API Playground](/models/google/gemini-omni-text-to-video)ShowcaseAPI DocumentationVibe CodeRelated ModelsDeveloper SupportModel Specs

Input
---

Prompt 

A wide, eye-level cinematic shot captures a man walking slowly across a frost-covered bridge at dawn, his hands tucked into the pockets of a heavy coat. Pale morning light glows faintly through soft, curling fog that clings to the bridge railings. In the distance, bare trees fade into the mist, their skeletal branches barely visible. The pace is unhurried and reflective, evoking a naturalistic and quiet mood. The scene is filled with subtle, atmospheric sounds—faint footsteps crunching on frost, steady breaths in the cold air, and the distant caw of a crow echoing across the stillness.

Aspect Ratio 

Duration 

Advanced Settings Customize your input with more control.

Configure

Add FundsLogin to Generate

**Per second** video generation will cost **$0.12**

Output
---

Idle

Unknown content type

Related Models
---

Discover similar models you might be interested in

 [View all Video Models](https://modelslab.com/models?feature=videogen)

[![Seedance 1.0 Pro Fast Text to Video](https://images.stablediffusionapi.com/?Image=https://assets.modelslab.ai/generations/9a86a500-9dee-4489-858e-39fe9d9f3066.webp)](https://modelslab.com/models/byteplus/seedance-1.0-pro-fast-t2v)[Bytedance](https://modelslab.com/models/byteplus)

 [Seedance 1.0 Pro Fast Text to Video

Closed Source Model](https://modelslab.com/models/byteplus/seedance-1.0-pro-fast-t2v)

[![Hailuo 02 Image To Video ](https://images.stablediffusionapi.com/?Image=https://assets.modelslab.ai/generations/89f858fe-f734-4c69-a6e0-e4eff44a6417.webp)](https://modelslab.com/models/minimax/Hailuo-02-i2v)[Minmax](https://modelslab.com/models/minimax)

 [Hailuo 02 Image To Video

Closed Source Model](https://modelslab.com/models/minimax/Hailuo-02-i2v)

[![Kling V2.5 Turbo Image To Video](https://images.stablediffusionapi.com/?Image=https://assets.modelslab.ai/generations/5bff3d58-58d2-494a-98df-1dffc0dc58b3.webp)](https://modelslab.com/models/klingai/Kling-V2-5-Turbo-i2v)[KlingAI](https://modelslab.com/models/klingai)

 [Kling V2.5 Turbo Image To Video

Closed Source Model](https://modelslab.com/models/klingai/Kling-V2-5-Turbo-i2v)

[![Veo 3 Fast preview](https://images.stablediffusionapi.com/?Image=https://assets.modelslab.ai/generations/707c2544-a4e2-4f55-8db9-fd20fb82063b.webp)](https://modelslab.com/models/google/veo-3.0-fast-generate-preview)[Google](https://modelslab.com/models/google)

 [Veo 3 Fast preview

Closed Source Model](https://modelslab.com/models/google/veo-3.0-fast-generate-preview)

[![kling V2.1 Master Text To Video ](https://images.stablediffusionapi.com/?Image=https://assets.modelslab.ai/generations/4c066928-ae15-4d08-99f0-1cdcab52742c.webp)](https://modelslab.com/models/klingai/kling-v2-1-master-t2v)[KlingAI](https://modelslab.com/models/klingai)

 [kling V2.1 Master Text To Video

Closed Source Model](https://modelslab.com/models/klingai/kling-v2-1-master-t2v)

[![Grok Imagine Text To Video ](https://images.stablediffusionapi.com/?Image=https://assets.modelslab.ai/generations/66741ffe-f704-47ef-92d5-32fec90bcc7a.webp)](https://modelslab.com/models/xai/grok-imagine-video-t2v)[xAI](https://modelslab.com/models/xai)

 [Grok Imagine Text To Video

Closed Source Model](https://modelslab.com/models/xai/grok-imagine-video-t2v)

[![Hailuo 2.3 Text To Video ](https://images.stablediffusionapi.com/?Image=https://assets.modelslab.ai/generations/567c802c-05e1-45f9-8278-9ee7c35388b6.webp)](https://modelslab.com/models/minimax/Hailuo-2.3-t2v)[Minmax](https://modelslab.com/models/minimax)

 [Hailuo 2.3 Text To Video

Closed Source Model](https://modelslab.com/models/minimax/Hailuo-2.3-t2v)

[![Kling Motion Control](https://images.stablediffusionapi.com/?Image=https://assets.modelslab.ai/generations/f8506650-29ee-4c93-92d6-f9ad0a023c06.webp)](https://modelslab.com/models/klingai/kling-motion-control)[KlingAI](https://modelslab.com/models/klingai)

 [Kling Motion Control

Closed Source Model](https://modelslab.com/models/klingai/kling-motion-control)

[![Gen4 Aleph (Video Edit)](https://images.stablediffusionapi.com/?Image=https://assets.modelslab.ai/generations/8a41fc4d-9e2c-465f-b1e2-343e7ef681c7.webp)](https://modelslab.com/models/runway_ml/gen4_aleph)[Runway ML](https://modelslab.com/models/runway_ml)

 [Gen4 Aleph (Video Edit)

Closed Source Model](https://modelslab.com/models/runway_ml/gen4_aleph)

[![Hailuo 02 Start/End Frame Image To Video ](https://images.stablediffusionapi.com/?Image=https://assets.modelslab.ai/generations/2c685f03-1f3c-412b-b830-3d3113512d43.webp)](https://modelslab.com/models/minimax/Hailuo-02-start-end-frame)[Minmax](https://modelslab.com/models/minimax)

 [Hailuo 02 Start/End Frame Image To Video

Closed Source Model](https://modelslab.com/models/minimax/Hailuo-02-start-end-frame)

[![Kling V2.5 Turbo Text To Video](https://images.stablediffusionapi.com/?Image=https://assets.modelslab.ai/generations/a66ac4fb-91df-4b23-ae0c-126c5f1f38a8.webp)](https://modelslab.com/models/klingai/kling-v2-5-turbo-t2v)[KlingAI](https://modelslab.com/models/klingai)

 [Kling V2.5 Turbo Text To Video

Closed Source Model](https://modelslab.com/models/klingai/kling-v2-5-turbo-t2v)

[![Sora 2 Pro Text To Video ](https://images.stablediffusionapi.com/?Image=https://assets.modelslab.ai/generations/28616b8b-e430-4ce6-b693-768f995ec195.webp)](https://modelslab.com/models/openai/sora-2-pro-t2v)[Open Ai](https://modelslab.com/models/openai)

 [Sora 2 Pro Text To Video

Closed Source Model](https://modelslab.com/models/openai/sora-2-pro-t2v)

[![Kling V1.6 Multi Image To Video](https://images.stablediffusionapi.com/?Image=https://assets.modelslab.ai/generations/8c9627de-55bb-4272-8b4f-839fa8d53b9d.webp)](https://modelslab.com/models/klingai/kling-v1-6)[KlingAI](https://modelslab.com/models/klingai)

 [Kling V1.6 Multi Image To Video

Closed Source Model](https://modelslab.com/models/klingai/kling-v1-6)

[![Seedance 1.0 Pro Fast Image To Video](https://images.stablediffusionapi.com/?Image=https://assets.modelslab.ai/generations/96703442-6fd2-4a62-8b91-5b3cf5833930.webp)](https://modelslab.com/models/byteplus/seedance-1.0-pro-fast-i2v)[Bytedance](https://modelslab.com/models/byteplus)

 [Seedance 1.0 Pro Fast Image To Video

Closed Source Model](https://modelslab.com/models/byteplus/seedance-1.0-pro-fast-i2v)

[![wan2.1](https://images.stablediffusionapi.com/?Image=https://assets.modelslab.ai/generations/04d08a15-bc50-43e7-96e5-5342c249cf50.webp)](https://modelslab.com/models/modelslab/wan2.1)[ModelsLab](https://modelslab.com/models/modelslab)

 [wan2.1

Open Source Model](https://modelslab.com/models/modelslab/wan2.1)

[![Sora-2](https://images.stablediffusionapi.com/?Image=https://assets.modelslab.ai/generations/1c99ac56-551e-4ff7-97d2-66a6d58b04f9.webp)](https://modelslab.com/models/openai/sora-2)[Open Ai](https://modelslab.com/models/openai)

 [Sora-2

Closed Source Model](https://modelslab.com/models/openai/sora-2)

[![SVD](https://images.stablediffusionapi.com/?Image=https://pub-3626123a908346a7a8be8d9295f44e26.r2.dev/livewire-tmp/vRDgVJCkNyxWUSxclogvFeMWlgP9rV-metac3ZkLndlYnA=-.webp)Popular](https://modelslab.com/models/modelslab/svd)[ModelsLab](https://modelslab.com/models/modelslab)

 [SVD

Open Source Model](https://modelslab.com/models/modelslab/svd)

[![Hailuo 2.3 Fast Image To Video ](https://images.stablediffusionapi.com/?Image=https://assets.modelslab.ai/generations/e7b31183-59f6-4e12-87dc-1b7eaa3f0be1.webp)](https://modelslab.com/models/minimax/Hailuo-2.3-Fast-i2v)[Minmax](https://modelslab.com/models/minimax)

 [Hailuo 2.3 Fast Image To Video

Closed Source Model](https://modelslab.com/models/minimax/Hailuo-2.3-Fast-i2v)

Open Source Alternatives
---

Explore open-source models that offer similar capabilities with full transparency and flexibility

 [View all open source models](https://modelslab.com/models?feature=video&provider=open-source-models)

[![SVD](https://images.stablediffusionapi.com/?Image=https://pub-3626123a908346a7a8be8d9295f44e26.r2.dev/livewire-tmp/vRDgVJCkNyxWUSxclogvFeMWlgP9rV-metac3ZkLndlYnA=-.webp)Popular](https://modelslab.com/models/modelslab/svd)[ModelsLab](https://modelslab.com/models/modelslab)

 [SVD

Open Source Model](https://modelslab.com/models/modelslab/svd)

[![CogVideoX](https://images.stablediffusionapi.com/?Image=https://pub-3626123a908346a7a8be8d9295f44e26.r2.dev/livewire-tmp/VmhYqa98ohanHj8vL6mQjkr5TG2sSS-metaY29ndmlkZW94LndlYnA=-.webp)Popular](https://modelslab.com/models/modelslab/cogvideox)[ModelsLab](https://modelslab.com/models/modelslab)

 [CogVideoX

Open Source Model](https://modelslab.com/models/modelslab/cogvideox)

[![wan2.1](https://images.stablediffusionapi.com/?Image=https://assets.modelslab.ai/generations/04d08a15-bc50-43e7-96e5-5342c249cf50.webp)](https://modelslab.com/models/modelslab/wan2.1)[ModelsLab](https://modelslab.com/models/modelslab)

 [wan2.1

Open Source Model](https://modelslab.com/models/modelslab/wan2.1)

About Gemini Omni — Text To Video
---

Gemini Omni — Text To Video is a video generation AI model by Google available on ModelsLab. Access Gemini Omni — Text To Video through our API with pay-per-use pricing and no minimum commitments.

### Technical Specifications

Model IDgemini-omni-text-to-video

ProviderGoogle

CategoryVideo Models

TaskVideo Generation

Price$0.12 per second

AddedJuly 1, 2026

### Key Features

- AI video generation from text or image input
- Motion control and camera movement parameters
- Adjustable frame rate and video duration
- High-quality cinematic output up to 1080p
- Native audio generation support

### Quick Start

Integrate Gemini Omni — Text To Video into your application with a single API call. Get your API key from the [pricing page](https://modelslab.com/pricing) to get started.

PythonJavaScriptcURLPHP

```
<code>import requests
import json

url = "https://modelslab.com/api/v7/video-fusion/text-to-video"

headers = {
    "Content-Type": "application/json"
}

data = {
        "model_id": "gemini-omni-text-to-video",
        "prompt": "your prompt here",
        "key": "YOUR_API_KEY"
    }

try:
    response = requests.post(url, headers=headers, json=data)
    response.raise_for_status()  # Raises an HTTPError for bad responses (4XX or 5XX)
    result = response.json()
    print("API Response:")
    print(json.dumps(result, indent=2))
except requests.exceptions.HTTPError as http_err:
    print(f"HTTP error occurred: {http_err} - {response.text}")
except Exception as err:
    print(f"Other error occurred: {err}")</code>
```

### Pricing

Gemini Omni — Text To Video API costs $0.120000 per second. Pay only for what you use with no minimum commitments. [View pricing plans](https://modelslab.com/pricing)

### Use Cases

- Marketing and promotional video creation
- Social media short-form video content
- Product demos and explainer videos
- Creative storytelling and animation

[Browse Video Models](https://modelslab.com/models?feature=videogen) [More from Google](https://modelslab.com/models/google) [View Pricing](https://modelslab.com/pricing)

Gemini Omni — Text To Video FAQ
---

### What is Gemini Omni — Text To Video?

Gemini Omni — Text To Video is a video generation AI model by Google available on ModelsLab. Access it through our API with pay-per-use pricing and no minimum commitments.

### How do I use the Gemini Omni — Text To Video API?

You can integrate Gemini Omni — Text To Video into your application with a single API call. Sign up on ModelsLab to get your API key, then use the model ID "gemini-omni-text-to-video" in your API requests. We provide SDKs for Python, JavaScript, and cURL examples in the API documentation.

### How much does Gemini Omni — Text To Video cost?

Gemini Omni — Text To Video costs $0.120000 per second. ModelsLab uses pay-per-use pricing with no minimum commitments. A free tier is available to get started.

### What is the Gemini Omni — Text To Video model ID?

The model ID for Gemini Omni — Text To Video is "gemini-omni-text-to-video". Use this ID in your API requests to specify this model.

### Does Gemini Omni — Text To Video have a free tier?

Yes, ModelsLab offers a free tier that lets you try Gemini Omni — Text To Video and other AI models. Sign up to get free API credits and start building immediately.

---

*This markdown version is optimized for AI agents and LLMs.*

**Links:**
- [Website](https://modelslab.com)
- [API Documentation](https://docs.modelslab.com)
- [Blog](https://modelslab.com/blog)

---
*Generated by ModelsLab - 2026-07-03*