Create & Edit Images Instantly with Grok Imagine

AI Model APIs

Discover and integrate with powerful AI model APIs for your applications

Grok Imagine Image To Video

xAI

Grok Imagine – Image to Video lets you instantly turn your ideas into stunning 1–15 second AI-generated videos. Simply describe your scene, and generate smooth, high-quality videos in 480p and 720p resolution — perfect for social media, ads, storytelling,

Realistic
Grok Imagine Text To Video

xAI

Grok Imagine – Text to Video lets you instantly turn your ideas into stunning 1–15 second AI-generated videos. Simply describe your scene, and generate smooth, high-quality videos in 480p and 720p resolution — perfect for social media, ads, storytelling,

Free for Premium Users
Qwen Voice Design

ModelsLab

Create and customize any AI-generated voice you can imagine from a simple text prompt: choose the tone, style, accent, emotion, age, or personality, and instantly turn your words into natural-sounding speech.

Voice Designer, Ultra Natural, Newly Added
Free for Premium Users
Z-Image-TurboLoraTrainer

ModelsLab

Fast-train your custom models with optimized pipelines that support various image formats and require as little as 16 GB of VRAM for efficient fine-tuning.

Newly Added, Best LoRA Trainer
Free for Premium Users
Qwen Voice Cloning

ModelsLab

The Qwen Text-to-Speech endpoint generates audio from text and a provided reference audio URL, producing speech that mimics the voice in the uploaded clip.

3-Sec Voice Clone, Supports 10 Languages
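
As a rough illustration of the inputs described for Qwen Voice Cloning (text plus a reference audio URL), the sketch below assembles and validates a request body. The function name `build_voice_clone_payload` and the field names `text`, `audio_url`, and `language` are hypothetical placeholders, not the documented ModelsLab schema; the real parameter names should be taken from the API reference.

```python
import json
from urllib.parse import urlparse


def build_voice_clone_payload(text: str, audio_url: str, language: str = "en") -> str:
    """Return a JSON request body for a hypothetical voice-cloning call.

    All field names are assumptions made for illustration only.
    """
    if not text.strip():
        raise ValueError("text must not be empty")
    parsed = urlparse(audio_url)
    # The reference clip is passed by URL, so require an absolute http(s) address.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError("audio_url must be an absolute http(s) URL")
    payload = {
        "text": text,            # words to be spoken (assumed field name)
        "audio_url": audio_url,  # reference voice sample (assumed field name)
        "language": language,    # assumed optional field
    }
    return json.dumps(payload)


body = build_voice_clone_payload("Hello there.", "https://example.com/voice-sample.wav")
print(body)
```

The sketch only builds the payload; sending it, authenticating, and handling the response depend on the actual endpoint and are left out.
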
Grok Imagine Image Edit

xAI

Grok Imagine – Image Edit lets you modify existing images using simple text instructions—add, remove, or change elements while keeping the original image style and details intact.

Fastest Image Edit, Pro Grade Output
Grok Imagine Text To Image

xAI

Generate high-quality 1024x1024 images in 2.3 seconds with efficient 2.1GB GPU memory use, natural language editing, superior character consistency, and real-time style transfers.

Fastest Image Gen, 2K Output, Best for Creators
Free for Premium Users
Z Image Base

ModelsLab

A distilled version of Z-Image that matches or exceeds leading competitors with only 8 NFEs (number of function evaluations).

High Quality Output, Cheapest Price
Free for Premium Users
Qwen Image Edit 2511

ModelsLab

Qwen-Image-Edit-2511 is a versatile AI tool for sophisticated, prompt-based image editing with strong consistency, identity preservation, and mixed-mode control across subjects and scenes.

Precision Image Edit, Best Selling
OpenAI/Sora 2 Pro Text to Video

OpenAI

Sora 2 Pro is an advanced text-to-video AI model that turns simple prompts into high-quality, cinematic videos with realistic motion, consistent characters, and strong scene coherence—built for creators, filmmakers, and production teams.

Native Sync Audio, Filmmaker Grade
wan2.6 Image To Video (Flash)

Alibaba

wan2.6-i2v-flash is an image-to-video generation model in the WAN 2.6 series. It takes a single input image (plus optional text prompt and audio) and generates a short video clip with motion and optionally synchronized sound.

Cheapest Price, Multi-shot Storyteller, 15 sec Output
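
The input shape described for wan2.6-i2v-flash (one required image, plus an optional text prompt and optional audio) could be modeled along these lines. This is a minimal sketch: the class `ImageToVideoRequest` and the field names `image_url`, `prompt`, and `audio_url` are assumptions for illustration, not the provider's documented schema.

```python
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass
class ImageToVideoRequest:
    """Hypothetical request model: one required image, two optional inputs."""

    image_url: str                    # the single source image (required)
    prompt: Optional[str] = None      # optional text describing the motion
    audio_url: Optional[str] = None   # optional audio to synchronize with

    def to_payload(self) -> dict:
        if not self.image_url:
            raise ValueError("image_url is required")
        # Omit optional fields that were not supplied.
        return {key: value for key, value in asdict(self).items() if value is not None}


request = ImageToVideoRequest(image_url="https://example.com/frame.png", prompt="slow pan")
print(request.to_payload())
```

Dropping unset optional fields keeps the payload minimal; whether the real API expects them omitted or sent as null is not stated on this page.
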
Free for Premium Users
Z Image Turbo Image To Image

ModelsLab

The Z-Image Turbo model transforms an existing image into a new version using a text prompt, rather than generating a picture from scratch. You upload a source image and then describe how you want it changed.

Best Selling, Prompt-Based Edit, Turbo Image Transform