Nano Banana 2 & Pro Sale — 15% OFF | Apr 1–15 Only

Grok Models

xAI's most advanced AI models with real-time knowledge and vision capabilities.

xAI's most advanced AI models with real-time knowledge and vision capabilities.

All Models

8 models
text-to-image

x-ai/grok-2-image

Grok 2 Image is xAI’s latest image generation model that turns simple text prompts into sharp, photorealistic visuals in seconds. From product shots to social posts and concept art, it follows your instructions closely so you can go from idea to production-ready image with just one prompt. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-image

x-ai/grok-imagine-image/edit

X-AI Grok Imagine Image enables precise image editing with xAI's Grok Imagine model. Transform and modify images using text prompts with AI-powered precision. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-image

x-ai/grok-imagine-image/text-to-image

X-AI Grok Imagine Image enables precise image editing with xAI's Grok Imagine model. Transform and modify images using text prompts with AI-powered precision. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-video

x-ai/grok-imagine-video/text-to-video

X-AI Grok Imagine Video generates videos from text descriptions using xAI's Grok Imagine Video model. Create high-quality videos with customizable duration, aspect ratio, and resolution. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-video

x-ai/grok-imagine-video/image-to-video

X-AI Grok Imagine Video transforms images into videos using xAI's Grok Imagine Video model. Animate still images with natural motion, scene continuity, and synchronized audio. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

video-to-video

x-ai/grok-imagine-video/edit-video

X-AI Grok Imagine Video Edit enables video editing using xAI's Grok Imagine Video model. Transform and modify existing videos with text prompts for seamless AI-powered edits. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

video-extend

x-ai/grok-imagine-video/video-extend

X-AI Grok Imagine Video Extend turns short clips into longer videos with smooth motion continuity and natural scene extension. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-video

x-ai/grok-imagine-video/reference-to-video

X-AI Grok Imagine Video Reference-to-Video generates videos from multiple reference images with preserved identity, style, and scene composition. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Grok Models

xAI's Grok Imagine is a powerful suite of AI-native image and video generation models, offering full creative control from text-to-image, image editing, to multi-modal video generation. Built on xAI's frontier reasoning capabilities, Grok Imagine delivers exceptional prompt understanding, cinematic quality, and production-ready outputs.

🎬 Grok Imagine Video — Edit, Animate & Generate

Grok Imagine Video provides three specialized endpoints for video creation: generate from text, animate images, or transform existing videos with AI-powered editing.

  1. Grok Imagine Video Text-to-Video — Generate high-quality videos from text prompts with strong motion coherence and cinematic framing.
  2. x-ai/grok-imagine-video/text-to-video
  3. Grok Imagine Video Image-to-Video — Bring still images to life with natural, fluid motion while preserving subject identity and composition.
  4. x-ai/grok-imagine-video/image-to-video
  5. Grok Imagine Video Edit — Transform and remix existing videos with AI-powered editing for style transfer, scene modification, and creative effects.
  6. x-ai/grok-imagine-video/edit-video

🖼️ Grok Imagine Image — Create & Edit

Grok's image generation models deliver stunning visuals with exceptional prompt adherence and artistic versatility.

  1. Grok Imagine Image Text-to-Image — Generate detailed, photorealistic or stylized images from text with superior prompt understanding.
  2. x-ai/grok-imagine-image/text-to-image
  3. Grok Imagine Image Edit — Precisely edit and refine images with controlled modifications while maintaining visual consistency.
  4. x-ai/grok-imagine-image/edit
  5. Grok 2 Image — xAI's flagship text-to-image model with frontier-level quality and creative flexibility.
  6. x-ai/grok-2-image

✨ Highlights

  1. Frontier Prompt Understanding: Powered by Grok's advanced reasoning for exceptional text comprehension and creative interpretation.
  2. Cinematic Video Quality: Smooth motion, consistent subjects, and professional-grade output.
  3. Versatile Image Generation: From photorealistic to artistic styles with precise control.
  4. Full Creative Pipeline: Text-to-image, image editing, and multi-modal video generation in one unified suite.
  5. Production-Ready: Fast inference with reliable, consistent results for commercial workflows.