Grok Imagine Video Image-to-Video
Grok Imagine Video Image-to-Video is X-AI's image animation model that brings static images to life. Upload a reference image and describe the motion you want — the model generates a cinematic video with smooth, natural movement and consistent visual quality.
Why Choose This?
-
Image-driven generation
Transform any still image into a dynamic video with natural, fluid motion.
-
Flexible duration
Generate videos at 6 or 10 seconds to match your scene pacing.
-
Resolution options
Output in 720p or 480p based on your quality and speed requirements.
-
Prompt Enhancer
Built-in tool to automatically refine and strengthen your motion descriptions for better results.
Parameters
| Parameter | Required | Description |
|---|
| image | Yes | Reference image to animate (URL or file upload). |
| prompt | Yes | Text description of the desired motion, camera movement, and scene. |
| duration | No | Video length in seconds. Options: 6, 10. |
| resolution | No | Output resolution: 720p (default) or 480p. |
How to Use
- Upload your image — provide the reference image via URL or drag-and-drop upload.
- Write your prompt — describe the motion, camera movement, and scene details. Use the Prompt Enhancer for better results.
- Set duration — choose 6 or 10 seconds based on your scene length.
- Select resolution — 720p for higher quality, 480p for faster processing.
- Run — submit and download your video.
Pricing
| Duration | Cost |
|---|
| 6s | $0.30 |
| 10s | $0.50 |
Billing Rules
- Rate: $0.05 per second
- Duration options: 6 or 10 seconds
- Billing is based on the selected duration, not actual playback length
Best Use Cases
- Photo Animation — Bring portraits, landscapes, and product images to life with natural motion.
- Social Media Content — Create engaging video clips from static images for Reels, TikTok, and Shorts.
- Marketing & Ads — Generate dynamic promotional videos from product photos without a film crew.
- Storytelling — Animate illustrations and concept art to build visual narratives.
- Creative Projects — Explore motion concepts and cinematic ideas from reference images.
Pro Tips
- Use the Prompt Enhancer to refine your motion descriptions before generating.
- Be specific about camera movement (pan, zoom, dolly) and subject behavior in your prompt.
- Use high-quality, well-lit source images for sharper, more consistent video output.
- Start with a 6-second generation to test your prompt before committing to a 10-second run.
- Describe both motion and atmosphere in your prompt for richer results.
Notes
- Both image and prompt are required fields.
- Ensure image URLs are publicly accessible; a preview thumbnail in the interface confirms the URL is reachable.
- Maximum duration is 10 seconds.
Related Models
- Grok Imagine Video Edit — Edit existing videos with text instructions.
- Grok Imagine Image Text-to-Image — Generate images from text prompts.
- Grok Imagine Image Edit — Edit images with text instructions.