Nano Banana Pro | Nano Banana 2Mar.13 - 26 (UTC+8) 25% off
WaveSpeed.ai
Home/Explore/OpenAI Models/openai/sora-2-pro/text-to-video
text-to-video

text-to-video

OpenAI Sora 2

openai/sora-2-pro/text-to-video

OpenAI Sora 2 Pro is a state-of-the-art text-to-video model with realistic physics, synchronized audio, and strong steerability. Supports multiple resolutions up to 1080p and durations up to 20 seconds. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Input

Idle

Your request will cost $1.2 per run.

One more thing:

ExamplesView all

README

OpenAI Sora 2 Pro Text-to-Video

Sora 2 Pro is OpenAI's flagship text-to-video model, generating cinematic-quality videos from text descriptions. Built on OpenAI's most advanced video generation technology, it delivers stunning visual fidelity, realistic physics, and coherent long-form video up to 20 seconds — setting the benchmark for AI video generation.

Why Choose This?

  • OpenAI's best video model The most advanced text-to-video model from OpenAI, delivering state-of-the-art quality.

  • Realistic physics Accurate simulation of real-world physics, motion, and object interactions.

  • Long-form generation Generate coherent videos up to 20 seconds in length.

  • Multiple resolutions Support for 720p, 1024p, and 1080p in both landscape and portrait orientations.

  • Cinematic quality Professional-grade output suitable for commercial and creative production.

  • Prompt Enhancer Built-in tool to automatically improve your descriptions.

Parameters

ParameterRequiredDescription
promptYesText description of the desired video
sizeNoResolution: 1280×720, 720×1280, 1792×1024, 1024×1792, 1920×1080, 1080×1920
durationNoVideo length: 4, 8, 12, 16, 20 seconds (default: 4)

Size Options

SizeOrientationResolution
1280×720Landscape720p
720×1280Portrait720p
1792×1024Landscape1024p
1024×1792Portrait1024p
1920×1080Landscape1080p
1080×1920Portrait1080p

How to Use

  1. Write your prompt — describe the scene, action, camera movement, and style in detail.
  2. Select size — choose resolution and orientation based on your delivery platform.
  3. Set duration — choose 4, 8, 12, 16, or 20 seconds.
  4. Use Prompt Enhancer (optional) — click to refine your description.
  5. Run — submit and download your generated video.

Pricing

Duration720p1024p1080p
4 s$1.20$2.00$2.80
8 s$2.40$4.00$5.60
12 s$3.60$6.00$8.40
16 s$4.80$8.00$11.20
20 s$6.00$10.00$14.00

Billing Rules

  • 720p rate: $0.30 per second
  • 1024p rate: $0.50 per second
  • 1080p rate: $0.70 per second
  • Duration options: 4, 8, 12, 16, or 20 seconds

Best Use Cases

  • Commercial Production — Generate broadcast-quality video content for advertising.
  • Film Pre-visualization — Create high-fidelity concept footage for productions.
  • Premium Content Creation — Produce top-tier videos for brands and agencies.
  • Creative Exploration — Visualize complex scenes with realistic physics and motion.
  • Long-Form Storytelling — Generate extended coherent narratives up to 20 seconds.

Pro Tips

  • Be specific and detailed in your prompts — Sora excels at interpreting nuanced descriptions.
  • Include camera movement descriptions (e.g., "slow dolly forward," "aerial tracking shot").
  • Describe lighting, atmosphere, and mood for more cinematic results.
  • Use 720p for iterations and testing; 1080p for final production output.
  • Longer durations maintain coherence — Sora handles extended sequences well.

Notes

  • Prompt is the only required field.
  • Duration options: 4, 8, 12, 16, or 20 seconds (in 4-second increments).
  • Higher resolutions cost more per second.
  • Ensure your prompts comply with OpenAI's usage policies.

Related Models