Seedance 2.0 | Special Offer ✦ 10% OFF NOW
Home/Explore/OpenAI Models/openai/sora-2-pro/text-to-video

OpenAI Sora 2

openai /

OpenAI Sora 2 Pro is a state-of-the-art text-to-video model with realistic physics, synchronized audio, and strong steerability. Supports multiple resolutions up to 1080p and durations up to 20 seconds. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-video
Input

Idle

Your request will cost $1.2 per run.

One more thing:

ExamplesView all

README

Notice — Service Stability

The Sora 2 family is currently unstable. Generations may fall back to alternative models without notice and the service can be temporarily unavailable. OpenAI is also expected to discontinue this model in the future.

If you need an equally capable, stable alternative, we recommend Seedance 2: bytedance/seedance-2.0/text-to-video.

OpenAI Sora 2 Pro Text-to-Video

Sora 2 Pro is OpenAI's premium video and audio generator. It advances prior video models with more accurate physics, sharper realism, synchronized audio, stronger steerability, and a wider stylistic range — built on the original Sora foundation.

Now with character consistency — create reusable character IDs and feature them across multiple videos with the same identity.

Why It Looks Great

  • Physics-aware motion: learns contact, inertia, and momentum so objects move and collide believably.
  • Temporal consistency: stable identities, minimal flicker/ghosting, and clean frame-to-frame transitions.
  • Synchronized audio: lip-sync alignment, beat-aware cuts, and ambience that matches on-screen action.
  • High-frequency detail: preserves fine textures (skin, fabric, foliage) without plastic over-sharpening.
  • Complex scene reasoning: handles multiple subjects, occlusions, depth, and long camera moves coherently.
  • Cinematic camera literacy: natural pans, push-ins, and handheld vibes without warping or jelly-artifacts.
  • Wide stylistic range: from photoreal and documentary to anime, 3D, and illustrative aesthetics.
  • Strong steerability: responds predictably to prompt edits and control settings (duration, fps, motion strength).

Parameters

ParameterRequiredDescription
promptYesDescribe scene, style, camera, and audio cues
sizeNoOutput resolution (see options below)
durationNoVideo length: 4, 8, 12, 16, or 20 seconds
charactersNoList of character IDs for consistent identity

Size Options

  • 720×1280 / 1280×720 (720p)
  • 1024×1792 / 1792×1024 (1024p)
  • 1080×1920 / 1920×1080 (1080p)

How to Use

  1. Write your prompt — describe scene, style, camera movement, and audio cues.
  2. Select size — choose resolution and orientation.
  3. Set duration — select 4, 8, 12, 16, or 20 seconds.
  4. Add characters (optional) — paste character IDs from Sora 2 Characters.
  5. Submit — generate, preview, and download when ready.

Pricing

Duration720p1024p1080p
4 s$1.20$2.00$2.80
8 s$2.40$4.00$5.60
12 s$3.60$6.00$8.40
16 s$4.80$8.00$11.20
20 s$6.00$10.00$14.00

Billing Rules

  • 720p rate: $0.30 per second
  • 1024p rate: $0.50 per second
  • 1080p rate: $0.70 per second
  • Duration options: 4, 8, 12, 16, or 20 seconds

Notes

Related Models