Browse ModelsOpenaiOpenai Sora 2 Pro Text To Video

Openai Sora 2 Pro Text To Video

Openai Sora 2 Pro Text To Video

Playground

Try it on WavespeedAI!

OpenAI Sora 2 Pro is a state-of-the-art text-to-video model with realistic physics, synchronized audio, and strong steerability. Supports multiple resolutions up to 1080p and durations up to 20 seconds. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Features

OpenAI Sora 2 Pro Text-to-Video

Sora 2 Pro is OpenAI’s premium video and audio generator. It advances prior video models with more accurate physics, sharper realism, synchronized audio, stronger steerability, and a wider stylistic range — built on the original Sora foundation.

Now with character consistency — create reusable character IDs and feature them across multiple videos with the same identity.


Why It Looks Great

  • Physics-aware motion: learns contact, inertia, and momentum so objects move and collide believably.
  • Temporal consistency: stable identities, minimal flicker/ghosting, and clean frame-to-frame transitions.
  • Synchronized audio: lip-sync alignment, beat-aware cuts, and ambience that matches on-screen action.
  • High-frequency detail: preserves fine textures (skin, fabric, foliage) without plastic over-sharpening.
  • Complex scene reasoning: handles multiple subjects, occlusions, depth, and long camera moves coherently.
  • Cinematic camera literacy: natural pans, push-ins, and handheld vibes without warping or jelly-artifacts.
  • Wide stylistic range: from photoreal and documentary to anime, 3D, and illustrative aesthetics.
  • Strong steerability: responds predictably to prompt edits and control settings (duration, fps, motion strength).

Parameters

ParameterRequiredDescription
promptYesDescribe scene, style, camera, and audio cues
sizeNoOutput resolution (see options below)
durationNoVideo length: 4, 8, 12, 16, or 20 seconds
charactersNoList of character IDs for consistent identity

Size Options

  • 720×1280 / 1280×720 (720p)
  • 1024×1792 / 1792×1024 (1024p)
  • 1080×1920 / 1920×1080 (1080p)

How to Use

  1. Write your prompt — describe scene, style, camera movement, and audio cues.
  2. Select size — choose resolution and orientation.
  3. Set duration — select 4, 8, 12, 16, or 20 seconds.
  4. Add characters (optional) — paste character IDs from Sora 2 Characters.
  5. Submit — generate, preview, and download when ready.

Pricing

Duration720p1024p1080p
4 s$1.20$2.00$2.80
8 s$2.40$4.00$5.60
12 s$3.60$6.00$8.40
16 s$4.80$8.00$11.20
20 s$6.00$10.00$14.00

Billing Rules

  • 720p rate: $0.30 per second
  • 1024p rate: $0.50 per second
  • 1080p rate: $0.70 per second
  • Duration options: 4, 8, 12, 16, or 20 seconds

Notes


Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/openai/sora-2-pro/text-to-video" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "size": "1280*720",
    "duration": 4
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

ParameterTypeRequiredDefaultRangeDescription
promptstringYes-The positive prompt for the generation.
sizestringNo1280*720720*1280, 1280*720, 1024*1792, 1792*1024, 1920*1080, 1080*1920The size of the generated media in pixels (width*height).
durationintegerNo44, 8, 12, 16, 20The duration of the generated video in seconds.
charactersarrayNo--Element reference list. To get available elements and their IDs, visit: https://wavespeed.ai/models/openai/sora-2/characters

Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
data.idstringUnique identifier for the prediction, Task Id
data.modelstringModel ID used for the prediction
data.outputsarrayArray of URLs to the generated content (empty when status is not completed)
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.has_nsfw_contentsarrayArray of boolean values indicating NSFW detection for each output
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds

Result Request Parameters

ParameterTypeRequiredDefaultDescription
idstringYes-Task ID

Result Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
dataobjectThe prediction data object containing all details
data.idstringUnique identifier for the prediction, the ID of the prediction to get
data.modelstringModel ID used for the prediction
data.outputsstringArray of URLs to the generated content (empty when status is not completed).
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds
© 2025 WaveSpeedAI. All rights reserved.