Bytedance Seedance 2.0 Text To Video

Seedance 2.0 (Text-to-Video) generates Hollywood-grade cinematic videos from text prompts with native audio-visual synchronization, director-level camera and lighting control, and exceptional motion stability. Built on Seed’s unified multimodal architecture, it leads on instruction adherence, motion quality, and visual aesthetics.

Features

Seedance 2.0 Text-to-Video

Seedance 2.0 is Seed’s latest video generation model, built on a unified multimodal architecture that accepts text, image, audio, and video inputs. The Text-to-Video mode generates production-grade cinematic videos from text prompts alone — with native audio, director-level control, and exceptional motion stability.

Key Features

Unified multimodal architecture: A single model that handles text, image, audio, and video inputs for comprehensive creative flexibility.
Native audio-visual synchronization: Generates video with synchronized audio in a single pass, with no separate audio generation step.
Director-level control: Granular control over camera movement, lighting, shadows, and character performance through natural language prompts.
Production-grade cinematic quality: Hollywood-grade visual fidelity with dramatic lighting, professional color grading, and smooth natural motion.
Exceptional motion stability: Industry-leading motion coherence with stable subjects, consistent physics, and fluid transitions.
Strong instruction adherence: Accurately follows detailed scene descriptions, shot compositions, and creative direction.

Parameters

Parameter	Required	Description
prompt	Yes	Detailed description of the cinematic scene
aspect_ratio	No	Output format: 16:9 (default), 9:16, 4:3, 3:4, 1:1, 21:9
duration	No	Video length in seconds: 4-15 (default: 5)
resolution	No	Output resolution: 480p, 720p (default), or 1080p
reference_images	No	Reference image URLs to guide style, characters, or composition
reference_videos	No	Reference video URLs (total length must not exceed 15 seconds)
reference_audios	No	Reference audio URLs (total length must not exceed 15 seconds)

How to Use

Write your prompt — describe the scene with cinematic detail: lighting, mood, camera movement, action, and style.
Select aspect ratio — 16:9 for widescreen, 9:16 for vertical, 4:3 or 3:4 for classic formats.
Set duration — choose any duration from 4 to 15 seconds.
Optionally add references — provide reference images, videos, or audios for style guidance.
Run — submit and download your cinematic video with synchronized audio.
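
The steps above can be sketched as a small Python helper that assembles the request body and checks each value against the ranges in the Parameters table. The function name `build_payload` and the choice to raise `ValueError` are illustrative assumptions, not part of any official SDK; the API itself may validate differently.

```python
# Allowed values copied from the Parameters table above.
ASPECT_RATIOS = {"16:9", "9:16", "4:3", "3:4", "1:1", "21:9"}
RESOLUTIONS = {"480p", "720p", "1080p"}


def build_payload(prompt, aspect_ratio="16:9", duration=5, resolution="720p",
                  reference_images=None, reference_videos=None,
                  reference_audios=None):
    """Hypothetical helper: validate inputs and return a request body dict."""
    if not isinstance(prompt, str) or not prompt.strip():
        raise ValueError("prompt is required")
    if aspect_ratio not in ASPECT_RATIOS:
        raise ValueError(f"unsupported aspect_ratio: {aspect_ratio}")
    if resolution not in RESOLUTIONS:
        raise ValueError(f"unsupported resolution: {resolution}")
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(duration, bool) or not isinstance(duration, int) \
            or not 4 <= duration <= 15:
        raise ValueError("duration must be an integer from 4 to 15")

    payload = {
        "prompt": prompt,
        "aspect_ratio": aspect_ratio,
        "duration": duration,
        "resolution": resolution,
    }
    # Optional references are only included when supplied.
    optional = {
        "reference_images": reference_images,
        "reference_videos": reference_videos,
        "reference_audios": reference_audios,
    }
    for key, urls in optional.items():
        if urls:
            payload[key] = list(urls)
    return payload
```

Calling `build_payload("A lighthouse at dusk, slow dolly-in")` returns a body with the documented defaults (16:9, 5 seconds, 720p). This sketch does not check the 15-second total limit on reference videos and audios, since that requires inspecting the media files.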

Pricing

Without Reference Videos

Billed per second of output duration, anchored at $0.60 per 5 seconds ($0.12 per second) at 480p.

Resolution	Duration	Cost
480p	5 s	$0.60
480p	10 s	$1.20
480p	15 s	$1.80
720p	5 s	$1.20
720p	10 s	$2.40
720p	15 s	$3.60
1080p	5 s	$3.00
1080p	10 s	$6.00
1080p	15 s	$9.00
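
The table follows directly from the per-second rate and the resolution multipliers, so durations not listed can be estimated the same way. The sketch below is an illustration only: the function name is hypothetical, and working in whole cents is a choice made here to avoid floating-point drift, not something the billing system is known to do.

```python
# $0.60 per 5 s at 480p works out to 12 cents per second.
BASE_CENTS_PER_SECOND = 12
# 720p costs 2x and 1080p costs 5x the 480p price.
RESOLUTION_MULTIPLIER = {"480p": 1, "720p": 2, "1080p": 5}


def cost_without_reference_videos(resolution, duration_s):
    """Estimate the price in dollars for a request with no reference videos."""
    if resolution not in RESOLUTION_MULTIPLIER:
        raise ValueError(f"unsupported resolution: {resolution}")
    if not 4 <= duration_s <= 15:
        raise ValueError("duration must be between 4 and 15 seconds")
    cents = BASE_CENTS_PER_SECOND * RESOLUTION_MULTIPLIER[resolution] * duration_s
    return cents / 100
```

For example, `cost_without_reference_videos("720p", 10)` reproduces the $2.40 row above, and a 7-second 720p clip comes to 12 x 2 x 7 = 168 cents.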

With Reference Videos

When reference_videos are provided, billing follows the same scheme as Seedance 2.0 Video-Edit: you pay per second for input duration plus output duration. Input duration is the total length of the supplied reference videos, clamped to the 2-15 s range, so references totaling less than 2 s are billed as 2 s of input.

Resolution	Per second
480p	$0.075
720p	$0.15
1080p	$0.375

Examples (reference videos totaling 5 s, output 5 s = 10 billed seconds):

Resolution	Cost
480p	$0.75
720p	$1.50
1080p	$3.75
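
The same calculation can be written as a short Python sketch. The function name is hypothetical, and it assumes durations are given in seconds; how the service rounds fractional reference lengths is not stated here, so the sketch simply multiplies whatever value it is given.

```python
from decimal import Decimal

# Per-second rates from the table above.
RATE_PER_SECOND = {
    "480p": Decimal("0.075"),
    "720p": Decimal("0.15"),
    "1080p": Decimal("0.375"),
}


def cost_with_reference_videos(resolution, reference_total_s, output_s):
    """Estimate the price in dollars when reference_videos are supplied."""
    if resolution not in RATE_PER_SECOND:
        raise ValueError(f"unsupported resolution: {resolution}")
    # Input duration is clamped to the 2-15 s range before billing.
    billed_input = min(max(reference_total_s, 2), 15)
    billed_seconds = billed_input + output_s
    # Convert through str so float inputs do not carry binary rounding noise.
    return RATE_PER_SECOND[resolution] * Decimal(str(billed_seconds))
```

With 5 s of reference video and a 5 s output, the function bills 10 seconds and matches the example table. A 1 s reference clip is billed as 2 s of input because of the lower clamp.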

Billing Rules

Without reference videos: $0.60 per 5 seconds at 480p, scaled by resolution; prorated per second.
With reference videos: per-second billing matching Seedance 2.0 Video-Edit, using the total reference-video duration as input (clamped 2-15 s) plus the output duration.
720p: 2x the 480p price.
1080p: 5x the 480p price (2.5x the 720p price).
Duration range: any whole number of seconds from 4 to 15 (the API takes duration as an integer).

Best Use Cases

Film & Production — Generate cinematic footage for professional video projects.
Commercials & Ads — Create high-end promotional content with Hollywood aesthetics.
Music Videos — Produce visually stunning sequences with native audio sync.
Social Media Premium — Stand out with film-quality short-form content.
Concept Visualization — Pitch film and TV concepts with production-quality previews.

Pro Tips

Write prompts like a film director — include lighting (e.g., “dramatic rim lighting”), camera angles, and mood.
Use 16:9 for cinematic widescreen; 9:16 for premium vertical content.
Include specific visual details for best results (e.g., “golden hour sunlight casting long shadows”).
Describe character expressions and actions for more engaging scenes.
Start with a short duration (4-5s) to iterate on the look, then extend up to 15s.

Notes

Native audio generation is included — videos come with synchronized sound.
Duration range: any whole number of seconds from 4 to 15.
Built on the same architecture as Seedance 2.0 Image-to-Video.

Related Models

Seedance 2.0 Image-to-Video — Generate video from reference images + prompt.
Seedance 2.0 Fast Text-to-Video — Faster generation at lower cost.
Seedance 2.0 Fast Image-to-Video — Fast image-guided video generation.
Seedance V1.5 Pro Text-to-Video — Previous generation Seedance model.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task ("prompt" is required; the value below is only a placeholder)
curl --location --request POST "https://api.wavespeed.ai/api/v3/bytedance/seedance-2.0/text-to-video" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "prompt": "A lighthouse on a rocky coast at dusk, slow dolly-in, dramatic rim lighting",
    "aspect_ratio": "16:9",
    "resolution": "720p",
    "duration": 5,
    "enable_web_search": false,
    "generate_audio": true
}'

# Get the result (set requestId to the data.id value returned by the submit call)
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
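
The same submit-then-query flow can be sketched in Python using only the standard library. The endpoint URLs and headers mirror the curl commands above; the function names, the 2-second poll interval, and the poll limit are assumptions chosen for illustration rather than documented recommendations.

```python
import json
import time
import urllib.request

API_BASE = "https://api.wavespeed.ai/api/v3"
MODEL_PATH = "bytedance/seedance-2.0/text-to-video"


def submit_request(api_key, payload):
    """Build the POST request that submits a generation task."""
    return urllib.request.Request(
        f"{API_BASE}/{MODEL_PATH}",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json",
                 "Authorization": f"Bearer {api_key}"},
        method="POST",
    )


def result_request(api_key, task_id):
    """Build the GET request that queries a task's result."""
    return urllib.request.Request(
        f"{API_BASE}/predictions/{task_id}/result",
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )


def send_json(request):
    """Send a request over the network and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.load(response)


def generate(api_key, payload, transport=send_json,
             poll_interval_s=2.0, max_polls=300):
    """Submit a task, then poll until it completes; returns the output URLs.

    poll_interval_s and max_polls are illustrative defaults, not documented
    limits. transport can be swapped out for testing without network access.
    """
    task_id = transport(submit_request(api_key, payload))["data"]["id"]
    for _ in range(max_polls):
        data = transport(result_request(api_key, task_id))["data"]
        if data["status"] == "completed":
            return data["outputs"]
        if data["status"] == "failed":
            raise RuntimeError(data.get("error") or "task failed")
        time.sleep(poll_interval_s)
    raise TimeoutError(f"task {task_id} did not finish after {max_polls} polls")
```

A typical call would read the key from the environment, for example `generate(os.environ["WAVESPEED_API_KEY"], {"prompt": "..."})` after `import os`. Keeping the request builders separate from the network call makes the flow easy to test with a stub transport.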

Parameters

Task Submission Parameters

Request Parameters

Parameter	Type	Required	Default	Range	Description
prompt	string	Yes	-	-	Describe the scene, action, camera movement, and mood for the video.
reference_images	array	No	-	-	Reference image URLs to guide visual style, characters, or scene composition.
reference_videos	array	No	-	-	Reference video URLs (total length must not exceed 15 seconds).
reference_audios	array	No	-	-	Reference audio URLs (total length must not exceed 15 seconds).
aspect_ratio	string	No	16:9	16:9, 9:16, 4:3, 3:4, 1:1, 21:9	The aspect ratio of the generated video.
resolution	string	No	720p	480p, 720p, 1080p	The output video resolution.
duration	integer	No	5	4 ~ 15	The duration of the generated video in seconds (4-15s).
enable_web_search	boolean	No	false	-	Enable web search for real-time information.
generate_audio	boolean	No	true	-	Whether to generate native audio synchronized with the output video. Defaults to true.

Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data.id	string	Unique identifier for the prediction (the task ID)
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Result Request Parameters

Parameter	Type	Required	Default	Description
id	string	Yes	-	Task ID

Result Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data	object	The prediction data object containing all details
data.id	string	Unique identifier for the prediction (the task ID that was queried)
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds
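
As an illustration of how these fields fit together, the sketch below maps a result response onto a small Python dataclass. The class and function names are hypothetical, and treating `completed` and `failed` as the only terminal states is an inference from the status values listed above.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Statuses after which further polling is assumed to be pointless.
TERMINAL_STATUSES = {"completed", "failed"}


@dataclass
class Prediction:
    id: str
    model: str
    status: str
    outputs: List[str] = field(default_factory=list)
    error: str = ""
    inference_ms: Optional[int] = None

    @property
    def is_finished(self) -> bool:
        """True once the task has either completed or failed."""
        return self.status in TERMINAL_STATUSES


def parse_prediction(body: dict) -> Prediction:
    """Extract the documented fields from a decoded JSON response body."""
    data = body["data"]
    timings = data.get("timings") or {}
    return Prediction(
        id=data["id"],
        model=data.get("model", ""),
        status=data["status"],
        outputs=list(data.get("outputs") or []),
        error=data.get("error") or "",
        inference_ms=timings.get("inference"),
    )
```

Since `data.outputs` is empty until the status is `completed`, callers would normally check `is_finished` and the status before reading `outputs`.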
