
Seedance 2.0 Fast is the speed-optimized version of ByteDance Seed's latest video generation model. The Text-to-Video mode generates cinematic videos from text prompts with native audio synchronization. It is faster and costs 33% less than the standard version, which suits rapid iteration and high-volume production.

- **Speed-optimized generation:** Faster processing for quick turnaround on video projects, suited to iteration and prototyping.
- **33% lower cost:** $0.80 per 5 seconds versus $1.20 for the standard version, suited to high-volume production.
- **Unified multimodal architecture:** Same Seedance 2.0 foundation handling text, image, audio, and video inputs.
- **Native audio-visual synchronization:** Generates video with synchronized audio in a single pass.
- **Director-level control:** Camera movement, lighting, shadows, and character performance are controlled through prompts.
- **Strong motion stability:** Coherent motion with stable subjects and fluid transitions.

| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Detailed description of the cinematic scene |
| aspect_ratio | No | Output format: 16:9 (default), 9:16, 4:3, 3:4, 1:1, 21:9 |
| duration | No | Video length: 5 (default), 10, or 15 seconds |
| resolution | No | Output resolution: 480p, 720p (default), or 1080p |
| reference_images | No | Reference image URLs to guide style, characters, or composition |
| reference_videos | No | Reference video URLs (total length must not exceed 15 seconds) |
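
The parameter table above can be turned into a small client-side check before a request is sent. The sketch below is a minimal illustration, not the provider's API: the function name `build_request` and the shape of the returned dictionary are assumptions, and the table does not say whether `duration` is sent as an integer or a string (an integer is assumed here). Only the parameter names, defaults, and allowed values come from the table.

```python
from typing import Any, Dict, List, Optional

# Allowed values copied from the parameter table.
ASPECT_RATIOS = ("16:9", "9:16", "4:3", "3:4", "1:1", "21:9")
DURATIONS = (5, 10, 15)  # seconds
RESOLUTIONS = ("480p", "720p", "1080p")


def build_request(
    prompt: str,
    aspect_ratio: str = "16:9",
    duration: int = 5,
    resolution: str = "720p",
    reference_images: Optional[List[str]] = None,
    reference_videos: Optional[List[str]] = None,
) -> Dict[str, Any]:
    """Hypothetical helper: validate inputs and return a request payload dict."""
    if not prompt or not prompt.strip():
        raise ValueError("prompt is required")
    if aspect_ratio not in ASPECT_RATIOS:
        raise ValueError(f"aspect_ratio must be one of {ASPECT_RATIOS}")
    if duration not in DURATIONS:
        raise ValueError(f"duration must be one of {DURATIONS}")
    if resolution not in RESOLUTIONS:
        raise ValueError(f"resolution must be one of {RESOLUTIONS}")

    payload: Dict[str, Any] = {
        "prompt": prompt,
        "aspect_ratio": aspect_ratio,
        "duration": duration,
        "resolution": resolution,
    }
    # Optional fields are included only when provided.
    if reference_images:
        payload["reference_images"] = list(reference_images)
    if reference_videos:
        # The 15-second total-length limit cannot be checked from URLs alone;
        # it has to be verified against the actual video files.
        payload["reference_videos"] = list(reference_videos)
    return payload


if __name__ == "__main__":
    print(build_request("A slow dolly shot through a rain-soaked neon street"))
```

Validating locally avoids paying for a run that would be rejected because of a mistyped aspect ratio or an unsupported duration.
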

| Resolution | Duration | Without Reference Videos | With Reference Videos |
|---|---|---|---|
| 480p | 5 s | $0.60 | $1.20 |
| 480p | 10 s | $1.20 | $2.40 |
| 480p | 15 s | $1.80 | $3.60 |
| 720p | 5 s | $1.20 | $2.40 |
| 720p | 10 s | $2.40 | $4.80 |
| 720p | 15 s | $3.60 | $7.20 |
| 1080p | 5 s | $1.80 | $3.60 |
| 1080p | 10 s | $3.60 | $7.20 |
| 1080p | 15 s | $5.40 | $10.80 |
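
The pricing table above can be encoded as a lookup to estimate the cost of a run or the number of runs a budget covers. This is a rough sketch: the names `PRICE_CENTS`, `estimate_cost_cents`, and `runs_within_budget` are illustrative, and prices are stored in cents to avoid floating-point rounding. The values themselves are copied from the table.

```python
# (resolution, duration in seconds) -> (cents without reference videos,
#                                       cents with reference videos)
PRICE_CENTS = {
    ("480p", 5): (60, 120),
    ("480p", 10): (120, 240),
    ("480p", 15): (180, 360),
    ("720p", 5): (120, 240),
    ("720p", 10): (240, 480),
    ("720p", 15): (360, 720),
    ("1080p", 5): (180, 360),
    ("1080p", 10): (360, 720),
    ("1080p", 15): (540, 1080),
}


def estimate_cost_cents(resolution="720p", duration=5, with_reference_videos=False):
    """Return the price of one run in cents, looked up from the table."""
    try:
        without_ref, with_ref = PRICE_CENTS[(resolution, duration)]
    except KeyError:
        raise ValueError(
            f"no listed price for {resolution} at {duration} s"
        ) from None
    return with_ref if with_reference_videos else without_ref


def runs_within_budget(budget_dollars, **settings):
    """Return how many whole runs fit in the budget for the given settings."""
    budget_cents = round(budget_dollars * 100)
    return budget_cents // estimate_cost_cents(**settings)


if __name__ == "__main__":
    cents = estimate_cost_cents("1080p", 15, with_reference_videos=True)
    print(f"${cents / 100:.2f}")  # $10.80
    print(runs_within_budget(10, resolution="480p", duration=5))  # 16
```

The listed prices scale linearly with duration within each resolution, and every price with reference videos is exactly double the price without them, so a formula would also work. A lookup is used here so that the code stays correct only for combinations the table actually lists.
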