Vidu Q3 Start End Image-to-Video turns text prompts into high-quality videos with exceptional visual fidelity and diverse motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Idle
$0.35per run·~28 / $10
A rugged, mid-30s man with short dark hair and a weathered face rides a vintage black motorcycle at high speed through a winding mountain road at dusk. He wears a leather jacket with visible wear, gloves, and goggles pushed up on his forehead, eyes focused ahead with determination. The motorcycle kicks up dust and gravel as it curves sharply, tires screaming against the asphalt. Behind him, the sky glows with orange and purple hues, trees blur past, and distant mountains loom in shadow. Shot in cinematic action style, wide-angle, dynamic motion blur, realistic lighting, high detail, 4K resolution.
Vidu Q3 Start-End-to-Video generates videos with precise control over both the first and last frames. Provide a start image, an end image, and a text prompt — the model creates a smooth, coherent video transition between the two states. Supports multiple resolutions, motion control, and optional audio generation with background music.
Start and end frame control Define both the beginning and ending visuals for precise, predictable video transitions.
Smooth interpolation AI-powered motion generates natural, fluid movement between your two reference frames.
Multiple resolutions Choose from 540p, 720p, or 1080p to balance quality and cost.
Motion control Adjust movement_amplitude to control the intensity of motion in the transition.
Audio generation Optional synchronized audio and background music for complete video content.
Prompt Enhancer Built-in tool to automatically improve your scene descriptions.
| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Text description of the desired motion and action |
| image | Yes | Start frame image (URL or upload) |
| last_image | Yes | End frame image (URL or upload) |
| duration | No | Video length in seconds (default: 5) |
| resolution | No | Output resolution: 540p, 720p, or 1080p (default: 720p) |
| bgm | No | Include background music (default: enabled) |
| generate_audio | No | Whether to generate synchronized audio (default: enabled) |
| movement_amplitude | No | Motion intensity: auto or manual value (default: auto) |
| seed | No | Random seed for reproducible results (default: -1) |
| Resolution | Cost per Second |
|---|---|
| 540p | $0.07 |
| 720p | $0.15 |
| 1080p | $0.16 |
Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/vidu/q3/start-end-to-video with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Q3 Start End To Video below.
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/vidu/q3/start-end-to-video" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $WAVESPEED_API_KEY" \
-d '{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"image": "https://example.com/your-input.jpg",
"duration": 5,
"resolution": "720p",
"bgm": true,
"generate_audio": true,
"movement_amplitude": "auto",
"seed": -1
}'
# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
-H "Authorization: Bearer $WAVESPEED_API_KEY"
# When status is "completed", read the output from data.outputs[0].// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env
const result = await client.run("vidu/q3/start-end-to-video", {
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"image": "https://example.com/your-input.jpg",
"duration": 5,
"resolution": "720p",
"bgm": true,
"generate_audio": true,
"movement_amplitude": "auto",
"seed": -1
});
console.log(result.outputs[0]); // → URL of the generated output# pip install wavespeed
import wavespeed
output = wavespeed.run(
"vidu/q3/start-end-to-video",
{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"image": "https://example.com/your-input.jpg",
"duration": 5,
"resolution": "720p",
"bgm": true,
"generate_audio": true,
"movement_amplitude": "auto",
"seed": -1
}
)
print(output["outputs"][0]) # → URL of the generated outputQ3 Start End To Video is a Vidu model for video generation from images, exposed as a REST API on WaveSpeedAI. Vidu Q3 Start End Image-to-Video turns text prompts into high-quality videos with exceptional visual fidelity and diverse motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/vidu/vidu-q3-start-end-to-video.
Q3 Start End To Video starts at $0.35 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
Key inputs: `prompt`, `image`, `resolution`, `duration`, `seed`, `bgm`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/vidu/vidu-q3-start-end-to-video.
Sign up for a free WaveSpeedAI account to claim starter credits, copy your API key from /accesskey, then call the endpoint shown in the API tab of the playground. The playground also auto-generates a code sample in Python, JavaScript, or cURL for the parameters you've set.
Commercial usage rights depend on the model's license, set by its provider (Vidu). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.