Enjoy 50% OFF Vidu Q3 & Q3 Pro models • Only on WaveSpeedAI | May 20 – Jun 2

Depth Anything Video

wavespeed-ai /

Depth Anything Video estimates depth maps from video input with temporal consistency. Supports multiple model sizes and colormaps. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

video-to-video
Input

Drag & drop or click to upload

Idle

$0.04per run·~25 / $1

Next:

ExamplesView all

Related Models

README

Wavespeed Depth Anything Video

Wavespeed Depth Anything Video (VDA) is a specialized model designed to estimate dense, pixel-wise depth from monocular video. By transforming standard 2D footage into a grayscale depth map, it provides essential spatial data for 3D reconstruction, augmented reality, and professional visual effects.

Why Choose This?

  • Temporal Consistency Engineered to maintain depth stability across frames, preventing the "flickering" effect common in frame-by-frame processing.
  • Scale Flexibility Offers three distinct model sizes to balance between real-time processing speed and high-fidelity depth precision.
  • Fine-Grained Detail Excellent at capturing thin structures and complex silhouettes, such as foliage or distant architectural elements.
  • Zero-Shot Generalization Performs reliably across diverse environments, from indoor studios to vast outdoor landscapes, without needing scene-specific tuning.

Parameters

ParameterRequiredDescription
video*YesThe input video file to process (Drag and drop a file or click to upload).
modelNoSelection of model scale: VDA-Small, VDA-Base, or VDA-Large (Default).

How to Use

  1. Upload your video — Drag and drop your source file into the upload box or provide a direct media link.
  2. Select the model
  • VDA-Small: Fastest inference, best for mobile or quick previews.
  • VDA-Base: Standard balance of speed and accuracy.
  • VDA-Large: Maximum precision for professional VFX and 3D mapping.
  1. Run — Submit the task to generate and download your depth-encoded video.

Model Comparison

VersionUse CasePerformance
VDA-SmallReal-time applications and low-latency feedback.Optimized Speed
VDA-BaseGeneral creative projects and social media content.Balanced
VDA-LargeHigh-end cinematography and 3D environment scanning.Best Quality

Best Use Cases

  • Cinematography & VFX — Create realistic depth-of-field, fog, and volumetric lighting effects in post-production.
  • 3D Scene Reconstruction — Extract spatial data to build point clouds or 3D meshes from 2D video.
  • AR Occlusion — Enable virtual objects to realistically pass behind physical objects in a video scene.
  • Motion Graphics — Use depth data as a displacement map for unique visual transitions.

Pro Tips

  • Check the Histogram: In the output, pure white represents the closest objects to the lens, while black represents the furthest distance.
  • VDA-Large for Detail: Use the VDA-Large model if your video contains intricate foreground elements like hair or thin wires.
  • Consistency: Ensure your video has steady lighting for the most accurate depth estimation results.
Accessibility:This website uses AI models provided by third parties.

Depth Anything Video API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/wavespeed-ai/depth-anything/video with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Depth Anything Video below.

HTTP example
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/depth-anything/video" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "video": "https://example.com/your-input.mp4",
    "model": "VDA-Large"
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].
Node.js example
// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

const result = await client.run("wavespeed-ai/depth-anything/video", {
        "video": "https://example.com/your-input.mp4",
        "model": "VDA-Large"
});

console.log(result.outputs[0]); // → URL of the generated output
Python example
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "wavespeed-ai/depth-anything/video",
    {
    "video": "https://example.com/your-input.mp4",
    "model": "VDA-Large"
}
)

print(output["outputs"][0])  # → URL of the generated output

Depth Anything Video API — Frequently asked questions

What is the Depth Anything Video API?

Depth Anything Video is a WaveSpeedAI model for video editing, exposed as a REST API on WaveSpeedAI. Depth Anything Video estimates depth maps from video input with temporal consistency. Supports multiple model sizes and colormaps. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Depth Anything Video API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/wavespeed-ai/depth-anything-video.

How much does Depth Anything Video cost per run?

Depth Anything Video starts at $0.040 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Depth Anything Video accept?

Key inputs: `video`, `model`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/wavespeed-ai/depth-anything-video.

How do I get started with the Depth Anything Video API?

Sign up for a free WaveSpeedAI account to claim starter credits, copy your API key from /accesskey, then call the endpoint shown in the API tab of the playground. The playground also auto-generates a code sample in Python, JavaScript, or cURL for the parameters you've set.

Can I use Depth Anything Video outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (WaveSpeedAI). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.