VEED Fabric 1.0
An image-to-video API that turns a single still image and an audio track into a lip-synced talking video. Ideal for video storytelling, personalized messages, digital avatars, and automated content pipelines.
Why it looks good
- One image + audio → talking video: Generate lip-synced, expressive clips from a single portrait or character image.
- Natural lip-sync & expressions: Stable mouth–audio alignment and smooth facial transitions with minimal jitter.
- Short creation pipeline: Image + voice → finished video, optimized for batch runs and automation.
- Multi-scenario ready: Explainers, greetings, brand characters, course intros, and support avatars.
Pricing
| Resolution | Price per 5 seconds | Example (10s) | Example (15s) |
|---|---|---|---|
| 480p | $0.35 | $0.70 | $1.05 |
| 720p | $0.70 | $1.40 | $2.10 |
Video generation is billed per 5 seconds of output: $0.35 at 480p or $0.70 at 720p.
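The rates in the table can be turned into a quick budget estimate. The sketch below is illustrative only: `estimate_cost` is a hypothetical helper, not part of the API, and it assumes cost scales linearly with duration. This page only gives examples for multiples of 5 seconds, so billing for other durations (for instance, rounding up to the next 5-second block) may differ.

```python
from decimal import Decimal

# Prices per 5 seconds of generated video, copied from the pricing table.
PRICE_PER_5S = {"480p": Decimal("0.35"), "720p": Decimal("0.70")}


def estimate_cost(resolution: str, duration_s: float) -> Decimal:
    """Estimate the cost in USD for one clip.

    Assumption: linear proration between 5-second marks. The pricing table
    only confirms the values at 5, 10, and 15 seconds.
    """
    if resolution not in PRICE_PER_5S:
        raise ValueError(f"unknown resolution: {resolution!r}")
    if duration_s <= 0:
        raise ValueError("duration_s must be positive")
    per_second = PRICE_PER_5S[resolution] / 5
    total = per_second * Decimal(str(duration_s))
    return total.quantize(Decimal("0.01"))


# Reproduces the table's example columns.
print(estimate_cost("480p", 10))  # 0.70
print(estimate_cost("720p", 15))  # 2.10
```

Using `Decimal` rather than floats keeps the cents exact when summing estimates over a large batch.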
How to use
- Upload audio
  - Add a voice track URL or drag-and-drop a file into the audio field.
  - Use clean, evenly paced speech; denoise/EQ if possible.
- Upload image
  - Paste an image URL or drag-and-drop a portrait into the image field.
  - Prefer a clear front-facing or body shot with even lighting.
- Select resolution
  - Choose 480p for lightweight clips or 720p for sharper output.
- Run
  - Click Run to start the job. You’ll receive a job ID; the UI will show progress.
- Retrieve & iterate
  - Download the result when complete.
  - Swap audio or image, or adjust resolution to iterate quickly.
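For automated pipelines, the same steps (provide inputs, pick a resolution, start a job, wait for the result) can be sketched in code. This page does not document the request format, so everything below is an assumption: the field names `image_url`, `audio_url`, and `resolution`, the status values `completed` and `failed`, and both helper functions are hypothetical. The status lookup is passed in as a function so the sketch runs without any network access; in real use it would wrap whatever call the API provides for checking a job ID.

```python
import time
from typing import Callable, Dict

ALLOWED_RESOLUTIONS = {"480p", "720p"}  # the two options listed on this page


def build_payload(image_url: str, audio_url: str, resolution: str = "480p") -> Dict[str, str]:
    """Validate and assemble job inputs. Field names are hypothetical."""
    if resolution not in ALLOWED_RESOLUTIONS:
        raise ValueError(f"resolution must be one of {sorted(ALLOWED_RESOLUTIONS)}")
    for name, url in (("image_url", image_url), ("audio_url", audio_url)):
        if not url.startswith(("http://", "https://")):
            raise ValueError(f"{name} must be an http(s) URL")
    return {"image_url": image_url, "audio_url": audio_url, "resolution": resolution}


def wait_for_job(
    get_status: Callable[[str], Dict[str, str]],
    job_id: str,
    poll_interval_s: float = 2.0,
    timeout_s: float = 600.0,
) -> Dict[str, str]:
    """Poll get_status(job_id) until a terminal state (hypothetical state names)."""
    deadline = time.monotonic() + timeout_s
    while True:
        status = get_status(job_id)
        state = status.get("status")
        if state == "completed":
            return status
        if state == "failed":
            raise RuntimeError(f"job {job_id} failed: {status.get('error', 'unknown error')}")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"job {job_id} did not finish within {timeout_s} s")
        time.sleep(poll_interval_s)


# Demo with a stand-in status function instead of a real API call.
payload = build_payload("https://example.com/portrait.png", "https://example.com/voice.mp3", "720p")
fake_responses = iter([
    {"status": "processing"},
    {"status": "completed", "video_url": "https://example.com/result.mp4"},
])
result = wait_for_job(lambda job_id: next(fake_responses), "job-123", poll_interval_s=0)
print(payload["resolution"], result["video_url"])
```

Keeping validation separate from polling makes it easy to iterate as described above: swap the audio or image URL, rebuild the payload, and submit again.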
Common use cases
- Digital avatars
- Personalized greetings
- Education snippets
- Social/marketing upgrades (poster → talking video)
- Customer service presenters