Nano Banana 2 & Pro Sale — 15% OFF | Apr 1–15 Only
Home/Explore/Kling O3 Models/kwaivgi/kling-video-o3-std/text-to-video

Kling Omni Video O3 Standard Text-To-Video

kwaivgi/kling-video-o3-std/text-to-video

Kling Omni Video O3 (Standard) is Kuaishou's advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Text-to-Video mode generates cinematic videos from text prompts with subject consistency, natural physics simulation, and precise semantic understanding. Supports audio generation. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

text-to-video
Input
Whether to generate audio for the video.

Idle

Your request will cost $0.42 per run.

For $10 you can run this model approximately 23 times.

One more thing:

ExamplesView all

README

Kling Video O3 Standard Text-to-Video

Kling Video O3 Standard is Kuaishou's advanced text-to-video model in the O3 family, delivering high-quality cinematic video from text descriptions. With optional synchronized sound generation, multiple aspect ratios, and flexible duration, it offers a strong balance of quality and cost.

Why Choose This?

  • O3-level quality Advanced visual fidelity and motion realism beyond V3.0 models.

  • Sound generation Optional synchronized sound effects generated alongside the video.

  • Flexible duration Generate videos from 3 to 15 seconds to match your scene needs.

  • Multiple aspect ratios Support for 16:9, 9:16, and 1:1 to fit any platform.

  • Multi-prompt support Chain multiple prompt segments to guide scene transitions and narrative flow within a single generation.

  • Prompt Enhancer Built-in tool to automatically improve your video descriptions.

Parameters

ParameterRequiredDescription
promptYesText description of the video scene, motion, and style.
aspect_ratioNoOutput ratio: 16:9 (default), 9:16, 1:1.
durationNoVideo length in seconds. Range: 3–15. Default: 5.
soundNoGenerate synchronized sound alongside the video. Default: disabled.
shot_typeNoEditing mode: intelligent (default, auto-determines scope) or customize.
multi_promptNoAdditional prompt segments to guide scene transitions and progressions.

How to Use

  1. Write your prompt — describe the scene, characters, motion, camera style, and atmosphere in detail. Use the Prompt Enhancer for better results.
  2. Select aspect ratio — 16:9 for landscape, 9:16 for portrait/social, 1:1 for square.
  3. Set duration — choose between 3 and 15 seconds based on your scene length.
  4. Enable sound (optional) — generate synchronized audio alongside the video.
  5. Select shot_type (optional) — use intelligent for automatic scope, or customize for manual control.
  6. Add multi-prompt segments (optional) — click Add Item to guide scene transitions with additional prompts.
  7. Run — submit and download your video.

Pricing

DurationWithout SoundWith Sound
3s$0.252$0.336
5s$0.420$0.560
10s$0.840$1.120
15s$1.260$1.680

Billing Rules

  • Base rate: $0.42 per 5 seconds ($0.084 per second)
  • Sound surcharge: +33% when sound is enabled
  • Duration range: 3–15 seconds

Best Use Cases

  • Professional Content — High-quality videos at a more accessible price than O3 Pro.
  • Social Media — Create engaging videos for TikTok, Reels, and Stories.
  • Marketing Videos — Produce promotional content with optional synchronized sound.
  • Concept Visualization — Bring creative ideas to life quickly from text descriptions.
  • Extended Scenes — Up to 15 seconds for longer scene development and storytelling.

Pro Tips

  • Use the Prompt Enhancer to refine your descriptions automatically.
  • Match aspect ratio to your platform: 16:9 for YouTube, 9:16 for TikTok and Reels, 1:1 for Instagram.
  • Enable sound for a complete video experience with synchronized audio.
  • Be specific about camera movements, lighting, and atmosphere for best results.
  • Use shorter durations (3–5s) for testing, longer (10–15s) for final production.
  • Use multi_prompt to build smooth narrative progressions across a single clip.

Notes

  • Only prompt is required; all other parameters have defaults.
  • Sound generation increases cost by approximately 33%.
  • Please follow Kuaishou's content usage policies when crafting prompts.

Related Models