Dialogue, native
Speech in English, Chinese, Japanese, Korean, and Spanish — English accents and Chinese dialects included — and each character can speak a different language, in the order you direct.
Kling generates up to fifteen seconds of multi-shot cinema with the dialogue already in it — five languages, per-character voices, camera moves read straight from the prompt. In MML ONE it sits one click from your storyboard: nine tiers, routed shot by shot.
By Kling AI (Kuaishou)
Five things Kling does that most video models don't — each from the vendor's own releases, not our imagination.
Speech in English, Chinese, Japanese, Korean, and Spanish — English accents and Chinese dialects included — and each character can speak a different language, in the order you direct.
One generation carries shot-reverse-shot, cross-cutting, and voice-over — multi-shot storytelling understood straight from the prompt.
3.0 Omni lifts a character's look and voice from a single reference video, then re-stages them scene to scene — per-shot duration, framing, angle, camera move.
O1 generates and edits in one model: swap the subject, change the weather or the style on existing footage. No masks, no keyframes, up to seven mixed inputs.
On-screen text and logos stay legible through generation — storefronts, labels, and branded wardrobe keep their lettering. Product films care about this.
Exactly what our catalog serves today — vendor names, newest first.
Tier lineup as served in MML ONE on the day this page was written. Kling ships fast — the catalog inside the app is the live truth.

Takes land in the storyboard as versioned assets. Dialogue-heavy scenes and multi-shot sequences are the shots you route to Kling.
The newest tiers gate behind premium access first, and the fast tier trades inputs for speed — Kling 3.0 Turbo takes a first frame only, no references.
Audio arrived at 2.6. Everything below it is a silent model, and even 2.6 goes silent in first/last-frame mode.
Character-from-video work lives on O1 and 3.0 Omni only; every other tier works from image references and first/last frames.
Start with a premise, a screenplay, or a folder of references. We'll set up your provider keys and walk through the first scene with you.
Cookie settings
We use analytics cookies to improve MML ONE. You can decline anytime. Privacy