What does Kling Lipsync do?

It re-syncs the subject's mouth movements in an existing video clip to a new audio track. Use it for dubbing, re-voicing, language swaps, or fixing sync drift on footage you already have.

What are the input limits?

A video clip from 2 to 10 seconds and an audio track from 2 to 60 seconds.

How is Kling Lipsync different from Kling Avatar V2?

Kling Lipsync re-syncs an existing video clip to new audio. Kling Avatar V2 generates a talking video from a single still image plus audio. Pick Lipsync when you already have footage.

How much does it cost?

About 2 credits per second — among the cheapest lip-sync options, well-suited to volume re-voicing.

Kling Lipsync on ZOOOP — Re-Sync Video to New Audio

Kling Lipsync

Kling's lip-sync model — re-sync an existing video clip to new audio, about 2 credits per second.

No subscription

Credits never expire

Pay once for credits and use them on every model on ZOOOP. Top up when you need to, with no monthly burn.

What Kling Lipsync is good at — and what it's not

Kling Lipsync is the re-sync tool: give it a video clip and a new audio track, and it re-aligns the subject's mouth to the new audio. The starting point is footage you already have — a talking-head clip, a recorded performance, or a video you generated elsewhere on ZOOOP — which makes it the model for dubbing, re-voicing, language swaps, and fixing audio that drifted out of sync.

The economics are a real draw: at about 2 credits per second, it's among the cheapest lip-sync options, so producing several localized language cuts of the same clip stays affordable. The driving audio can run from 2 to 60 seconds, against a source video clip of 2 to 10 seconds.
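The limits and the per-second rate quoted here lend themselves to a quick pre-flight check. The Python sketch below is illustrative only: the function names are hypothetical, the constants are copied from this page and may change, and it assumes the caller supplies the billed duration, since the page does not say whether credits are counted against the video or the audio length.

```python
# Limits quoted on this page; treat them as approximate and subject to change.
VIDEO_MIN_S, VIDEO_MAX_S = 2.0, 10.0
AUDIO_MIN_S, AUDIO_MAX_S = 2.0, 60.0
CREDITS_PER_SECOND = 2.0  # "about 2 credits per second", so only an estimate


def check_inputs(video_s: float, audio_s: float) -> list[str]:
    """Return a list of problems; an empty list means both durations fit."""
    problems = []
    if not VIDEO_MIN_S <= video_s <= VIDEO_MAX_S:
        problems.append(
            f"video is {video_s:g}s, expected {VIDEO_MIN_S:g}-{VIDEO_MAX_S:g}s"
        )
    if not AUDIO_MIN_S <= audio_s <= AUDIO_MAX_S:
        problems.append(
            f"audio is {audio_s:g}s, expected {AUDIO_MIN_S:g}-{AUDIO_MAX_S:g}s"
        )
    return problems


def estimate_credits(billed_seconds: float,
                     rate: float = CREDITS_PER_SECOND) -> float:
    """Rough cost estimate; which duration is billed is an assumption."""
    return billed_seconds * rate


if __name__ == "__main__":
    print(check_inputs(video_s=8, audio_s=30))   # both within limits
    print(check_inputs(video_s=12, audio_s=1))   # both out of range
    print(estimate_credits(8))                   # 8 s at the quoted rate
```

Under those assumptions, five localized cuts billed at 8 seconds each would come to roughly 5 × 16 = 80 credits.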

The natural pairing is with a TTS model: generate the new voice (in any supported language) with Multilingual V3 or another voice model, then re-sync your clip to it — a complete localized version with no re-shoot.

Where it's the wrong tool: if you're starting from a single still image rather than video, you want Kling Avatar V2, which generates a talking performance from one image. Pixverse Lipsync is another lip-sync option. Kling Lipsync's lane is re-syncing existing video footage.

A reasonable mental model: default to Kling Lipsync when you have a video clip and want its mouth matched to new audio. To start from a still image instead, use Kling Avatar V2.
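That rule of thumb is small enough to write down as a function. This is a hypothetical helper, not part of any ZOOOP API; the source-type labels "video" and "image" are assumed purely for illustration.

```python
def pick_model(source_type: str) -> str:
    """Map the kind of source material to the model this page recommends.

    The labels "video" and "image" are illustrative, not real API values.
    """
    if source_type == "video":
        # Existing footage: re-sync its mouth movements to new audio.
        return "Kling Lipsync"
    if source_type == "image":
        # Single still image: generate a talking performance instead.
        return "Kling Avatar V2"
    raise ValueError(f"unsupported source type: {source_type!r}")


if __name__ == "__main__":
    print(pick_model("video"))
    print(pick_model("image"))
```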

Key features

Re-sync existing video

Low cost per second

Flexible audio length

Works on real footage

Use cases

Dubbing and localization

Re-voicing
