Kling AI

Kling Lipsync

Kling's lip-sync model — re-sync an existing video clip to new audio, about 2 credits per second.

ไม่มีการสมัครสมาชิก
เครดิตไม่มีวันหมดอายุ
เรียนรู้เพิ่มเติม

จ่ายหนึ่งครั้งสำหรับเครดิต - ใช้พวกเขาในทุกรุ่นใน ZOOOP · เติมเงินเมื่อคุณต้องการไม่มีการเผาไหม้รายเดือน

Powered by Kling AI's API on ZOOOP

คุณสมบัติที่สำคัญ

Re-sync existing video

Provide a video clip and a new audio track, and Kling Lipsync re-aligns the subject's mouth to the new audio — dubbing, re-voicing, and language swaps on footage you already have.

Low cost per second

About 2 credits per second — cheap enough to re-voice clips at volume.

Flexible audio length

Drive the lip-sync with an audio track from 2 to 60 seconds.

Works on real footage

Built for existing video — talking-head clips, recorded performances, and previously generated videos.

ใช้เคส

Dubbing and localization

Dubbing and localization

Swap the audio to a different language and re-sync the mouth — localize a talking-head video without re-shooting.

Re-voicing

Re-voicing

Replace the dialogue on an existing clip with a new take or a cleaner recording, lips re-aligned.

เลือกรุ่นที่เหมาะสม

Pick the right tool. Your credits work everywhere on ZOOOP.

Re-sync an existing videoKling Lipsync
Talking avatar from an imageKling Avatar V2
Lip-sync alternativePixverse Lipsync
New voice track to sync toMultilingual V3
Synced-audio text-to-videoKling O3

วิธีใช้

01

Open Kling Lipsync from this page or pick it in the Video tools.

02

Upload the video clip (2–10s) and the new audio track (2–60s).

03

Confirm the inputs.

04

Generate, then download or send the clip to your canvas.

ดำน้ำลึก

What Kling Lipsync is good at — and what it's not

Kling Lipsync is the re-sync tool: give it a video clip and a new audio track, and it re-aligns the subject's mouth to the new audio. The starting point is footage you already have — a talking-head clip, a recorded performance, or a video you generated elsewhere on ZOOOP — which makes it the model for dubbing, re-voicing, language swaps, and fixing audio that drifted out of sync.

The economics are a real draw: at about 2 credits per second, it's among the cheapest lip-sync options, so producing several localized language cuts of the same clip is cheap rather than precious. The driving audio can run from 2 to 60 seconds, against a source video clip of 2 to 10 seconds.

The natural pairing is with a TTS model: generate the new voice (in any supported language) with Multilingual V3 or another voice model, then re-sync your clip to it — a complete localized version with no re-shoot.

Where it's the wrong tool: if you're starting from a single still image rather than video, you want Kling Avatar V2, which generates a talking performance from one image. Pixverse Lipsync is another lip-sync option. Kling Lipsync's lane is re-syncing existing video footage.

A reasonable mental model: default to Kling Lipsync when you have a video clip and want its mouth matched to new audio. To start from a still image instead, use Kling Avatar V2.

คำถามที่พบบ่อย

What does Kling Lipsync do?+

It re-syncs the mouth of an existing video clip to a new audio track — for dubbing, re-voicing, language swaps, or fixing sync drift on footage you already have.

What are the input limits?+

A video clip from 2 to 10 seconds and an audio track from 2 to 60 seconds.

How is Kling Lipsync different from Kling Avatar V2?+

Kling Lipsync re-syncs an existing video clip to new audio. Kling Avatar V2 generates a talking video from a single still image plus audio. Pick Lipsync when you already have footage.

How much does it cost?+

About 2 credits per second — among the cheapest lip-sync options, well-suited to volume re-voicing.

รุ่นเพิ่มเติม