
Reference-rich generation
Combine image, video, and audio references to anchor a shot to your assets — the most input-flexible Seedance tier.
The faster Seedance V2.0 — top-tier motion with multi-modal inputs and native audio, up to 720p.
Pay once for credits and use them across every model on ZOOOP. · Top up when you need to, with no monthly burn.
Powered by ByteDance's API on ZOOOP
The Seedance V2.0 generation's motion quality in a faster tier — built for quicker turnaround on the flagship's look.
Bring up to 9 reference images, 3 reference videos, and 3 reference audio tracks into a single generation — anchor look, motion, and sound.
Audio is generated with the video and on by default.
Clips from 4 to 15 seconds, with an auto aspect-ratio option alongside six fixed ratios.

Get the V2.0 generation's motion quality with quicker turnaround for iteration.

Native audio means clips arrive with scene sound, so no separate audio pass is needed.

Up to 15 seconds captures a complete action or beat in one generation.

Use reference videos for motion and reference images for look in the same generation.

The auto aspect-ratio option lets the model frame the shot, alongside six fixed ratios.
Pick the right video model. Your credits work everywhere on ZOOOP.
Open Seedance V2.0 Fast from this page or pick it in the Video Generator.
Write the prompt and add any image, video, or audio references.
Pick aspect ratio (or auto), resolution (up to 720p), and duration (4–15s); keep audio on.
Generate, then download or send the clip to your canvas.
Seedance V2.0 Fast is the faster tier of the V2.0 generation: the flagship's motion quality with quicker turnaround. Its standout is input flexibility. A single generation can take up to 9 reference images, 3 reference videos, and 3 reference audio tracks alongside the prompt, so you can anchor look with images, motion with reference video, and sound with reference audio in one pass. It's the most input-rich tier in the Seedance line.
Generations run 4 to 15 seconds with native audio on by default, and an auto aspect-ratio option sits alongside six fixed ratios so the model can frame the shot when you don't want to. Output tops out at 720p on this tier.
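The limits described above can be sketched as a small pre-flight check. This is only an illustration: the `GenerationRequest` shape, its field names, and the default duration are hypothetical and do not reflect an actual ZOOOP or ByteDance API. Only the numeric limits come from this page. The six fixed aspect ratios are not listed here, so the sketch does not validate them.

```python
from dataclasses import dataclass, field
from typing import List

# Limits as stated on this page for Seedance V2.0 Fast.
MAX_REFERENCE_IMAGES = 9
MAX_REFERENCE_VIDEOS = 3
MAX_REFERENCE_AUDIO = 3
MIN_DURATION_S = 4
MAX_DURATION_S = 15
MAX_RESOLUTION_P = 720


@dataclass
class GenerationRequest:
    """Hypothetical request shape, used for illustration only."""
    prompt: str
    images: List[str] = field(default_factory=list)
    videos: List[str] = field(default_factory=list)
    audio: List[str] = field(default_factory=list)
    duration_s: int = 5          # assumed default; any value in 4-15 is allowed
    resolution_p: int = 720      # vertical resolution, capped at 720p on this tier
    aspect_ratio: str = "auto"   # fixed ratios are not enumerated on the page
    audio_enabled: bool = True   # native audio is on by default


def validate(req: GenerationRequest) -> List[str]:
    """Return a list of problems; an empty list means the request fits the limits."""
    problems = []
    if not req.prompt.strip():
        problems.append("prompt is required")
    if len(req.images) > MAX_REFERENCE_IMAGES:
        problems.append(f"at most {MAX_REFERENCE_IMAGES} reference images")
    if len(req.videos) > MAX_REFERENCE_VIDEOS:
        problems.append(f"at most {MAX_REFERENCE_VIDEOS} reference videos")
    if len(req.audio) > MAX_REFERENCE_AUDIO:
        problems.append(f"at most {MAX_REFERENCE_AUDIO} reference audio tracks")
    if not MIN_DURATION_S <= req.duration_s <= MAX_DURATION_S:
        problems.append(f"duration must be {MIN_DURATION_S}-{MAX_DURATION_S} seconds")
    if req.resolution_p > MAX_RESOLUTION_P:
        problems.append(f"resolution tops out at {MAX_RESOLUTION_P}p")
    return problems
```

Returning a list of problems rather than raising on the first one lets a form show every issue at once before any credits are spent.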
Where it sits: the standard Seedance V2.0 targets the highest fidelity of the generation; V1.5 Pro and V1.0 Pro are the V1-tier options. For longer synced-audio takes from another line, Kling O3 reaches 15 seconds too. Seedance V2.0 Fast's sweet spot is fast, reference-rich V2 generation.
A reasonable mental model: default to Seedance V2.0 Fast when you want the V2 look quickly and have image, video, or audio references to anchor the shot, and step up to the standard V2.0 for peak fidelity.
V2.0 Fast is the faster tier of the V2.0 generation, with multi-modal inputs (image, video, audio references) and output up to 720p. The standard Seedance V2.0 targets higher fidelity. Pick Fast for quicker turnaround and reference-rich input.
Each generation accepts up to 9 reference images, 3 reference videos, and 3 reference audio tracks, plus a text prompt, making this the most input-flexible Seedance tier.
Clips run from 4 to 15 seconds.
Native audio is generated with the video and is on by default.