Bilibili Index

Index TTS 2

Bilibili's Index TTS 2 — voice cloning with fine-grained control over eight emotions.

Подписки нет
Кредиты никогда не истекают
Узнать больше

Pay once for credits — use them across every model on ZOOOP. · Пополняйте, когда вам нужно, без ежемесячного сжигания.

Powered by Bilibili Index's API on ZOOOP

Ключевые особенности

Voice cloning from a sample

Provide a reference audio sample and Index TTS 2 speaks your text in that cloned voice.

Eight-emotion control

Dial in happy, angry, sad, afraid, disgusted, melancholic, surprised, and calm individually — blend emotions to shape exactly how a line reads.

Fine-grained delivery

Set each emotion's strength independently for nuanced, layered expression.

Bilibili Index lineage

Built on the Bilibili Index voice model.

Случаи использования

Emotional character voice

Emotional character voice

Clone a character voice and dial in the exact emotional blend for each line.

Dramatic dialogue

Dramatic dialogue

Set anger, fear, or melancholy strengths to match a dramatic scene.

Cloned brand voice

Cloned brand voice

Reproduce a consistent voice with controllable emotional range.

Drive a talking avatar

Drive a talking avatar

Generate the cloned, emotion-controlled voice, then drive an avatar model with it.

Выберите правильную модель

Pick the right voice model. Your credits work everywhere on ZOOOP.

Voice clone with emotion controlIndex TTS 2
Multilingual voice cloningChatterbox TTS Multilingual
Cheapest voice cloneLUX TTS
Preset-voice TTSMultilingual V3
Drive a talking avatarKling Avatar V2
Sound effects and ambienceSound Effects V2

Как пользоваться

01

Open Index TTS 2 from this page or pick it in the Audio tools.

02

Upload a reference voice sample and paste your text.

03

Set the strength of each emotion to shape the delivery.

04

Generate, then download or send the audio to your canvas.

Глубокое погружение

What Index TTS 2 is good at — and what it's not

Index TTS 2 is Bilibili's voice-cloning model with a distinctive strength: fine-grained control over eight emotions. Provide a reference sample to clone a voice, then set the strength of happy, angry, sad, afraid, disgusted, melancholic, surprised, and calm — individually — to shape exactly how each line reads. Because the emotions blend, you can layer subtle combinations rather than choosing one preset feeling, which suits performed narration and dramatic dialogue.

The cloning side reproduces a specific voice from your sample, so the same character or brand voice can carry a script with a controllable emotional range. Pricing is per 1,000 characters.

Where it sits among ZOOOP's voice models: Chatterbox TTS is the voice clone built for broad multilingual coverage; LUX TTS is the cheapest clone; for preset voices use Multilingual V3. Index TTS 2's sweet spot is emotionally nuanced voice cloning.

A reasonable mental model: default to Index TTS 2 when a cloned voice needs precise emotional control, and switch to Chatterbox for many languages or LUX TTS for the lowest cost.

Часто задаваемые вопросы

What emotions can Index TTS 2 control?+

Eight, set individually: happy, angry, sad, afraid, disgusted, melancholic, surprised, and calm. Blend them to shape exactly how a line reads.

What does it need to clone a voice?+

A reference audio sample of the voice. It speaks your text in that cloned voice with your chosen emotional blend.

How is it different from Chatterbox TTS?+

Both clone voices. Index TTS 2 offers fine-grained, eight-emotion control; Chatterbox emphasizes broad multilingual coverage. Pick Index TTS 2 when emotional nuance matters most.

How is it priced?+

Per 1,000 characters of text.

Больше моделей