2026-06-19

ElevenLabs Launches Dubbing v2: AI Translation Across 90+ Languages That Keeps the Original Voice

Summary: ElevenLabs shipped Dubbing v2 on May 28, 2026 — a model that skips the traditional ASR→translate→TTS pipeline and instead directly conditions on the original audio performance to produce translated speech that sounds like the same speaker.

Key Facts

Novel approach: Conditions directly on source audio (not a text transcript), capturing emotion, intonation, pacing, and pauses before generating the translated output
Language coverage: 90+ languages and accents with automatic voice cloning that replicates speaker pitch, tone, and timbre
Sync: Generated speech automatically aligns to the original video or audio timing
Availability: GA as of May 28, 2026 via ElevenCreative and ElevenProductions; free tier includes up to 30 minutes on Creator+ plans

Why It Matters

Content localization has long struggled with the tradeoff between translation accuracy and preserving speaker identity. Dubbing v2 collapses both problems into a single model, making it feasible for individual creators, podcasters, and enterprise L&D teams to publish in 90+ languages at near-zero marginal cost — a significant shift for global content distribution economics.

Introducing Dubbing v2 — ElevenLabs Official Blog
ElevenLabs Dubbing v2 Launches With 90+ Languages — aiHola

ElevenLabs Launches Dubbing v2: AI Translation Across 90+ Languages That Keeps the Original Voice

Key Facts

Why It Matters

Read More

매주 핵심 AI 소식, 한 번에 받기