
ElevenLabs v3 Text-to-Speech API by ELEVENLABS

elevenlabs/v3/text-to-speech
Text-to-speech

ElevenLabs v3 Text-to-Speech model. High-quality speech generation from text prompts.

ElevenLabs V3 Text-to-Speech

ElevenLabs V3 Text-to-Speech is ElevenLabs' latest flagship speech synthesis model, delivering highly expressive, natural-sounding audio from text. With improved emotional range, multilingual fluency, and a diverse library of 21 built-in voices, V3 sets a new standard for AI voice generation.

Why Choose This?

  • Expressive voice quality: V3 produces natural, human-like speech with nuanced emotion and intonation.

  • Large voice library: Choose from 21 built-in voices covering a range of genders, ages, and speaking styles.

  • Flexible text normalization: Control how numbers, abbreviations, and symbols are spoken with auto, on, or off modes.

  • Adjustable stability: Fine-tune voice consistency versus expressiveness to match your content.

  • Long-form support: Input up to 5,000 characters per request for articles, narrations, and scripts.

Parameters

| Parameter | Required | Description |
| --- | --- | --- |
| text | Yes | The text to convert to speech. Maximum 5,000 characters. |
| voice | No | Voice to use (default: Bella). See the voice list below. |
| stability | No | Voice stability from 0 (expressive) to 1 (consistent). Default: 0.5. |
| apply_text_normalization | No | Text normalization mode: auto (default), on, or off. |
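
The limits and defaults in the table can be checked on the client before a request is sent. The following is a minimal sketch, not the platform's own validation: the helper name `build_payload` is hypothetical, and it assumes the service counts characters the same way Python's `len()` does, which the page does not confirm.

```python
MAX_CHARS = 5000
NORMALIZATION_MODES = ("auto", "on", "off")


def build_payload(text, voice="Bella", stability=0.5, apply_text_normalization="auto"):
    """Validate inputs against the documented limits and return a request body dict."""
    if not text:
        raise ValueError("text is required")
    # Assumption: the 5,000-character limit matches len(), i.e. Unicode code points.
    if len(text) > MAX_CHARS:
        raise ValueError(f"text has {len(text)} characters; the limit is {MAX_CHARS}")
    if not 0.0 <= stability <= 1.0:
        raise ValueError("stability must be between 0 and 1")
    if apply_text_normalization not in NORMALIZATION_MODES:
        raise ValueError(f"apply_text_normalization must be one of {NORMALIZATION_MODES}")
    return {
        "text": text,
        "voice": voice,
        "stability": stability,
        "apply_text_normalization": apply_text_normalization,
    }
```

Calling `build_payload("Hello")` returns a dict with the documented defaults filled in, while out-of-range values raise `ValueError` before any network call is made.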

Available Voices

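| Voice | Voice | Voice | Voice | Voice |
| --- | --- | --- | --- | --- |
| Bella | Roger | Sarah | Laura | Charlie |
| George | Callum | River | Harry | Liam |
| Alice | Matilda | Will | Jessica | Eric |
| Chris | Brian | Daniel | Lily | Adam |
| Bill | | | | |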
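
A typo in a voice name is easier to catch locally than after a request has been submitted. The sketch below lists the 21 voices from the table and resolves user input to the capitalization shown there. The function name `resolve_voice` is hypothetical, and whether the API itself is case-sensitive is not stated on this page, so the normalization is a defensive assumption.

```python
VOICES = (
    "Bella", "Roger", "Sarah", "Laura", "Charlie",
    "George", "Callum", "River", "Harry", "Liam",
    "Alice", "Matilda", "Will", "Jessica", "Eric",
    "Chris", "Brian", "Daniel", "Lily", "Adam",
    "Bill",
)


def resolve_voice(name=None):
    """Return the voice name as listed in the table; default to Bella when none is given."""
    if name is None:
        return "Bella"
    lookup = {voice.lower(): voice for voice in VOICES}
    key = name.strip().lower()
    if key not in lookup:
        raise ValueError(f"unknown voice {name!r}; expected one of: {', '.join(VOICES)}")
    return lookup[key]
```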

How to Use

  1. Write your text — provide the content to be spoken, up to 5,000 characters.
  2. Select a voice — choose from the 21 built-in voices based on gender, tone, and style.
  3. Adjust stability (optional) — lower values give more expressive delivery; higher values give consistent, neutral tone.
  4. Configure text normalization (optional) — use auto for most cases, on to always expand numbers/abbreviations, off to skip normalization.
  5. Run — submit the request and retrieve the generated audio URL.
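
The steps above could map onto a single JSON POST request. This page does not give the endpoint URL or the authentication scheme, so the sketch below takes the URL as a caller-supplied argument and assumes bearer-token authentication; both need to be checked against the platform's API reference. The function name `make_request` is hypothetical, and the function only builds the request without sending it.

```python
import json
import urllib.request


def make_request(endpoint_url, api_key, text, voice="Bella",
                 stability=0.5, apply_text_normalization="auto"):
    """Build (but do not send) a JSON POST request for one synthesis job."""
    body = {
        "text": text,                                          # step 1
        "voice": voice,                                        # step 2
        "stability": stability,                                # step 3
        "apply_text_normalization": apply_text_normalization,  # step 4
    }
    return urllib.request.Request(
        endpoint_url,  # not documented on this page; supplied by the caller
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Assumption: bearer-token auth; confirm in the platform docs.
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would perform step 5 and submit the job.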

Best Use Cases

  • Content Narration — Convert articles, blogs, and documents into natural audio.
  • Audiobook Production — Generate long-form narration with consistent voice quality.
  • Voiceover & Media — Create professional voiceovers for videos, ads, and presentations.
  • Multilingual Applications — Deliver localized speech in 30+ languages.
  • Conversational AI — Power chatbot and virtual assistant voice responses.
  • Accessibility Tools — Build screen readers and assistive listening applications.

Pro Tips

  • Use Bella or Sarah for warm, professional narration; use Roger or George for authoritative or conversational tones.
  • Set stability to 0.3 to 0.4 for storytelling and emotional content; use 0.7 to 0.9 for news reading or technical narration.
  • Set apply_text_normalization to on when your text contains numbers, currencies, or abbreviations that must be spoken out in full.
  • Break very long content into chunks under 5,000 characters and stitch the audio outputs for seamless long-form production.
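
The chunking tip can be sketched as a splitter that prefers sentence boundaries, so that no chunk ends mid-sentence unless a single sentence exceeds the limit on its own. This is one possible approach rather than a recommended algorithm: the name `chunk_text` is hypothetical, the sentence detection is a simple punctuation heuristic, and runs of whitespace between sentences are collapsed to single spaces. Stitching the resulting audio files is left to an audio tool of your choice.

```python
import re

MAX_CHARS = 5000


def chunk_text(text, limit=MAX_CHARS):
    """Split text into pieces of at most `limit` characters, preferring sentence ends."""
    # Heuristic: a sentence ends at '.', '!' or '?' followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        # Hard-split any single sentence that is itself longer than the limit.
        while len(sentence) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:limit])
            sentence = sentence[limit:]
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= limit:
            current = candidate      # sentence still fits in the current chunk
        else:
            chunks.append(current)   # close the chunk and start a new one
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

Each returned chunk can then be submitted as its own request with the same voice and stability settings, which helps keep the delivery consistent across the stitched result.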

Pricing

| Billing Standard | Price |
| --- | --- |
| Per 1,000 characters | $0.003 |
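
For budgeting, the listed rate can be turned into a rough estimate. The sketch below assumes billing is strictly proportional to the character count; the page does not say whether partial blocks of 1,000 characters are rounded up, so treat the result as an approximation. The name `estimate_cost` is hypothetical.

```python
from decimal import Decimal

PRICE_PER_1000_CHARS = Decimal("0.003")  # USD, from the pricing table


def estimate_cost(char_count):
    """Estimated USD cost, assuming billing is proportional to character count."""
    if char_count < 0:
        raise ValueError("char_count must be non-negative")
    return PRICE_PER_1000_CHARS * Decimal(char_count) / Decimal(1000)
```

Under that assumption, a request at the 5,000-character maximum comes to $0.015.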

Notes

  • model and text are required fields.
  • Maximum input length is 5,000 characters per request.
  • Task status values: created, processing, completed, timeout, failed.
  • Audio output URLs are returned in data.outputs once status is completed.
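
The status values above suggest a polling loop on the client side. The sketch below keeps the HTTP call out of scope by accepting a caller-supplied `fetch_task` function that returns the parsed JSON response as a dict. It assumes the status is reported as `data.status`, next to `data.outputs`; only the location of `outputs` is confirmed by this page. The name `wait_for_outputs` and the polling budget are hypothetical.

```python
import time

TERMINAL_FAILURES = {"timeout", "failed"}


def wait_for_outputs(fetch_task, interval=1.0, max_polls=60, sleep=time.sleep):
    """Poll until the task completes, then return the list found in data.outputs."""
    for _ in range(max_polls):
        response = fetch_task()      # caller-supplied; returns parsed JSON as a dict
        data = response["data"]
        status = data["status"]      # assumption: status sits beside outputs
        if status == "completed":
            return data["outputs"]
        if status in TERMINAL_FAILURES:
            raise RuntimeError(f"task ended with status {status!r}")
        sleep(interval)              # still created or processing; try again
    raise TimeoutError("task did not finish within the polling budget")
```

Injecting `sleep` makes the loop easy to exercise in tests without real delays.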
