Hem
Utforska
ElevenLabs v3
elevenlabs/v3/text-to-speech
ElevenLabs v3 Text-to-Speech
text-till-tal

ElevenLabs v3 Text-to-Speech API by ELEVENLABS

elevenlabs/v3/text-to-speech
Text-to-speech

ElevenLabs v3 Text-to-Speech model. High-quality speech generation from text prompts.

ElevenLabs V3 Text-to-Speech

ElevenLabs V3 Text-to-Speech is ElevenLabs' latest flagship speech synthesis model, delivering highly expressive, natural-sounding audio from text. With improved emotional range, multilingual fluency, and a diverse library of 21 built-in voices, V3 sets a new standard for AI voice generation.

Why Choose This?

  • Expressive voice quality V3 produces natural, human-like speech with nuanced emotion and intonation.

  • Large voice library Choose from 21 built-in voices covering a range of genders, ages, and speaking styles.

  • Flexible text normalization Control how numbers, abbreviations, and symbols are spoken with auto, on, or off modes.

  • Adjustable stability Fine-tune voice consistency versus expressiveness to match your content.

  • Long-form support Input up to 5,000 characters per request for articles, narrations, and scripts.

Parameters

ParameterRequiredDescription
textYesThe text to convert to speech. Maximum 5,000 characters
voiceNoVoice to use (default: Bella). See voice list below
stabilityNoVoice stability from 0 (expressive) to 1 (consistent), default: 0.5
apply_text_normalizationNoText normalization mode: auto (default), on, or off

Available Voices

VoiceVoiceVoiceVoiceVoice
BellaRogerSarahLauraCharlie
GeorgeCallumRiverHarryLiam
AliceMatildaWillJessicaEric
ChrisBrianDanielLilyAdam
Bill

How to Use

  1. Write your text — provide the content to be spoken, up to 5,000 characters.
  2. Select a voice — choose from the 21 built-in voices based on gender, tone, and style.
  3. Adjust stability (optional) — lower values give more expressive delivery; higher values give consistent, neutral tone.
  4. Configure text normalization (optional) — use auto for most cases, on to always expand numbers/abbreviations, off to skip normalization.
  5. Run — submit the request and retrieve the generated audio URL.

Best Use Cases

  • Content Narration — Convert articles, blogs, and documents into natural audio.
  • Audiobook Production — Generate long-form narration with consistent voice quality.
  • Voiceover & Media — Create professional voiceovers for videos, ads, and presentations.
  • Multilingual Applications — Deliver localized speech in 30+ languages.
  • Conversational AI — Power chatbot and virtual assistant voice responses.
  • Accessibility Tools — Build screen readers and assistive listening applications.

Pro Tips

  • Use Bella or Sarah for warm, professional narration; use Roger or George for authoritative or conversational tones.
  • Set stability to 0.30.4 for storytelling and emotional content; use 0.70.9 for news reading or technical narration.
  • Set apply_text_normalization to on when your text contains numbers, currencies, or abbreviations that must be spoken out in full.
  • Break very long content into chunks under 5,000 characters and stitch the audio outputs for seamless long-form production.

Pricing

Billing StandardPrice
Per 1,000 characters$0.003

Notes

  • model and text are required fields.
  • Maximum input length is 5,000 characters per request.
  • Task status values: created, processing, completed, timeout, failed.
  • Audio output URLs are returned in data.outputs once status is completed.

Ett API för all media-AI.

Utforska alla modeller

Join our Discord community

Join the Discord community for the latest model updates, prompts, and support.