Programmable Voice | Mar. 20, 2025
New Generative Voices with latest Amazon and Google technology available for <Say> in Public Beta
The Generative voices are powered by the latest advances in synthesized speech and offer the most human-like, emotionally engaged, and context-aware voices. They "interpret" the text input and adjust speech accordingly (e.g., rendering context-dependent prosody, tone, emotion, pausing, spelling, dialectal properties, and foreign word pronunciation). These synthetic voices sound remarkably similar to a human voice, which makes them the optimal choice for Conversational AI applications and Virtual Agents.
This release includes a total of 260 new voices, 20 from Amazon Polly Generative and 240 from Google’s Chirp3-HD, across different languages and locales.
Google’s Chirp3-HD voices and Amazon Polly Generative voices are the first voices available in a new text-to-speech pricing tier, Generative voices, priced at $0.013 per 100 characters.
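To make the rate concrete, here is a minimal sketch of a cost estimate. The helper name `estimate_generative_cost` is hypothetical, and the sketch assumes charges scale linearly with character count with no rounding to 100-character blocks; the announcement does not say how partial blocks are billed, so check the pricing docs for the actual rules.

```python
from decimal import Decimal

# Rate stated in the announcement: $0.013 per 100 characters.
RATE_PER_100_CHARS = Decimal("0.013")


def estimate_generative_cost(text: str) -> Decimal:
    """Hypothetical helper: estimated cost in USD for synthesizing `text`.

    Assumption: cost is pro-rated linearly per character, with no
    rounding up to whole 100-character blocks.
    """
    return RATE_PER_100_CHARS * len(text) / 100


# Example: a 1,000-character prompt (ten blocks of 100 characters).
message = "x" * 1000
print(estimate_generative_cost(message))
```

Under that assumption, 1,000 characters come to about $0.13.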
These new voices are initially available for <Say> only. Support for Text-to-Speech Settings in the Twilio Console and for the Studio <Say> Widget will follow.
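As an illustration of selecting a voice on <Say>, the sketch below builds a TwiML response using only the Python standard library. The function `build_say_twiml` is a hypothetical helper, and the voice name shown is an assumed example of the naming format; confirm the exact voice identifiers in the Twilio Text-to-Speech docs before using one.

```python
import xml.etree.ElementTree as ET


def build_say_twiml(text: str, voice: str) -> str:
    """Hypothetical helper: return a TwiML document with one <Say> verb."""
    response = ET.Element("Response")
    # The `voice` attribute on <Say> selects the text-to-speech voice.
    say = ET.SubElement(response, "Say", {"voice": voice})
    say.text = text  # ElementTree escapes special characters such as & and <
    return ET.tostring(response, encoding="unicode")


# Assumed example voice name; check the docs for the supported list.
EXAMPLE_VOICE = "Google.en-US-Chirp3-HD-Aoede"

twiml = build_say_twiml("Thanks for calling. How can I help you today?", EXAMPLE_VOICE)
print(twiml)
```

The resulting string can be returned from a voice webhook as the response body.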
For more information on the voices and on pricing, please visit Twilio Text-to-Speech docs.