Programmable Voice | Mar. 20, 2025

New Generative Voices with latest Amazon and Google technology available for <Say> in Public Beta

Twilio has updated its Text-to-Speech offering, adding support for Google’s Chirp3-HD voices and Amazon Polly Generative voices, available now in Public Beta.

The Generative voices are powered by the latest advances in speech synthesis and offer the most human-like, emotionally engaged, and context-aware voices. They "interpret" the input text and adjust the speech accordingly (e.g., rendering context-dependent prosody, tone, emotion, pausing, spelling, dialectal properties, and foreign-word pronunciation). These synthetic voices sound remarkably similar to a human voice, which makes them the optimal choice for Conversational AI applications and Virtual Agents.

This release includes a total of 260 new voices across different languages and locales: 20 from Amazon Polly Generative and 240 from Google’s Chirp3-HD.

Google’s Chirp3-HD voices and Amazon Polly Generative voices are the first voices available in a new text-to-speech pricing tier, Generative voices, priced at $0.013 per 100 characters.
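To get a feel for what this rate means in practice, the short Python sketch below estimates the cost of synthesizing a piece of text. It is only an approximation: the function name is hypothetical, and it assumes the charge scales pro rata with character count, whereas actual billing may round or count characters differently. At this rate, 1,000 characters works out to about $0.13.

```python
from decimal import Decimal

# Rate quoted in this announcement: USD 0.013 per 100 characters.
PRICE_PER_100_CHARS = Decimal("0.013")


def estimate_generative_tts_cost(text: str) -> Decimal:
    """Rough, pro-rata cost estimate in USD for one piece of text.

    Assumption: cost scales linearly with len(text). Real billing may
    round up to a block size or count characters differently.
    """
    return PRICE_PER_100_CHARS * len(text) / Decimal(100)


if __name__ == "__main__":
    prompt = "Thanks for calling. How can I help you today?"
    print(f"{len(prompt)} characters -> ~${estimate_generative_tts_cost(prompt)}")
```

Using `Decimal` rather than floats keeps the arithmetic exact for small per-character amounts.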

These new voices are initially available for <Say> only. Support in the Text-to-Speech Settings of the Twilio Console and in the Studio <Say> Widget will follow.
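As an illustration of selecting a voice on <Say>, the following Python sketch builds a TwiML response using only the standard library. The voice identifier shown is an assumption for illustration, not confirmed by this announcement; consult the Text-to-Speech docs for the exact voice names. The helper name `build_say_twiml` is hypothetical.

```python
import xml.etree.ElementTree as ET

# Assumed voice identifier, for illustration only. Look up the exact
# name of the voice you want in the Text-to-Speech docs.
ASSUMED_VOICE = "Google.en-US-Chirp3-HD-Aoede"


def build_say_twiml(text: str, voice: str = ASSUMED_VOICE) -> str:
    """Return a TwiML document containing a single <Say> verb."""
    response = ET.Element("Response")
    say = ET.SubElement(response, "Say", {"voice": voice})
    say.text = text
    # encoding="unicode" yields a str without an XML declaration.
    return ET.tostring(response, encoding="unicode")


if __name__ == "__main__":
    print(build_say_twiml("Thanks for calling. How can I help you today?"))
```

The resulting string would be returned from the webhook that handles the call, in the same way as any other TwiML response.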

For more information on the voices and on pricing, see the Twilio Text-to-Speech docs.

Voice Beta
