โ† Back to topics
3 model-release Google DeepMind confirmed 1 article

Gemini 3.1 Flash TTS: the next generation of expressive AI speech

Google DeepMind releases Gemini 3.1 Flash TTS with granular audio tags for expressive speech generation.

Gemini 3.1 Flash TTS: the next generation of expressive AI speech
via Google DeepMind

๐Ÿ” Let's dive in

Google DeepMind announced Gemini 3.1 Flash TTS, a new audio model featuring granular audio tags for precise control over AI-generated speech. The tags enable users to direct expressive audio generation with fine-grained control. The model represents the next generation in Google's text-to-speech capabilities.

Lead coverage: Google DeepMind โ€” Gemini 3.1 Flash TTS: the next generation of expressive AI speech โ†—

๐Ÿ•ฐ The timeline ยท 1 source

Google DeepMind first-party ยท 6d ago ยท 3/5

Gemini 3.1 Flash TTS: the next generation of expressive AI speech โ†—

Google DeepMind announced Gemini 3.1 Flash TTS, a new audio model featuring granular audio tags for precise control over AI-generated speech. The tags enable users to direct expressive audio generation with fine-grained control. The model represents the next generation in Google's text-to-speech capabilities.

All audio generated by Gemini 3.1 Flash TTS is watermarked with SynthID. This imperceptible watermark is interwoven directly into the audio output, allowing the reliable detection of AI-generated content to help prevent misinformation.
โ€” Google DeepMind
By embedding natural language commands directly into the text input, you can steer AI-speech output with improved levels of granularity.
โ€” Google DeepMind

๐Ÿท Tags

Gemini

๐Ÿ”ง Debug

Cluster ID
4edf732984
Importance (max)
3
Members
1
Sources
Google DeepMind
Earliest
2026-04-15T16:03:19.000Z
Latest
2026-04-15T16:03:19.000Z
Lead URL
https://deepmind.google/blog/gemini-3-1-flash-tts-the-next-generation-of-expressive-ai-speech