3 · model-release · Google DeepMind · confirmed · 1 article

Gemini 3.1 Flash TTS: the next generation of expressive AI speech

Google DeepMind releases Gemini 3.1 Flash TTS with granular audio tags for expressive speech generation.

🔍 Let's dive in

Google DeepMind announced Gemini 3.1 Flash TTS, a text-to-speech model built around granular audio tags: natural language commands embedded directly in the text input that steer how the generated speech is delivered. All audio the model produces is watermarked with SynthID so that AI-generated content can be detected. Google DeepMind presents the model as the next generation of its expressive speech generation.
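
To make the idea of audio tags concrete, here is a minimal sketch of how a tagged text input might be assembled before it is sent to the model. The square-bracket tag syntax, the tag names, and the `with_audio_tags` helper are all assumptions for illustration; the announcement only says that the commands are embedded in the text input, and it does not specify their format. The request to the model is left out because the source gives no model ID or API details.

```python
def with_audio_tags(segments):
    """Join (tag, text) pairs into a single text input string.

    A tag of None leaves that piece of text unmarked. The "[tag] text"
    bracket syntax is a hypothetical format, not the documented one.
    """
    parts = []
    for tag, text in segments:
        parts.append(f"[{tag}] {text}" if tag else text)
    return " ".join(parts)


# Hypothetical tags: each one directs the delivery of the text after it.
script = with_audio_tags([
    (None, "Welcome back to the show."),
    ("whispering", "I have a secret to share."),
    ("excited", "We just launched!"),
])
print(script)
```

Keeping the direction inline with the words means a single string carries both what is said and how it should be said, which is what allows control at the level of individual phrases.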

Lead coverage: Google DeepMind — Gemini 3.1 Flash TTS: the next generation of expressive AI speech ↗

🕰 The timeline · 1 source

Google DeepMind first-party · 6d ago · 3/5

Gemini 3.1 Flash TTS: the next generation of expressive AI speech ↗

All audio generated by Gemini 3.1 Flash TTS is watermarked with SynthID. This imperceptible watermark is interwoven directly into the audio output, allowing the reliable detection of AI-generated content to help prevent misinformation.

— Google DeepMind

By embedding natural language commands directly into the text input, you can steer AI-speech output with improved levels of granularity.

— Google DeepMind

🏷 Tags

Gemini

🔧 Debug

Cluster ID: 4edf732984
Importance (max): 3
Members: 1
Sources: Google DeepMind
Earliest: 2026-04-15T16:03:19.000Z
Latest: 2026-04-15T16:03:19.000Z
Lead URL: https://deepmind.google/blog/gemini-3-1-flash-tts-the-next-generation-of-expressive-ai-speech