AI News — 2026-04-15

3

Gemini 3.1 Flash TTS: the next generation of expressive AI speech

6d ago · 1 source confirmed confirmed

Google DeepMind releases Gemini 3.1 Flash TTS with granular audio tags for expressive speech generation.

Google DeepMind announced Gemini 3.1 Flash TTS, a new audio model featuring granular audio tags for precise control over AI-generated speech. The tags enable users to direct expressive audio generation with fine-grained control. The model represents the next generation in Google's text-to-speech capabilities.

All audio generated by Gemini 3.1 Flash TTS is watermarked with SynthID. This imperceptible watermark is interwoven directly into the audio output, allowing the reliable detection of AI-generated content to help prevent misinformation.

— Google DeepMind

3

Inside VAKRA: Reasoning, Tool Use, and Failure Modes of Agents

6d ago · 1 source confirmed confirmed

Hugging Face publishes research on VAKRA, examining reasoning, tool use, and failure modes in AI agents.

Hugging Face has published research on VAKRA, a study focused on understanding how AI agents reason, utilize tools, and encounter failure modes. The work examines the mechanisms and limitations of agent-based systems in practical scenarios. This research contributes to the broader understanding of agent behavior and robustness.

3

The next evolution of the Agents SDK

7d ago · 1 source confirmed confirmed

OpenAI updates its Agents SDK with native sandbox execution and model-native harness for secure, long-running agent development.

OpenAI has released an update to its Agents SDK featuring native sandbox execution and a model-native harness designed to help developers build secure, long-running agents. The update enables agents to work across files and tools with improved safety and isolation. This advancement aims to streamline agent development and deployment for production use cases.

Wednesday, April 15, 2026