Google: Gemini 3.1 Flash TTS Brings Expressive AI Speech to More Than 70 Languages
Google has launched Gemini 3.1 Flash TTS, a new text-to-speech model that supports more than 70 languages and scores 1,211 Elo on the Artificial Analysis leaderboard. Its key innovation is audio tags: natural-language commands embedded directly in the input text that give precise control over voice, intonation, and emotion. The model is available in Google AI Studio, Vertex AI, and Google Vids, and its output carries a SynthID watermark so that AI-generated audio can be detected.
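To make the audio-tag idea concrete, here is a minimal sketch of how a script with inline directions might be assembled before it is sent to a speech model. The square-bracket delimiter, the direction names ("cheerful", "whispering"), and the helper functions `with_tag`, `build_script`, and `strip_tags` are all assumptions made for illustration; the announcement does not specify the tag syntax, and the sketch makes no call to any Google API.

```python
import re
from typing import List, Optional, Tuple

# ASSUMPTION: a tag is a natural-language direction in square brackets.
# The actual delimiter used by Gemini 3.1 Flash TTS may differ.
TAG_PATTERN = re.compile(r"\[([^\[\]]+)\]\s*")


def with_tag(direction: str, text: str) -> str:
    """Prefix a span of text with a (hypothetical) inline audio tag."""
    return f"[{direction.strip()}] {text.strip()}"


def build_script(segments: List[Tuple[Optional[str], str]]) -> str:
    """Join (direction, text) pairs into a single tagged script.

    A direction of None leaves that segment untagged.
    """
    parts = [with_tag(d, t) if d else t.strip() for d, t in segments]
    return " ".join(parts)


def strip_tags(script: str) -> str:
    """Remove the tags again to recover the plain transcript."""
    return TAG_PATTERN.sub("", script).strip()


script = build_script([
    ("cheerful", "Welcome back to the show!"),
    ("whispering", "Today's guest is a secret."),
    (None, "Let's get started."),
])
print(script)
print(strip_tags(script))
```

Keeping the directions in the same string as the words means one text field carries both what to say and how to say it, and a plain transcript can still be recovered by stripping the tags.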