Google Launches Gemini 3.1 Flash TTS with 70+ Language, Multi‑Speaker Support

Posted on Apr 16, 2026 by CurrentLens in Models

The preview shifts audio generation from simple conversion toward controllable, tag-driven speech in more than 70 languages.

AI Quick Take

Preview adds natural‑language audio tags and native multi‑speaker dialogue for finer expressive control.
Native support for 70+ languages signals a push toward broader multilingual TTS use cases.

Google has launched Gemini 3.1 Flash TTS, a preview text‑to‑speech model that emphasizes speech quality, expressive control, and multilingual generation. The new build is presented as a shift from simple conversion workflows toward more controllable, tag‑driven audio output.

Gemini 3.1 Flash TTS introduces natural‑language audio tags and native multi‑speaker dialogue, and Google says it supports more than 70 languages natively. Those features are intended to let developers specify expression and speaker turns in natural language rather than treating generation as a black box.
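To make the idea of tag-driven, multi-speaker input concrete, here is a minimal Python sketch of how a developer might assemble such a script before sending it to a speech model. Everything in it is an assumption for illustration: the `Turn` and `render_script` names, the square-bracket tag syntax, and the `Speaker: text` layout are hypothetical and are not taken from Google's documentation. The sketch makes no API call.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class Turn:
    """One line of dialogue (hypothetical structure, not a Google type)."""
    speaker: str
    text: str
    tag: Optional[str] = None  # assumed natural-language audio tag, e.g. "softly"


def render_script(turns: Sequence[Turn]) -> str:
    """Render turns as 'Speaker: [tag] text' lines.

    The bracketed tag format is an assumption made for this sketch.
    """
    lines = []
    for turn in turns:
        prefix = f"[{turn.tag}] " if turn.tag else ""
        lines.append(f"{turn.speaker}: {prefix}{turn.text}")
    return "\n".join(lines)


script = render_script([
    Turn("Host", "Welcome back to the show.", tag="warm, upbeat"),
    Turn("Guest", "Gracias, es un placer estar aquí.", tag="softly"),
    Turn("Host", "Let's get started."),
])
print(script)
```

Keeping speaker, tag, and text as separate fields until the final render step means the same dialogue could be re-rendered in whatever input format the released API turns out to require.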

The change in focus matters for teams building localized, dialogic, or expressive voice experiences because it promises finer control without custom engineering around speaker switching or language handling. What to watch next: how Google exposes these capabilities in APIs or tools, performance and quality benchmarks beyond the preview, and whether third‑party adopters report practical gains in multilingual and multi‑speaker scenarios.
