Microsoft introduces VibeVoice, a Whisper-style speech-to-text model with speaker diarization.
Google Launches Gemini 3.1 Flash TTS with 70+ Language, Multi‑Speaker Support
Gemini 3.1 Flash TTS is a preview that refocuses Google’s speech work on expressive control, natural‑language audio tags, and native multilingual, multi‑speaker output.