Dev.to10h ago

Gemini 3.1: Real-World Voice Recognition with...

Background Google released Gemini 3.1 Flash Live at the end of March 2026 March, focusing on "making audio AI more natural and reliable." This model is specifically designed for real-time two-way voice conversations, with low latency, interruptibility, and multi-language support. I happened to have a LINE Bot project (linebot-helper-python) on hand, which already handles text, images, URLs, PDFs, and YouTube, but completely ignores voice messages: User sends a voice message Bot: (Silence) This t

Read original on dev.to