Updates to Gemini 2.5 from Google DeepMind

Biostate AI Raises $12M Collection A to Prepare the ChatGPT of Molecular Drugs

21 May 2025

The candy style of a brand new concept | MIT Information

20 May 2025

New Gemini 2.5 capabilities

Native audio output and enhancements to Dwell API

At present, the Dwell API is introducing a preview model of audio-visual enter and native audio out dialogue, so you may immediately construct conversational experiences, with a extra pure and expressive Gemini.

It additionally permits the consumer to steer its tone, accent and elegance of talking. For instance, you may inform the mannequin to make use of a dramatic voice when telling a narrative. And it helps instrument use, to have the ability to search in your behalf.

You may experiment with a set of early options, together with:

Affective Dialogue, through which the mannequin detects emotion within the consumer’s voice and responds appropriately.
Proactive Audio, through which the mannequin will ignore background conversations and know when to reply.
Considering within the Dwell API, through which the mannequin leverages Gemini’s considering capabilities to assist extra advanced duties.

We’re additionally releasing new previews for text-to-speech in 2.5 Professional and a pair of.5 Flash. These have first-of-its-kind assist for a number of audio system, enabling text-to-speech with two voices by way of native audio out.

Like Native Audio dialogue, text-to-speech is expressive, and might seize actually delicate nuances, equivalent to whispers. It really works in over 24 languages and seamlessly switches between them.