New Gemini 2.5 capabilities
Native audio output and enhancements to Dwell API
At present, the Dwell API is introducing a preview model of audio-visual enter and native audio out dialogue, so you may immediately construct conversational experiences, with a extra pure and expressive Gemini.
It additionally permits the consumer to steer its tone, accent and elegance of talking. For instance, you may inform the mannequin to make use of a dramatic voice when telling a narrative. And it helps instrument use, to have the ability to search in your behalf.
You may experiment with a set of early options, together with:
- Affective Dialogue, through which the mannequin detects emotion within the consumer’s voice and responds appropriately.
- Proactive Audio, through which the mannequin will ignore background conversations and know when to reply.
- Considering within the Dwell API, through which the mannequin leverages Gemini’s considering capabilities to assist extra advanced duties.
We’re additionally releasing new previews for text-to-speech in 2.5 Professional and a pair of.5 Flash. These have first-of-its-kind assist for a number of audio system, enabling text-to-speech with two voices by way of native audio out.
Like Native Audio dialogue, text-to-speech is expressive, and might seize actually delicate nuances, equivalent to whispers. It really works in over 24 languages and seamlessly switches between them.