Gemini Live is rolling out on Android and iOS: All you need to know

HIGHLIGHTS

Gemini Live processes real-time visual data to interpret surroundings and correct errors on the fly.

Project Mariner enables Gemini to handle multiple tasks simultaneously, such as bookings and research.

New features include improved voice output, screen interaction, memory, and integration with future AR devices.

Gemini Live is rolling out on Android and iOS: All you need to know

Google is now rolling out its much anticipated Gemini Live to both Android and iOS users, bringing the real-time visual understanding to its AI assistant. The feature which is available via Gemini app. This builds on the capabilities, first demonstrated under Project Astra during last year’s Google I/O. The feature was previously available for the Google Pixel devices but starting today, it is available for everyone.  

For the unversed, Gemini Live allows the assistant to process and respond to visual information captured via smartphone cameras in real-time. In live demonstrations, Google showed the ability to interpret surroundings and identify faults in real-time, such as correcting deliberately misleading information while a user moves through an outdoor space.

Google has already announced that it is aiming to transition Gemini into the most powerful, personal and proactive AI assistant for the whole world. The company has already announced new capabilities including the drawing of visual, audio, and contextual input across devices and applications.

Google is also expanding its agentic functions within the Gemini ecosystem. Project Mariner, a browser-based prototype, allows Gemini to handle multiple concurrent tasks such as booking appointments, conducting research, and making purchases. It is currently under testing among Google AI Ultra subscribers in the U.S., with integration planned across additional products throughout the year.

Along with camera-based features, Gemini has also got the voice output, screen interaction, and memory, allowing it to maintain conversational context and perform extended tasks. These tools are expected to feed into future iterations of Search, Gemini’s API for developers, and other form factors such as AR glasses.

“Our ultimate vision is to transform the Gemini app into a universal AI assistant that will perform everyday tasks for us, take care of our mundane admin and surface delightful new recommendations, making us more productive and enriching our lives,” the company added. 

Ashish Singh

Ashish Singh

Ashish Singh is the Chief Copy Editor at Digit. He's been wrangling tech jargon since 2020 (Times Internet, Jagran English '22). When not policing commas, he's likely fueling his gadget habit with coffee, strategising his next virtual race, or plotting a road trip to test the latest in-car tech. He speaks fluent Geek. View Full Profile

Digit.in
Logo
Digit.in
Logo