Google has recently showcased significant upgrades to the generative image editing capabilities in version 17.10.54.sa.arm64 (beta) of the Gemini Android app. This build introduces a deeply integrated markup interface and an inline text description box, aiming to address two pain points in current AI image editing: imprecise instruction delivery and fragmented workflows. Together, they further enhance Gemini's ability to fine-tune specific parts of generated content (such as Nano Banana images).


The core of this iteration lies in the reconstruction of the interaction logic. The previous basic sketching support required users to exit the editing interface before sending instructions to the chatbot. The new interface instead lets users tap the "pencil" icon, draw high-precision marks directly on specific areas of an image, and simultaneously type their modification intent into the newly added text box at the bottom.

This dual-modal interaction approach of "visual positioning + natural language" significantly improves the model's accuracy in understanding localized modification instructions. In addition, the beta build reserves placeholders for Resizing and Effects options, suggesting that Gemini is evolving from a single text-to-image tool into a comprehensive image workstation that integrates generation, cropping, and filter processing.
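To make the "visual positioning + natural language" idea concrete, the sketch below shows how a localized edit request might pair a user-drawn markup region with a text instruction. Every field and function name here (`build_edit_request`, `edit_region`, and so on) is a hypothetical illustration of the interaction pattern, not Google's actual Gemini API.

```python
# Hypothetical sketch of a "visual positioning + natural language" edit request.
# All names and fields are illustrative assumptions, not Google's real API.

def build_edit_request(image_id: str, box: tuple, instruction: str) -> dict:
    """Pair a user-drawn markup box (pixel coordinates) with a
    natural-language modification intent for a localized edit."""
    x0, y0, x1, y1 = box
    if not (x0 < x1 and y0 < y1):
        raise ValueError("markup box must have positive width and height")
    return {
        "image_id": image_id,
        # The region the user circled with the "pencil" tool.
        "edit_region": {"x0": x0, "y0": y0, "x1": x1, "y1": y1},
        # The intent typed into the text box at the bottom.
        "instruction": instruction,
    }

# Example: the user marks the upper part of the image and types an intent.
request = build_edit_request("nano_banana_001", (0, 0, 1024, 300),
                             "make the sky look like sunset")
print(request["edit_region"])
```

The point of the pattern is that the model receives both signals at once, so the text no longer has to describe *where* to edit, only *what* to change.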

From an industry trend perspective, Google's move reflects that the focus of competition in generative AI is shifting from "creating something out of nothing" to "controlled editing with precision." By integrating complex markup tools into a native mobile application, Google aims to establish a higher interaction barrier in the fields of mobile AI photography and digital creation.

Although these features are currently known only from code analysis of the beta build and have not been officially released to the public, the "mark, then modify immediately" logic they demonstrate marks a key step forward in multimodal models' ability to perceive users' fine-grained aesthetic intent, and should further accelerate the spread of AI image generation from entertainment into professional creative workflows.