AI Mode Gets Smarter: Google Adds Visual Search Capabilities

AI Mode Gets Smarter: Google Adds Visual Search Capabilities
  • calendar_today August 14, 2025
  • Technology

Google which serves as the main access point to the internet is rapidly incorporating artificial intelligence into its main search service which represents a major shift in our digital interactions. The company launched AI features in its search capabilities during 2024’s early months but marked a substantial advancement with the “AI Mode” introduction the previous month. The new mode unveils an exciting vision of a future where the traditional ten blue link search results become obsolete artifacts from internet history.

Google continues to advance its AI-driven search results by integrating powerful multimodal capabilities following initial positive user feedback on AI Mode. The evolution of this service relies on a custom iteration of Google’s Gemini large language model (LLM). Google’s confirmation about the new model indicates that it supports multimodal input which enables users to integrate images directly into their searches when using AI Mode.

Gemini Initiates the New Visual Search Era:

The new update brings a user-friendly button to the AI Mode search bar. The intuitive update allows users to take a live photo or upload an existing image from their device. The upgraded Gemini model demonstrates exceptional talent in visual content interpretation through its integration with Google Lens’s advanced object recognition features. Google explains how Lens is fundamental in its function to accurately detect specific items in the uploaded pictures. After obtaining detailed contextual data, Google Lens passes the information to AI Mode, which then performs several interconnected sub-queries through a company-defined “fan-out technique.”

Google demonstrates how users can benefit from this new feature through a detailed example. A user shows AI Mode multiple book covers and asks for recommendations about related books. Google Lens proceeds to analyze and recognize every single book title displayed within the images. The detailed identification process enables AI Mode to use distinct book features when generating responses. The AI provides detailed and relevant recommendations for similar books and successfully answers follow-up questions that arise from the initial set of displayed book covers.

Google envisions AI Mode as a fundamental component:

AI Mode represents a fundamental component of Google’s strategic plan to sustain its dominant position as the leading search directory on the internet. The company has previously stated that a significant number of users use traditional search methods to discover direct and precise responses to their inquiries. AI Mode presents these users with a faster and more powerful method to locate the exact information they need. Google’s initial telemetry data from AI Mode shows a significant transformation in the way users search. According to the company’s findings, users submit about twice the amount of text in their queries when using AI Mode compared to traditional web searches. Google sees this pattern as users asking more detailed questions, but it might also indicate that users need to add extra context for the AI to deliver accurate search results.

Users have not yet encountered AI Mode in their regular web sessions despite the AI Mode feature having existed for multiple weeks. Google first released this groundbreaking feature only to Google One AI Premium subscribers who needed to activate it through a manual process in Google Labs.

AI Mode accessibility is currently moving toward a broad expansion phase. Google declared its plan to provide access to AI Mode for millions of additional Labs users in the United States who do not have subscriptions to its premium AI service tier.

New users must opt in to use AI Mode, but the current trend predicts its evolution into a basic search feature available to a broader audience. AI Mode could become Google’s primary search interface in the near future as its multimodal integration marks a significant step toward the company’s vision of a visually immersive and user-friendly web information discovery experience.