
Free Google AI Image Analysis Tool For Image Recognition

Learn how Google's Vision tool classifies images at scale and how you can use it to see the way Google interprets your images.

Google offers an AI image classification tool that analyzes images to classify the content and assign labels to them.

The tool is intended as a demonstration of Google Vision, which can classify images automatically at scale. It can also be used as a standalone tool to see how an image detection algorithm views your images and what they’re relevant for.

Even if you don’t use the Google Vision API to scale image detection and classification, the tool offers a useful view into what Google’s image-related algorithms are capable of. It is worth uploading a few images to see how Google’s Vision algorithm classifies them.

This tool demonstrates Google’s AI and Machine Learning algorithms for understanding images.

It’s a part of Google’s Cloud Vision API suite that offers vision machine learning models for apps and websites.

Does Cloud Vision Tool Reflect Google’s Algorithm?

This is just a machine learning model and not a ranking algorithm.

So, it is unrealistic to use this tool and expect it to reflect something about Google’s image ranking algorithm.

However, it is a great tool for understanding how Google’s AI and Machine Learning algorithms can understand images, and it will offer an educational insight into how advanced today’s vision-related algorithms are.

The information provided by this tool can be used to understand how a machine might understand what an image is about and possibly provide an idea of how accurately that image fits the overall topic of a webpage.

Why Is An Image Classification Tool Useful?

Images can play an important role in search visibility and click-through rate (CTR) across the various ways that webpage content is surfaced on Google.

Potential site visitors who are researching a topic use images to navigate to the right content.

Thus, using attractive images that are relevant for search queries can, within certain contexts, be helpful for quickly communicating that a webpage is relevant to what a person is searching for.

The Google Vision tool provides a way to understand how an algorithm may view and classify an image in terms of what is in the image.

Google’s guidelines for image SEO recommend:

“High-quality photos appeal to users more than blurry, unclear images. Also, sharp images are more appealing to users in the result thumbnail and increase the likelihood of getting traffic from users.”

If the Vision tool has trouble identifying what an image is about, that may be a sign that potential site visitors are having the same trouble and deciding not to visit the site.

What Is The Google Image Tool?

The tool is a way to demo Google’s Cloud Vision API.

The Cloud Vision API is a service that lets apps and websites connect to the machine learning tool, providing image analysis services that can be scaled.

The standalone tool itself allows you to upload an image, and it tells you how Google’s machine learning algorithm interprets it.

Google’s Cloud Vision page describes how the service can be used like this:

“Cloud Vision allows developers to easily integrate vision detection features within applications, including image labeling, face and landmark detection, optical character recognition (OCR), and tagging of explicit content.”

These are five ways Google’s image analysis tools classify uploaded images:

  1. Faces.
  2. Objects.
  3. Labels.
  4. Properties.
  5. Safe Search.
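
For readers who want to move from the demo to the API, here is a minimal sketch of how a request covering those five tabs might be assembled. It only builds the JSON body for the API's images:annotate REST method and does not send anything. The feature type names are the ones the Cloud Vision API documents, but the mapping from the demo tool's tab names to those types is an assumption made for illustration, and `build_annotate_request` is a hypothetical helper, not part of any Google library.

```python
import base64
import json

# Assumed mapping from the demo tool's tabs to Cloud Vision feature types.
TAB_TO_FEATURE = {
    "Faces": "FACE_DETECTION",
    "Objects": "OBJECT_LOCALIZATION",
    "Labels": "LABEL_DETECTION",
    "Properties": "IMAGE_PROPERTIES",
    "Safe Search": "SAFE_SEARCH_DETECTION",
}


def build_annotate_request(image_bytes, tabs=None, max_results=10):
    """Return a JSON-serializable body for one image and the chosen tabs."""
    tabs = list(TAB_TO_FEATURE) if tabs is None else tabs
    features = [
        {"type": TAB_TO_FEATURE[tab], "maxResults": max_results} for tab in tabs
    ]
    return {
        "requests": [
            {
                # The REST API accepts the image as base64-encoded content.
                "image": {"content": base64.b64encode(image_bytes).decode("ascii")},
                "features": features,
            }
        ]
    }


if __name__ == "__main__":
    # Placeholder bytes stand in for a real image file read from disk.
    body = build_annotate_request(b"placeholder, not a real image", ["Labels", "Safe Search"])
    print(json.dumps(body, indent=2))
```

Sending the body requires a Google Cloud project and credentials, which are outside the scope of this sketch.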

Faces

The “faces” tab provides an analysis of the emotion expressed by the image.

The result is fairly accurate.

The image below shows a person described as confused, although confusion is not really an emotion.

The AI describes the emotion expressed in the face as surprised, with a 96% confidence score.

Google Image AI. Composite image created by author, July 2022; images sourced from Google Cloud Vision API and Shutterstock/Cast Of Thousands
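
When the same analysis is run through the API, each detected face comes back with likelihood ratings for several emotions rather than a single word. The sketch below shows one way to pick the strongest one. The field names and likelihood values follow the API's face annotation format, while `strongest_emotion` and the sample data are hypothetical and made up for illustration.

```python
# Likelihood values used by the Vision API, ordered from weakest to strongest.
LIKELIHOOD_ORDER = [
    "UNKNOWN", "VERY_UNLIKELY", "UNLIKELY", "POSSIBLE", "LIKELY", "VERY_LIKELY",
]

# Emotion name -> field name in a face annotation.
EMOTION_FIELDS = {
    "joy": "joyLikelihood",
    "sorrow": "sorrowLikelihood",
    "anger": "angerLikelihood",
    "surprise": "surpriseLikelihood",
}


def strongest_emotion(face):
    """Return (emotion, likelihood) for the highest-rated emotion on one face."""
    def rank(emotion):
        return LIKELIHOOD_ORDER.index(face.get(EMOTION_FIELDS[emotion], "UNKNOWN"))

    best = max(EMOTION_FIELDS, key=rank)
    return best, face.get(EMOTION_FIELDS[best], "UNKNOWN")


# Made-up face annotation, not output from a real request.
sample_face = {
    "joyLikelihood": "VERY_UNLIKELY",
    "sorrowLikelihood": "VERY_UNLIKELY",
    "angerLikelihood": "VERY_UNLIKELY",
    "surpriseLikelihood": "VERY_LIKELY",
}

if __name__ == "__main__":
    print(strongest_emotion(sample_face))
```

Note that the API only rates joy, sorrow, anger, and surprise, so a stock photo described as "confused" has to land on one of those four.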

Objects

The “objects” tab shows what objects are in the image, like glasses, person, etc.

The tool accurately identifies horses and people.

Screenshot of Google Vision tool. Composite image created by author, July 2022; images sourced from Google Cloud Vision API and Shutterstock/Lukas Gojda

Labels

The “labels” tab shows details about the image that Google recognizes, like ears and mouth but also conceptual aspects like portrait and photography.

This is particularly interesting because it shows how deeply Google’s image AI can understand what is in an image.

Screenshot of Google Vision AI identifying objects within an uploaded photo. Composite image created by author, July 2022; images sourced from Google Cloud Vision API and Shutterstock/Lukas Gojda

Does Google use that as part of the ranking algorithm? That’s something that is not known.
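
Whatever Google does with labels internally, they can be used in a simple audit of how well an image matches a page's topic. The following is a rough sketch that keeps the labels above a score threshold and checks which of them overlap with a list of topic terms. The `description` and `score` fields follow the API's label annotation format. The helper names, the 0.8 threshold, and the sample data are assumptions chosen for illustration.

```python
def confident_labels(label_annotations, min_score=0.8):
    """Return lowercase label descriptions whose score meets the threshold."""
    return [
        label["description"].lower()
        for label in label_annotations
        if label.get("score", 0.0) >= min_score
    ]


def topic_overlap(labels, topic_terms):
    """Return the labels that also appear in the page's topic terms."""
    wanted = {term.lower() for term in topic_terms}
    return sorted(set(labels) & wanted)


# Made-up label annotations, not output from a real request.
sample_labels = [
    {"description": "Horse", "score": 0.97},
    {"description": "Photography", "score": 0.84},
    {"description": "Grass", "score": 0.62},
]

if __name__ == "__main__":
    labels = confident_labels(sample_labels)
    print(labels)
    print(topic_overlap(labels, ["Horse", "Racing"]))
```

An empty overlap does not prove an image is off-topic, since labels and topic terms can describe the same thing in different words, but it can flag images worth a manual look.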

Properties

The “properties” tab shows the dominant colors used in the image.

Screenshot of Google Vision tool identifying the dominant colors in an image. Screenshot from Google Cloud Vision API, July 2022

On the surface, the point of this section isn’t obvious, and it may seem to have little utility.

But in reality, the colors of an image can be very important, particularly for a featured image.

Images that contain a very wide range of colors can be an indication of a poorly-chosen image with a bloated size, which is something to look out for.

Another observation about images and color is that images with a darker color range can, in some cases, result in larger image files.

In terms of SEO, the Properties section may be useful for identifying images across an entire website that can be swapped out for ones that are less bloated in size.

Also, color ranges for featured images that are muted or even grayscale might be something to look out for because featured images that lack vivid colors tend to not pop out on social media, Google Discover, and Google News.

For example, featured images that are vivid can be easily scanned and possibly receive a higher click-through rate (CTR) when shown in the search results or in Google Discover, since they call out to the eye better than images that are muted and fade into the background.

There are many variables that can affect the CTR performance of images, but this provides a way to scale up the process of auditing the images of an entire website.

eBay conducted a study of product images and CTR and discovered that images with lighter background colors tended to have a higher CTR.

The eBay researchers noted:

“In this paper, we find that the product image features can have an impact on user search behavior.

We find that some image features have correlation with CTR in a product search engine and that that these features can help in modeling click through rate for shopping search applications.

This study can provide sellers with an incentive to submit better images for products that they sell.”

Anecdotally, the use of vivid colors for featured images might be helpful for increasing the CTR for sites that depend on traffic from Google Discover and Google News.

Obviously, there are many factors that impact the CTR from Google Discover and Google News. But an image that stands out from the others may be helpful.

So for that reason, using the Vision tool to understand the colors used can be helpful for a scaled audit of images.
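
One way such a color audit might be scripted is sketched below. It averages the saturation of the dominant colors returned by the API, weighted by how much of the image each color covers, and flags images that fall below a cutoff as muted. The `dominantColors`, `color`, and `pixelFraction` fields follow the API's image properties format. The helper names and the 0.2 cutoff are assumptions, the code assumes color channels on a 0 to 255 scale, and the sample data is made up.

```python
import colorsys


def weighted_saturation(image_properties):
    """Average HSV saturation of the dominant colors, weighted by pixelFraction."""
    colors = image_properties.get("dominantColors", {}).get("colors", [])
    total_weight = sum(entry.get("pixelFraction", 0.0) for entry in colors)
    if total_weight == 0:
        return 0.0
    weighted_sum = 0.0
    for entry in colors:
        rgb = entry.get("color", {})
        # Assumes channels are reported on a 0-255 scale.
        _, saturation, _ = colorsys.rgb_to_hsv(
            rgb.get("red", 0) / 255,
            rgb.get("green", 0) / 255,
            rgb.get("blue", 0) / 255,
        )
        weighted_sum += saturation * entry.get("pixelFraction", 0.0)
    return weighted_sum / total_weight


def is_muted(image_properties, threshold=0.2):
    """Flag an image whose dominant colors are mostly gray or washed out."""
    return weighted_saturation(image_properties) < threshold


# Made-up image properties: half mid-gray, half pure red.
sample_properties = {
    "dominantColors": {
        "colors": [
            {"color": {"red": 128, "green": 128, "blue": 128}, "pixelFraction": 0.5},
            {"color": {"red": 255, "green": 0, "blue": 0}, "pixelFraction": 0.5},
        ]
    }
}

if __name__ == "__main__":
    print(weighted_saturation(sample_properties), is_muted(sample_properties))
```

Saturation is only a proxy for how vivid an image looks, so the cutoff would need tuning against images that are known to perform well or poorly.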

Safe Search

Safe Search shows how the image rates for unsafe content. The categories of potentially unsafe content are as follows:

  • Adult.
  • Spoof.
  • Medical.
  • Violence.
  • Racy.

Google search has filters that evaluate a webpage for unsafe or inappropriate content.

For that reason, the Safe Search section of the tool is very important. If an image unintentionally triggers a safe search filter, the webpage may fail to rank for potential site visitors who are looking for the content on the webpage.

Google Vision Safe Search Analysis. Screenshot from Google Cloud Vision API, July 2022

The above screenshot shows the evaluation of a photo of racehorses on a race track. The tool accurately identifies that there is no medical or adult content in the image.
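
For a site-wide check, the same five ratings can be read from an API response and compared against a cutoff. The sketch below lists every category rated at or above a chosen likelihood. The category names and likelihood values follow the API's safe search annotation format. `flagged_categories`, the default POSSIBLE cutoff, and the sample data are assumptions for illustration, and nothing here reflects how Google's own search filters decide.

```python
# Likelihood values used by the Vision API, ordered from weakest to strongest.
LIKELIHOOD_ORDER = [
    "UNKNOWN", "VERY_UNLIKELY", "UNLIKELY", "POSSIBLE", "LIKELY", "VERY_LIKELY",
]

SAFE_SEARCH_CATEGORIES = ["adult", "spoof", "medical", "violence", "racy"]


def flagged_categories(safe_search, threshold="POSSIBLE"):
    """Return the categories rated at or above the threshold likelihood."""
    limit = LIKELIHOOD_ORDER.index(threshold)
    return [
        category
        for category in SAFE_SEARCH_CATEGORIES
        if LIKELIHOOD_ORDER.index(safe_search.get(category, "UNKNOWN")) >= limit
    ]


# Made-up safe search annotation, not output from a real request.
sample_safe_search = {
    "adult": "VERY_UNLIKELY",
    "spoof": "VERY_UNLIKELY",
    "medical": "UNLIKELY",
    "violence": "VERY_UNLIKELY",
    "racy": "POSSIBLE",
}

if __name__ == "__main__":
    print(flagged_categories(sample_safe_search))
```

A stricter or looser cutoff can be passed as `threshold`, for example `"LIKELY"` to reduce false alarms.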

Text: Optical Character Recognition (OCR)

Google Vision has a remarkable ability to read text that is in a photograph.

The Vision tool is able to accurately read the text in the image below:

Screenshot of Vision tool accurately reading text in an image. Composite image created by author, July 2022; images sourced from Google Cloud Vision API and Shutterstock/Melissa King

As can be seen above, Google does have the ability to read words in images through Optical Character Recognition (OCR).

However, that’s not an indication that Google uses OCR for search ranking purposes.

The fact is that Google recommends the use of words around images to help it understand what an image is about. It may be the case that, even for images with text within them, Google still depends on the words surrounding the image to understand what the image is about and relevant for.

Google’s guidelines on image SEO repeatedly stress using words to provide context for images.

“By adding more context around images, results can become much more useful, which can lead to higher quality traffic to your site.

…Whenever possible, place images near relevant text.

…Google extracts information about the subject matter of the image from the content of the page…

…Google uses alt text along with computer vision algorithms and the contents of the page to understand the subject matter of the image.”

It’s very clear from Google’s documentation that Google depends on the context of the text around images for understanding what the image is about.
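
Given that emphasis on surrounding text, one possible use of the OCR output is to check whether words that appear inside an image are also present in the alt text or nearby copy. The sketch below does that comparison. It relies on the API's text annotation format, where the first entry holds the full detected text and the remaining entries hold individual words. The helper names and sample data are hypothetical, and the word matching is deliberately simple.

```python
import re


def detected_words(response):
    """Return individual words, skipping the first entry (the full text)."""
    return [a["description"] for a in response.get("textAnnotations", [])[1:]]


def words_missing_from_context(response, context_text):
    """Return lowercase image words that do not appear in the surrounding text."""
    context = set(re.findall(r"\w+", context_text.lower()))
    missing = []
    for word in detected_words(response):
        key = word.lower()
        if key not in context and key not in missing:
            missing.append(key)
    return missing


# Made-up OCR response, not output from a real request.
sample_response = {
    "textAnnotations": [
        {"locale": "en", "description": "OPEN\n24 HOURS\n"},
        {"description": "OPEN"},
        {"description": "24"},
        {"description": "HOURS"},
    ]
}

if __name__ == "__main__":
    print(words_missing_from_context(sample_response, "Our shop is open every day"))
```

Words that show up only inside the image are candidates for repeating in the alt text or in the copy near the image.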

Takeaway

Google’s Vision AI tool offers a way to test drive Google’s Vision AI before a publisher connects to it via an API to scale image classification and extract data for use within a site.

But it also provides insight into how far algorithms for image labeling, annotation, and optical character recognition have come.

Upload an image to the tool to see how it is classified and whether a machine sees it the same way that you do.

Featured image by Maksim Shmeljov/Shutterstock


Roger Montti

Owner at Martinibuster.com

Roger Montti is a search marketer with over 20 years experience. I offer site audits, phone consultations and content and ...
