Google Revamps Entire Crawler Documentation

Google has launched a serious revamp of its Crawler documentation, shrinking the primary overview web page and splitting content material into three new, extra centered pages. Though the changelog downplays the adjustments there’s a completely new part and principally a rewrite of your entire crawler overview web page. The extra pages permits Google to extend the knowledge density of all of the crawler pages and improves topical protection.

What Modified?

Google’s documentation changelog notes two adjustments however there’s really much more.

Listed here are a few of the adjustments:

Added an up to date person agent string for the GoogleProducer crawler
Added content material encoding data
Added a brand new part about technical properties

The technical properties part incorporates totally new data that didn’t beforehand exist. There are not any adjustments to the crawler habits, however by creating three topically particular pages Google is ready to add extra data to the crawler overview web page whereas concurrently making it smaller.

That is the brand new details about content material encoding (compression):

“Google’s crawlers and fetchers assist the next content material encodings (compressions): gzip, deflate, and Brotli (br). The content material encodings supported by every Google person agent is marketed within the Settle for-Encoding header of every request they make. For instance, Settle for-Encoding: gzip, deflate, br.”

There’s extra details about crawling over HTTP/1.1 and HTTP/2, plus an announcement about their purpose being to crawl as many pages as attainable with out impacting the web site server.

What Is The Objective Of The Revamp?

The change to the documentation was resulting from the truth that the overview web page had turn out to be giant. Extra crawler data would make the overview web page even bigger. A choice was made to interrupt the web page into three subtopics in order that the particular crawler content material may proceed to develop and making room for extra normal data on the overviews web page. Spinning off subtopics into their very own pages is an excellent resolution to the issue of how greatest to serve customers.

That is how the documentation changelog explains the change:

“The documentation grew very lengthy which restricted our capacity to increase the content material about our crawlers and user-triggered fetchers.

…Reorganized the documentation for Google’s crawlers and user-triggered fetchers. We additionally added specific notes about what product every crawler impacts, and added a robots.txt snippet for every crawler to show how one can use the person agent tokens. There have been no significant adjustments to the content material in any other case.”

The changelog downplays the adjustments by describing them as a reorganization as a result of the crawler overview is considerably rewritten, along with the creation of three model new pages.

Whereas the content material stays considerably the identical, the division of it into sub-topics makes it simpler for Google so as to add extra content material to the brand new pages with out persevering with to develop the unique web page. The unique web page, known as Overview of Google crawlers and fetchers (person brokers), is now really an summary with extra granular content material moved to standalone pages.

Google revealed three new pages:

Widespread crawlers
Particular-case crawlers
Consumer-triggered fetchers

1. Widespread Crawlers

Because it says on the title, these are frequent crawlers, a few of that are related to GoogleBot, together with the Google-InspectionTool, which makes use of the GoogleBot person agent. All the bots listed on this web page obey the robots.txt guidelines.

These are the documented Google crawlers:

Googlebot
Googlebot Picture
Googlebot Video
Googlebot Information
Google StoreBot
Google-InspectionTool
GoogleOther
GoogleOther-Picture
GoogleOther-Video
Google-CloudVertexBot
Google-Prolonged

3. Particular-Case Crawlers

These are crawlers which can be related to particular merchandise and are crawled by settlement with customers of these merchandise and function from IP addresses which can be distinct from the GoogleBot crawler IP addresses.

Checklist of Particular-Case Crawlers:

AdSense
Consumer Agent for Robots.txt: Mediapartners-Google
AdsBot
Consumer Agent for Robots.txt: AdsBot-Google
AdsBot Cell Net
Consumer Agent for Robots.txt: AdsBot-Google-Cell
APIs-Google
Consumer Agent for Robots.txt: APIs-Google
Google-Security
Consumer Agent for Robots.txt: Google-Security

3. Consumer-Triggered Fetchers

The Consumer-triggered Fetchers web page covers bots which can be activated by person request, defined like this:

“Consumer-triggered fetchers are initiated by customers to carry out a fetching operate inside a Google product. For instance, Google Web site Verifier acts on a person’s request, or a website hosted on Google Cloud (GCP) has a function that enables the location’s customers to retrieve an exterior RSS feed. As a result of the fetch was requested by a person, these fetchers usually ignore robots.txt guidelines. The final technical properties of Google’s crawlers additionally apply to the user-triggered fetchers.”

The documentation covers the next bots:

Feedfetcher
Google Writer Heart
Google Learn Aloud
Google Web site Verifier

Takeaway:

Google’s crawler overview web page grew to become overly complete and probably much less helpful as a result of folks don’t at all times want a complete web page, they’re simply keen on particular data. The overview web page is much less particular but in addition simpler to grasp. It now serves as an entry level the place customers can drill right down to extra particular subtopics associated to the three sorts of crawlers.

This transformation presents insights into how one can clean up a web page that may be underperforming as a result of it has turn out to be too complete. Breaking out a complete web page into standalone pages permits the subtopics to handle particular customers wants and probably make them extra helpful ought to they rank within the search outcomes.

I might not say that the change displays something in Google’s algorithm, it solely displays how Google up to date their documentation to make it extra helpful and set it up for including much more data.

Learn Google’s New Documentation

Overview of Google crawlers and fetchers (user agents)

List of Google’s common crawlers

List of Google’s special-case crawlers

List of Google user-triggered fetchers

Featured Picture by Shutterstock/Forged Of Hundreds

Source link

Google Revamps Entire Crawler Documentation

Using Google Merchant Center Next For Competitive Analysis

The Definitive Guide For Your Online Store

Bluesky Emerges As Traffic Source: Publishers Report 3x Engagement

Google Chrome site engagement service metrics

The Top 10 Newsletter Strategies to Boost Your Engagement and Reach

The Ultimate Cheat Sheet to Holiday Advertising in 2025

Data, AI, and the New Era of Creator-Led Growth

A Comprehensive Guide to the Future of Influencer Marketing 2025–2026

18 AWeber Alternatives: Our Top Choice Revealed

Top Insights

The Top 10 Newsletter Strategies to Boost Your Engagement and Reach

The Ultimate Cheat Sheet to Holiday Advertising in 2025

Data, AI, and the New Era of Creator-Led Growth

Google Revamps Entire Crawler Documentation

What Modified?

What Is The Objective Of The Revamp?

1. Widespread Crawlers

3. Particular-Case Crawlers

Checklist of Particular-Case Crawlers:

3. Consumer-Triggered Fetchers

Takeaway:

Learn Google’s New Documentation

Related Posts