Categories: SEO

Google Explains When A Spike In Crawling Is Bad

Google’s Gary Illyes posted on LinkedIn about two common cases where a spike in Googlebot crawling is a bad sign. The short answer: when Googlebot starts crawling an infinite section of your site (like calendar pages that go on forever), and when your site has been hacked and flooded with new hacked pages.

This is probably basic SEO knowledge for most of you, but there might be a reason he is sharing it now.

Gary wrote, “Don’t get happy prematurely when search engines unexpectedly start to crawl like crazy from your site.” “A sudden increase in crawling can mean good things, sure, but it can also mean something is wrong,” he added.

In these cases, he said, “Treat unexpected sharp increases in crawling as a symptom of an issue until you can prove otherwise.” Then he joked, “Or, you know, maybe I’m just a hardline pessimist.”
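One way to act on that advice is to watch daily Googlebot hit counts and flag days that sit far above the usual level. The sketch below is only an illustration: the function name `find_crawl_spikes`, the 3x threshold, and the sample numbers are all assumptions, not anything Gary described, and the counts would have to come from your own server logs or crawl stats.

```python
from statistics import median


def find_crawl_spikes(daily_hits: dict[str, int], factor: float = 3.0) -> list[str]:
    """Return the days whose hit count exceeds `factor` times the median day.

    `daily_hits` maps a date string to the number of Googlebot requests
    seen that day. The threshold `factor` is an arbitrary assumption.
    """
    if not daily_hits:
        return []
    baseline = median(daily_hits.values())
    if baseline <= 0:
        return []
    return [day for day, hits in daily_hits.items() if hits > factor * baseline]


# Hypothetical counts, made up for illustration.
sample = {
    "2024-06-25": 100,
    "2024-06-26": 110,
    "2024-06-27": 95,
    "2024-06-28": 105,
    "2024-06-29": 900,
}
spikes = find_crawl_spikes(sample)
```

A flagged day is not proof of a problem; it is just a prompt to look at which URLs were being crawled.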

Here are two issues that come up way too often when looking at sharp increases in crawling, he wrote:

Infinite Spaces:

The first example, infinite spaces, is one of the most common. He said it happens when “you have a calendar thingie on your site, or an infinitely filterable product listings page. If your site generally has pages that search users find helpful, crawlers will get excited about these infinite spaces for a time. robots.txt is your friend, use it.”
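As a rough sketch of what using robots.txt here could look like, the snippet below defines two disallow rules and checks them with Python's standard `urllib.robotparser`. The paths `/calendar/` and `/products/filter/` are hypothetical and would need to match how your own site builds its URLs. Note that this standard-library parser only does simple prefix matching, so the example avoids wildcard patterns.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: the paths are assumptions made up for illustration.
ROBOTS_TXT = """\
User-agent: *
Disallow: /calendar/
Disallow: /products/filter/
"""


def is_crawlable(path: str, user_agent: str = "Googlebot") -> bool:
    """Return True if the rules above allow `user_agent` to fetch `path`."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, path)


blocked_calendar = is_crawlable("/calendar/2031/05")
blocked_filter = is_crawlable("/products/filter/color/red")
allowed_page = is_crawlable("/products/shoes")
```

Checking a handful of real URLs this way before deploying a rule helps confirm it blocks the infinite section without also blocking pages you want crawled.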

Hacked Content:

Another common example is hacked content. He said, “if a no-good-doer somehow managed to get access to your server’s file system or your content management system, they might flood your otherwise dandy site with, well, crap. If your site generally has pages that search users find helpful, crawlers will get excited about these new pages for a time and happily crawl them. https://web.dev/hacked has great resources about what to do in these cases. (tangent: yes, this is more cracking than hacking but apparently the internet is fine with the misnomer).”
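One simple way to spot pages you did not create is to group the URLs Googlebot requested by their first path segment and list any section you do not recognize. This is a minimal sketch under stated assumptions: the function name `unexpected_sections`, the sample URLs, and the list of known sections are hypothetical, and real input would come from your own access logs.

```python
from collections import Counter
from urllib.parse import urlsplit


def unexpected_sections(crawled_urls, known_sections):
    """Count crawled URLs whose first path segment is not in `known_sections`.

    The homepage is represented by the empty string "".
    """
    counts = Counter()
    for url in crawled_urls:
        path = urlsplit(url).path
        segment = path.strip("/").split("/")[0]
        if segment not in known_sections:
            counts[segment] += 1
    return counts


# Hypothetical data, made up for illustration.
urls = [
    "https://example.com/blog/post-1",
    "https://example.com/cheap-pills/a",
    "https://example.com/cheap-pills/b",
    "https://example.com/",
]
suspicious = unexpected_sections(urls, known_sections={"", "blog"})
```

An unfamiliar section that suddenly attracts a lot of crawling is worth checking against what your CMS actually publishes.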

Forum discussion at LinkedIn.

Published by
Chris Barnhart
