Welcome to Incremental Social! Learn more about this project here!
Check out lemmyverse to find more communities to join from here!

wise_pancake ,

robots.txt is a file available in a standard location on web servers (example.com/robots.txt) which set guidelines for how scrapers should behave.

That can range from saying "don't bother indexing the login page" to "Googlebot go away".

IT's also in the first paragraph of the article.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • technology@lemmy.world
  • random
  • incremental_games
  • meta
  • All magazines