Is there a simple way to severely impede web scraping and LLM data collection of my website?

I am working on a simple static website that gives visitors basic information about myself and the work I do. I want this as a way to introduce myself to potential clients, collaborators, etc., rather than relying solely on LinkedIn as my visiting card.

This may sound rather oxymoronic, given that I am literally going to be placing (some relevant) details about myself and my work on the internet, but I want to limit access to the website by bots, web scrapers, and content collection for LLMs.

Is this a realistic expectation?

Also, any suggestions for privacy-respecting yet inexpensive domains that I can purchase in Europe would be a great help.

Deckweiss ,

I did this a while back for blocking LLMs, and there are more methods discussed in that thread's comments.

https://lemmy.world/post/14767952

TheAnonymouseJoker ,

Why can I not open your post?

Maeve ,

Works for me.

TheAnonymouseJoker ,

Funny that Jerboa did not open it on my account, but it opened in a web browser.

Maeve ,

Occasionally I run into glitches on various instances, but visiting the original post on the original instance works. Lemmy is new enough that I don't mind seeking workarounds, by asking or fiddling around. Best!

RvTV95XBeo ,

Are you perhaps an LLM in disguise?

TheAnonymouseJoker ,

Jerboa did not open it for some reason, but the web browser did. Also, check my account age.

otp ,

That's exactly what an LLM would say...

TheAnonymouseJoker ,

I did not know LLMs were moderators on Lemmy :D
