this post was submitted on 18 Aug 2024
75 points (77.0% liked)

Privacy

31236 readers
890 users here now

A place to discuss privacy and freedom in the digital world.

Privacy has become a very important issue in modern society, with companies and governments constantly abusing their power, more and more people are waking up to the importance of digital privacy.

In this community everyone is welcome to post links and discuss topics related to privacy.

Some Rules

Related communities

Chat rooms

much thanks to @gary_host_laptop for the logo design :)

founded 4 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] CynicusRex@lemmy.ml 9 points 1 month ago (1 children)

#TL;DR:

User-agent: GPTBot
Disallow: /
User-agent: ChatGPT-User
Disallow: /
User-agent: Google-Extended
Disallow: /
User-agent: PerplexityBot
Disallow: /
User-agent: Amazonbot
Disallow: /
User-agent: ClaudeBot
Disallow: /
User-agent: Omgilibot
Disallow: /
User-Agent: FacebookBot
Disallow: /
User-Agent: Applebot
Disallow: /
User-agent: anthropic-ai
Disallow: /
User-agent: Bytespider
Disallow: /
User-agent: Claude-Web
Disallow: /
User-agent: Diffbot
Disallow: /
User-agent: ImagesiftBot
Disallow: /
User-agent: Omgilibot
Disallow: /
User-agent: Omgili
Disallow: /
User-agent: YouBot
Disallow: /
[–] mox@lemmy.sdf.org 7 points 1 month ago (1 children)

Of course, nothing stops a bot from picking a user agent field that exactly matches a web browser.

[–] JackbyDev@programming.dev 3 points 1 month ago (1 children)

Nothing stops a bot from choosing to not read robots.txt

[–] mox@lemmy.sdf.org 2 points 1 month ago* (last edited 1 month ago)

Indeed, as has already been said repeatedly in other comments.