this post was submitted on 01 Sep 2024
351 points (96.8% liked)

RPGMemes


Humor, jokes, memes about TTRPGs

founded 1 year ago

This comic follows on from the previous comic, which will almost certainly provide context.

You might not wanna be famous, but when you're level 10, every organization within a mile is watching what you're doing.

[–] ahdok@ttrpg.network 38 points 2 months ago (8 children)

Most of these AI scrapers don't respect robots.txt, so I'm not sure that really helps much, but... we have tried doing all of these things.

[–] itslilith@lemmy.blahaj.zone 25 points 2 months ago (6 children)

Someone on Lemmy suggested creating a dummy endpoint that normal people won't be able to navigate to, and disallowing it in robots.txt.

Then when somebody crawls it, you know they are ignoring robots.txt, and you IP-ban them.
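
For anyone who wants to try it, here's a rough sketch of the idea in Python, not a real deployment. The trap path, the `TrapBanList` class, and the in-memory ban set are all made up for illustration; an actual site would more likely do the banning in its web server or firewall config. The last function uses the standard library's robots.txt parser to show that a crawler which does respect robots.txt would never request the trap.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical trap path: never linked anywhere a human would click.
TRAP_PATH = "/do-not-crawl/"

# robots.txt content telling every crawler to stay out of the trap.
ROBOTS_TXT = f"User-agent: *\nDisallow: {TRAP_PATH}\n"


class TrapBanList:
    """Remembers which client IPs have requested the trap path.

    Assumption: bans live in an in-memory set; a real site would
    persist them or push them to a firewall.
    """

    def __init__(self, trap_path=TRAP_PATH):
        self.trap_path = trap_path
        self.banned = set()

    def handle(self, ip, path):
        """Return an HTTP status code for a request from ip to path."""
        if path.startswith(self.trap_path):
            # Only a client ignoring robots.txt should ever get here.
            self.banned.add(ip)
        if ip in self.banned:
            return 403
        return 200


def polite_crawler_may_fetch(path):
    """Check what a robots.txt-respecting crawler is allowed to fetch."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch("*", path)
```

Once an IP touches the trap, every later request from it gets a 403, while other visitors are unaffected. The ban is keyed on IP address only, so a scraper that changes address between requests would not be caught by this sketch.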

[–] ahdok@ttrpg.network 15 points 2 months ago (5 children)

That's pretty clever.

I think that these AI scrapers might be smart enough that this doesn't really work though - at least if I were designing them I'd have them all come from dynamic IPs and not have any of them bother hitting the same target more than once. These things are very dedicated to acquiring content without consent, and if they're capable of causing problems for (say) Reddit, I'm not sure my little website is going to have much luck deterring them.

Honestly a better strategy might be to just glaze everything I draw.

[–] Johanno 7 points 2 months ago (1 children)

I am not sure if it costs money, but you could implement captchas.

Or use Cloudflare to do the bot detection for you.

Worst case, you make it so people need to create an account to see content.

[–] ahdok@ttrpg.network 4 points 2 months ago

Well, we are already using Cloudflare, which is one of the other reasons why the site is so slow... I don't think the other two suggestions prevent a scraper from requesting the information from the server... I think they'd just make it more arduous for real people to access the content.
