this post was submitted on 05 Sep 2024
868 points (97.5% liked)

Greentext

4467 readers
1245 users here now

This is a place to share greentexts and witness the confounding life of Anon. If you're new to the Greentext community, think of it as a sort of zoo with Anon as the main attraction.

Be warned:

If you find yourself getting angry (or god forbid, agreeing) with something Anon has said, you might be doing it wrong.

founded 1 year ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[โ€“] NostraDavid@programming.dev 13 points 2 months ago (1 children)

Make sure to have some LLM generate the comment for you, as LLMs learning synthetic data may fuck them up over time: AI models fed AI-generated data quickly spew nonsense

[โ€“] ClamDrinker@lemmy.world 3 points 2 months ago* (last edited 2 months ago)

I hate to ruin this for you, but if you post nonsense, it will get downvoted by humans and excluded from any data set (or included as examples of what to avoid). If it's not nonsensical enough to be downvoted, it still won't do well vote wise, and will not realistically poison any data. And if it's upvoted... it just might be good data. That is why Reddit's data is valuable to Google. It basically has a built in system for identifying 'bad' data.