this post was submitted on 11 Sep 2024
68 points (95.9% liked)

Data is Beautiful


cross-posted from: https://lemmy.dbzer0.com/post/27579423

This is my first try at creating a map of Lemmy. I based it on the overlap of commenters between communities.

I only used communities on the 35 most active instances of the past month, and only collected comments going back to August 1, 2024 at the earliest (sometimes not that far back, if I got an invalid response).

I scaled it by the percentage of a community's comments that each commenter made.

Here is the code for the crawler and data that was used to make the map:

https://codeberg.org/danterious/Lemmy_map
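For anyone who wants the idea without reading the repo, here is a rough sketch of the weighting described above. It is not taken from the linked code: the function names, the `(user, community)` input format, and the use of cosine similarity to compare communities are all assumptions made for illustration.

```python
from collections import Counter, defaultdict
from math import sqrt


def community_vectors(comments):
    """Build one weight vector per community.

    `comments` is an iterable of (user, community) pairs, one per comment
    (a hypothetical input format). Each weight is the share of that
    community's comments written by that user.
    """
    counts = defaultdict(Counter)
    for user, community in comments:
        counts[community][user] += 1

    vectors = {}
    for community, per_user in counts.items():
        total = sum(per_user.values())
        vectors[community] = {u: n / total for u, n in per_user.items()}
    return vectors


def cosine_similarity(a, b):
    """Cosine similarity of two sparse {user: weight} vectors.

    Assumption: the post does not say which similarity measure was used;
    cosine is just one common choice for overlap between weighted sets.
    """
    shared = set(a) & set(b)
    dot = sum(a[u] * b[u] for u in shared)
    norm_a = sqrt(sum(w * w for w in a.values()))
    norm_b = sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)
```

Communities with no commenters in common score 0, and communities dominated by the same few commenters score close to 1. A layout or clustering step would then run on these pairwise scores.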

[–] clay_pidgin@sh.itjust.works 7 points 2 months ago* (last edited 2 months ago) (1 children)

I pretty much only browse /all, so I'm throwing the numbers off! I don't even know which communities I interact with most.

[–] Danterious@lemmy.dbzer0.com 4 points 2 months ago (2 children)

Yeah, I've noticed there aren't many clusters that encode specific ideas (there are a few, like the anime, NSFW, and sometimes instance-level clusters). Most of it just seems to be a blend. Sorta disappointing.

~Anti~ ~Commercial-AI~ ~license~ ~(CC~ ~BY-NC-SA~ ~4.0)~

[–] Asidonhopo@lemmy.world 1 points 2 months ago (1 children)

Are they clustered based on shared userbase?

[–] Danterious@lemmy.dbzer0.com 1 points 2 months ago

Yeah pretty much. There is also a weighting based on the percentage of comments in that community that come from that user.

~Anti~ ~Commercial-AI~ ~license~ ~(CC~ ~BY-NC-SA~ ~4.0)~

[–] CanadaPlus@lemmy.sdf.org 1 points 2 months ago* (last edited 2 months ago)

There's not enough data yet for the noise to cancel itself out, I think.

Place- and language-specific clusters are pretty coherent, if you go looking.