this post was submitted on 11 Sep 2024
68 points (95.9% liked)

Data is Beautiful


cross-posted from: https://lemmy.dbzer0.com/post/27579423

This is my first try at creating a map of Lemmy. I based it on the overlap between the commenters who are active in each community.

I only used communities on the 35 most active instances of the past month, and only fetched comments going back as far as August 1, 2024 (sometimes not that far, if I got an invalid response).

I scaled it so that it is based on the percentage of a community's comments that were made by each commenter.
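As a rough illustration of that scaling (a sketch, not the code from the repository): assuming the crawl yields one `(community, commenter)` pair per comment, each commenter's weight in a community is their share of that community's comments. The function name `community_weights` and the sample data are made up for the example.

```python
from collections import Counter, defaultdict


def community_weights(comments):
    """Return {community: {commenter: share of that community's comments}}.

    `comments` is an iterable of (community, commenter) pairs,
    one pair per comment (an assumed input format).
    """
    counts = defaultdict(Counter)
    for community, commenter in comments:
        counts[community][commenter] += 1

    weights = {}
    for community, per_user in counts.items():
        total = sum(per_user.values())
        # Each commenter's weight is their fraction of the community's comments.
        weights[community] = {user: n / total for user, n in per_user.items()}
    return weights


# Hypothetical sample data, not taken from the real crawl.
sample = [
    ("anime", "alice"), ("anime", "alice"), ("anime", "bob"),
    ("linux", "bob"), ("linux", "carol"),
]
weights = community_weights(sample)
```

With this sample, alice accounts for two thirds of the comments in the hypothetical "anime" community, and bob and carol each account for half of "linux".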

Here is the code for the crawler and data that was used to make the map:

https://codeberg.org/danterious/Lemmy_map

all 17 comments
[–] BackOnMyBS@lemmy.autism.place 8 points 2 months ago (2 children)

Can anyone ELI5 what the axes mean?

[–] Skua@kbin.earth 6 points 2 months ago

Nothing. There were far more dimensions in the original data, and the author asked the computer to squash that down into two axes in whatever way preserved the groupings.
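For anyone curious what "squashing down to two axes" can look like, here is a minimal sketch. The thread does not say which reduction method the map uses, so this swaps in plain PCA (via power iteration) as an assumed stand-in; t-SNE or UMAP are other common choices. The function names and the four sample communities are hypothetical.

```python
import math


def dominant_eigenvector(matrix, iters=200):
    """Approximate the top eigenvector of a symmetric matrix by power iteration."""
    d = len(matrix)
    vec = [float(j + 1) for j in range(d)]  # arbitrary non-zero start vector
    for _ in range(iters):
        nxt = [sum(matrix[i][j] * vec[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in nxt))
        if norm == 0.0:
            return [0.0] * d  # no variance left in this matrix
        vec = [x / norm for x in nxt]
    return vec


def project_to_2d(rows):
    """Project equal-length rows onto their top two principal components."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    cov = [[sum(c[i] * c[j] for c in centered) / n for j in range(d)]
           for i in range(d)]

    axes = []
    for _ in range(2):
        v = dominant_eigenvector(cov)
        lam = sum(v[i] * cov[i][j] * v[j] for i in range(d) for j in range(d))
        axes.append(v)
        # Deflate: remove the component just found so the next pass finds a new axis.
        cov = [[cov[i][j] - lam * v[i] * v[j] for j in range(d)] for i in range(d)]

    return [[sum(c[j] * axis[j] for j in range(d)) for axis in axes]
            for c in centered]


# Hypothetical communities described by commenter shares (one column per commenter).
names = ["anime", "manga", "linux", "selfhosted"]
rows = [
    [0.6, 0.4, 0.0, 0.0],
    [0.5, 0.5, 0.0, 0.0],
    [0.0, 0.0, 0.7, 0.3],
    [0.0, 0.0, 0.4, 0.6],
]
points = dict(zip(names, project_to_2d(rows)))
```

The resulting x and y values have no meaning on their own, which is the point made above: only the relative distances matter, so communities with similar commenters end up near each other.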

[–] mojofrododojo@lemmy.world 3 points 2 months ago

One is labelled Y.

I'm assuming the other is X, but it might be Z if they're fun.

[–] clay_pidgin@sh.itjust.works 7 points 2 months ago* (last edited 2 months ago) (1 children)

I pretty much only browse /all, so I'm throwing the numbers off! I don't even know which communities I interact with most.

[–] Danterious@lemmy.dbzer0.com 4 points 2 months ago (2 children)

Yeah I've noticed there aren't many clusters that encode specific ideas (there are a few like the anime, nsfw, or sometimes instance level clusters). Most of it just seems to be a blend. Sorta disappointing.

~Anti~ ~Commercial-AI~ ~license~ ~(CC~ ~BY-NC-SA~ ~4.0)~

[–] Asidonhopo@lemmy.world 1 points 2 months ago (1 children)

Are they clustered based on shared userbase?

[–] Danterious@lemmy.dbzer0.com 1 points 2 months ago

Yeah pretty much. There is also a weighting based on the percentage of comments in that community that come from that user.

~Anti~ ~Commercial-AI~ ~license~ ~(CC~ ~BY-NC-SA~ ~4.0)~
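To make "shared userbase with weighting" concrete, here is one possible way to score two communities, assuming each is represented as a `{commenter: share of comments}` dictionary. Cosine similarity is an assumption picked for illustration; the thread does not state which similarity measure the map actually uses, and the community data is invented.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two {commenter: weight} dictionaries."""
    shared = set(a) & set(b)
    dot = sum(a[user] * b[user] for user in shared)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an empty community is similar to nothing
    return dot / (norm_a * norm_b)


# Hypothetical weights: share of each community's comments per commenter.
anime = {"alice": 0.75, "bob": 0.25}
manga = {"alice": 0.5, "bob": 0.5}
linux = {"carol": 1.0}

close = cosine_similarity(anime, manga)    # same commenters, high score
distant = cosine_similarity(anime, linux)  # no shared commenters, zero
```

Communities with no commenters in common score zero, while communities whose comments come from the same people in similar proportions score close to one.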

[–] CanadaPlus@lemmy.sdf.org 1 points 2 months ago* (last edited 2 months ago)

There's not enough data yet for the noise to cancel itself out, I think.

Place and language-specific clusters are pretty coherent, if you go looking.

[–] urheber@discuss.tchncs.de 3 points 2 months ago (1 children)

WebGL is not available in my browser :(

[–] Blaze@sopuli.xyz 3 points 2 months ago (1 children)

Most of the value comes from the labels, which require WebGL, though.

[–] Asidonhopo@lemmy.world 2 points 2 months ago (1 children)

This is cool, keep adding more features. Not sure if my comment wishing this existed inspired you but nice to see a proof of concept!

[–] Danterious@lemmy.dbzer0.com 2 points 2 months ago (1 children)
[–] Asidonhopo@lemmy.world 1 points 2 months ago

Assuming you're continuing to develop this, maybe try displaying it like a bubble graph with posting frequency being the bubble size, and having nodes show all of the community names inside the largest bubbles at a particular zoom level.

Allowing the user to sort the data based on different criteria would be useful too, and links between nodes showing the size of the shared userbase might be interesting, although you couldn't show the connections between all nodes at once. One quality-of-life feature that would be great is pinch to zoom in/out rather than the current selection zoom.

Would be great if eventually users could subscribe to communities directly through this as an interface.

Keep up the good work! Thanks for considering my idea!

[–] Etterra@lemmy.world 2 points 2 months ago

Hey everybody, the fascism apologists are way down there in the corner! Let's all point and laugh!

[–] CanadaPlus@lemmy.sdf.org 2 points 2 months ago* (last edited 2 months ago)

I'm finding new communities I want to explore from this thing. Very neat!