this post was submitted on 07 Aug 2024
11 points (100.0% liked)

datahoarder

6603 readers
1 users here now

Who are we?

We are digital librarians. Among us are represented the various reasons to keep data -- legal requirements, competitive requirements, uncertainty of permanence of cloud services, distaste for transmitting your data externally (e.g. government or corporate espionage), cultural and familial archivists, internet collapse preppers, and people who do it themselves so they're sure it's done right. Everyone has their reasons for curating the data they have decided to keep (either forever or For A Damn Long Time). Along the way we have sought out like-minded individuals to exchange strategies, war stories, and cautionary tales of failures.

We are one. We are legion. And we're trying really hard not to forget.

-- 5-4-3-2-1-bang from this thread

founded 4 years ago
MODERATORS
 

Hey guys, so it seems that Linkwarden isn't as good as I was hoping, since some websites will throw up a cookie popup or some other screen that basically prevents the capture.

Firefox Screenshot seems to work well, but it saves a PNG, which isn't really text searchable.

FF's "save page as..." feature seems to break things when viewing them back.

Save to PDF is another option, and that seems to be decent.

I'm not looking to copy entire websites, but I like to save web pages for later reference (i.e. instructions/specs).

I use Synology Note Station, but they don't have a web clipper for Firefox...

I'm fine with using a folder structure to store files, despite not being totally ideal when compared to Linkwarden.

Does anyone have any other suggestions that perhaps I've missed? Nothing too complicated... ideally, as simple as a button click would be great.

top 12 comments
sorted by: hot top controversial new old
[–] davel@lemmy.ml 7 points 1 month ago (1 children)

Are you familiar with the SingleFile browser extension? “It helps you to save a complete web page into a single HTML file.” This includes images.

[–] Showroom7561@lemmy.ca 2 points 1 month ago (2 children)

This seems to be exactly what I'm looking for!! Thank you!

[–] ReversalHatchery@beehaw.org 2 points 1 month ago

uBO could help with cleaning up the site

[–] ReversalHatchery@beehaw.org 1 points 1 month ago

Oh and ViolentMonkey for when you want to do something automatically on the site before saving it. A little JS knowledge is required for that, but there's plenty of userscripts and multiple "stores" of them to get some inspiration

[–] smpl@discuss.tchncs.de 2 points 1 month ago

I use WebScrapBook on LibreWolf. It let you choose between saving to single html, maff and htz. I prefer the htz format where resources are stored next to the page in an archive. It can also be viewed natively by Firefox-based browsers using the URI scheme jar:file:///home/me/somepage.htz!/index.html.

[–] fmstrat@lemmy.nowsci.com 2 points 1 month ago* (last edited 1 month ago)

I just learned about this FOSS option: https://linkwarden.app/

Looks really good.

Edit: Oops. Something to watch: https://github.com/linkwarden/linkwarden/issues/138

[–] far_university190 1 points 1 month ago

Save Page WE extension, i think better than singlefile because much more configurable.

[–] some_guy@lemmy.sdf.org 1 points 1 month ago
[–] thingsiplay@beehaw.org 1 points 1 month ago (1 children)

Save to PDF is another option, and that seems to be decent.

If that is a decent option, why are you searching for another tool?

[–] Showroom7561@lemmy.ca 2 points 1 month ago (1 children)

There's always something better 😂

[–] thingsiplay@beehaw.org 1 points 1 month ago

Fair enough. Nothing against research and improvement. :-)

[–] ReversalHatchery@beehaw.org 1 points 1 month ago

OP has started with that in the first sentence.