Do you use anything to archive content for yourself or others? (research, videos, articles, and anything that could be lost to time or censorship)

Otter@lemmy.ca · edit-2 16 days ago

Do you use anything to archive content for yourself or others? (research, videos, articles, and anything that could be lost to time or censorship)

Otter@lemmy.ca · 16 days ago

One option that I’ve heard of in the past

ArchiveBox is a powerful, self-hosted internet archiving solution to collect, save, and view websites offline.

Krafting@lemmy.world · 15 days ago

I archive youtube videos that I like with TubeArchivist, I have a playlist for random videos i’d like to keep, and also subscribe to some of my favourite creator so I can keeptheir videos, even when I’m offline

vividspecter@lemm.ee · 15 days ago

I’ll add pinchflat as an alternative with the same aim.

Ludrol@szmer.info · 15 days ago

https://wiki.archiveteam.org/

they have an automatic VM that dowloads stuff in distributed manner and uploads to archive.org

yasser_kaddoura@lemmy.world · edit-2 15 days ago

I have a script that archives to:

I used to solely depend on archive.org, but after the recent attacks, I expanded my options.

Script: https://gist.github.com/YasserKa/9a02bc50e75e7239f6f0c8f04fe4cfb1

EDIT: Added script. Note that the script doesn’t include archiving to archivebox, since its API isn’t available in stable verison yet. You can add a function depending on your setup. Personally, I am depending on Caddy and docker, so I am using caddy module [1] to execute commands with this in my Caddyfile:

route /add {
	@params query url=*
	exec docker exec --user=archivebox archivebox archivebox add {http.request.uri.query.url} {
		timeout 0
	}
}

[1] https://github.com/abiosoft/caddy-exec

WhyJiffie@sh.itjust.works · 15 days ago

isn’t this prone to a

 || rm -rf /

or something similar at the end of the URL?

if you can docker exec, you have a lot of privileges already, so be sure to make sure this is not a danger