We're losing our digital history. Can the Internet Archive save it?

0x815@feddit.org · 2 months ago

We're losing our digital history. Can the Internet Archive save it?

Kairos@lemmy.today · 2 months ago

They can all be DMCA’d and gone by tomorrow.

over_clox@lemmy.world · edit-2 2 months ago

Snapshot feature. Apparently you’re not familiar with it.

pyre@lemmy.world · 2 months ago

are you alright dude? have you never heard of DMCA?

over_clox@lemmy.world · 2 months ago

They’d have to find my archives first, absolutely none of them are on the frontend. I don’t use the archive in any normal manner, and the links are effectively randomized so there’s no chance of just guessing any of my links.

I’ve been doing it for years and they haven’t pulled anything down yet.

Kairos@lemmy.today · 2 months ago

Alright, I’ll bite. What am I missing?

over_clox@lemmy.world · 2 months ago

https://web.archive.org/

Look towards the bottom of the page, ‘Save Page Now’

Kairos@lemmy.today · 2 months ago

What?

Yes, I know about the web archive. And I know that you can pull data without being logged in. But 100% of that data can be DMCA’d at any point.

CronyAkatsuki@lemmy.cronyakatsuki.xyz · 2 months ago

Somebody can always just get an offline copy of that data that never hits the internet, so companies won’t know where it is and it can’t be DMCA’d.

thejml@lemm.ee · edit-2 2 months ago

A local copy on a single person’s storage that isn’t available to future researchers isn’t exactly meeting the requirements of this article.

I have a copy of Slashdot from when they turned it pink for April Fools’ Day. Does anyone know that? No. Could someone find it if they wanted to read it? No. Is that helpful for preservation? No. To be helpful I’d have to make it available and searchable. You know what that does? Makes it so it can be DMCA’d.

CronyAkatsuki@lemmy.cronyakatsuki.xyz · 2 months ago

They can always make a torrent of it and share it like that if they are in a country with barely any DMCA-style laws.

T156@lemmy.world · edit-2 2 months ago

Big “if” though, and that would be contingent on the data being desirable enough that other people are willing and able to host it long-term, even before being able to find a country like that and set up a torrent. I’ve a few torrents that are dead now, for example, because people weren’t that interested in keeping a copy of what they pointed to, or because the tracker no longer works.

You’d still need to share the torrent to spread it anyhow, and that runs into the DMCA issue all over again. The Pirate Bay only hosted torrents and magnet links, but it still got shut down for piracy, way back when. “Facilitating pervasive online infringement [of copyright]” is something that can get you shut down, as LimeWire found out.

Kairos@lemmy.today · 2 months ago

Actually no. They make it difficult and “don’t allow” people downloading data from the wayback machine.

over_clox@lemmy.world · 2 months ago

Funny you’d say that. If you manipulate the link and add if_ or fw_ after the date code, you can most certainly download files directly from the wayback machine.

Kairos@lemmy.today · 2 months ago

Oo, can I have an example?

over_clox@lemmy.world · 2 months ago

Wanna watch a trick?

https://tinyurl.com/missingf35

You can follow that link, it’s perfectly safe, and rather funny no less. It links to the archive…

https://web.archive.org/web/20230919001454if_/https://charleston.craigslist.org/avo/d/mount-pleasant-stealth-fighter/7667184419.html

Note the if_ after the date/time code. That bypasses their banner. None of my links are anywhere on the frontend of the archive, you literally have to know every link to find my archives.
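If you’d rather not edit links by hand, here’s a rough Python sketch of the same trick. It only rewrites the URL string and never contacts the archive. The function name `add_modifier` and the regex are my own assumptions about the link layout shown above (prefix, date/time code, optional two-letter modifier such as `if_` or `fw_`, then the original URL), not anything official from the Internet Archive.

```python
import re

# Assumed layout of a Wayback Machine link, based on the example above:
#   https://web.archive.org/web/<date code><optional modifier>/<original URL>
_WAYBACK_RE = re.compile(
    r"(https?://web\.archive\.org/web/)"  # archive prefix
    r"(\d{1,14})"                         # date/time code (up to 14 digits)
    r"([a-z]{2}_)?"                       # optional modifier, e.g. if_ or fw_
    r"/(.+)"                              # the original URL
)


def add_modifier(archive_url: str, modifier: str = "if_") -> str:
    """Return archive_url with the given modifier placed after the date code.

    Any modifier already present is replaced. Hypothetical helper for
    illustration only.
    """
    match = _WAYBACK_RE.fullmatch(archive_url)
    if match is None:
        raise ValueError(f"not a recognizable Wayback link: {archive_url!r}")
    prefix, timestamp, _old_modifier, original = match.groups()
    return f"{prefix}{timestamp}{modifier}/{original}"


if __name__ == "__main__":
    plain = (
        "https://web.archive.org/web/20230919001454/"
        "https://charleston.craigslist.org/avo/d/"
        "mount-pleasant-stealth-fighter/7667184419.html"
    )
    print(add_modifier(plain))         # link with if_ after the date code
    print(add_modifier(plain, "fw_"))  # same link with fw_ instead
```

Passing a link that already has a modifier just swaps it, so you can flip between `if_` and `fw_` on the same capture.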

And most of my archives aren’t even of websites, most of them are direct file downloads of older operating systems and games and stuff. Not like I’m about to share any of those here though.

I’ve been doing that for years and they haven’t found or removed a single thing I’ve archived. If they ever do, well so be it, but none of it is on the frontend, and the links are so obscure that there’s basically zero chance of anyone just randomly guessing them.