For me it would be a full copy of wikipedia, an offline copy of some maps of where I live, some linux ISO’s, and a lot of entertainment media.
archive.org, which contains wikipedia too. Checkmate!
What’s the file size on that?
The file size doesnt matter
Imagine how much money you could make selling access to specific websites
You should start downloading it right now in case the internet goes down
The amount of data required for that would be immense. From what I’ve been able to find it’s over 200 petabytes.
Pff, rookie numbers. I have a 3k zettabytes data center at home
Would you like a roommate?
in my basement, sure
Today I learned I’m unintentionally preparing for the Internet apocalypse.
Latest llama version and instructions for setting it up
I do have a copy of wikipedia and I should be good on entertainment media. I guess I should expand the emergency porn stash.
Just out of curiosity, how much space/effort was that to set up? (Yes, I know I can probably google up like a bajillion resources on this exact thing, but I’m a weirdo and am attempting to bring the (non-toxic/shitty) social back to social media)
I’ve been considering setting myself up a little NAS server since I finally dumped Spotify and am considering doing the same with video streaming too (besides Tubi, anyway), but having one just for mp3/light video streaming seems like a bit of a waste and having local repos of useful sites might be a fun side project to help justify it to myself lol
I’m using emby for music, audiobooks, tv, and movies. You can also do picture backup/sync if you want. I am running it in a docker on my unraid server.
For the Wikipedia part, it’s surprisingly simple. I just used Kiwix and grabbed a copy, it’s only about 100gb or so. You can also use it to get offline copies of other stuff, like Project Gutenberg.
I went with a Synology NAS (I know, the foss crowd will probably crucify me) which really keeps the setup effort to a minimum. You put in the HDDs, setup your pool/volume, install Plex (or jellyfin), upload your media and you’re basically good to go.
Netflix.
More pirated TV
I could honestly just re-watch most of my shows until the end of time.
I will literally never get tired of Bee and Puppycat.
A full copy of Stack Overflow. Otherwise, we would not know how to get the Internet working again.
Somethings are better when done from scratch.
A copy of scratch then.
FROM scratch AS internet # TODO
If you’d download the whole wikipedia be sure to download the whole commets section for each article to have a perspective on discussions on conflicting reasons for edits. Also include all the wiki media materials for all of the public domain literature, project gutenberg, entire archive.org, a good offline OS to be able to consume all of the information and you’re golden
entire archive.org
How much would it cost to store like 100 petabytes for (conservatively) 40 years?
A lot
You mean electricity bills for powering the storage? I guess buying 100pb worth of storage disks would be pretty expensive enough but since it’s an archive there is no need to keep it powered 24/7, just turn them on only when you need to. It’s just a hypothetical situation anyway, it’s a thing I wish to have access to; only an experienced sysadmin can actually maintain such great archive or its copy/backup
Let’s assume you have all hard drives and in a setup with absolutely zero redundancy in case a drive fails.
We’re using the Seagate Exos X24 (24TB) drive which is roughly $700 each brand new.
You’ll need 4167 of them to store 100PB. Which puts you at $2,916,900 just for the drives.
Let’s assume you already have the enclosures, racks, and servers for a small datacenter ready to go.
A drive can use 4-9w of power when spinning so assuming all drives are active (to ensure quick data access and data repair) that’ll be roughly 27086w for all the drives at 6.5w per drive. Every month (30 days), that is 19502kWh of electricity used. 40 years is roughly 349,680 hours so that comes out to around 9,471,433kWh used.
Assuming you get some damn good electricity rates at $0.12USD per kWh, it’ll cost $1,136,572 to run just the drives.
So in total, assuming you already have a datacenter with the capacity to install all the drives that runs on absolutely zero power, you’ll spend roughly $4,053,472 over the course of 40 years.
There is a much cheaper way that doesn’t use hard drives. It uses magnetic tapes, LTO-9 tapes specifically.
Each LTO-9 tape cassette can hold up to 45TB of data (compression is used to store it on the raw 18TB).
An LTO-9 tape drive can cost $10,000. Assuming you get the full 45TB per tape, you’ll need 2223 LTO-9 tape cassettes to store 100PB. Assuming you buy in bulk, you can get each tape cassette for $150 which puts you at $333,450 for the tapes.
Since the tapes don’t use power when not in use, this concludes the total cost. None of this accounts for storing all 2223 tapes or maintenance to ensure data is still intact on them but this comes out to $343,450 in total to store 100PB using magnetic tapes. While the cost is much cheaper, it’s much harder to access the data as it’s not immediately available since you have to fish out the drive you need and plop it into the tape drive then wait for it to read.
Arcane Season 1 and 2.
Honestly I’d probably just give up on technology entirely. Become a hermit carpenter or something
e621 dump
based
Not one, but two references to e621 in this thread. And neither are from users on the furry instances. Much to think about.
okay, you caught me, my .world profile was supposed to be my non-furry lemmy account.
I guess a lot of music and movies from a pirate site. I’d spend more time at the library listening to my music.
Grab the whole world, not just where you live, it’s not too much space
All the Debian ISO images and all of the documentation on everything.
This way. I should have all the stable software I could wish for and the instructions on how to use them.
don’t forget the source code!
All the extension office university data on plants, agriculture, etc. It’s invaluable info for anyone who grows their own food and deals with bees in relation to that food growth.
Is there an archive of this kind of data anywhere? I’d love to store this. I’ve already got a few wikis, including Wikipedia itself, but I’d love more