cdc psa rule

spujb@lemmy.cafe · 1 year ago

cdc psa rule

gibmiser@lemmy.world · 1 year ago

Get ready to Donate to their legal defense fund

spujb@lemmy.cafe · 1 year ago

you’re right and you should say it but it makes me sad

grue@lemmy.world · 1 year ago

It it long past overdue for the Internet Archive to move to the EU or Switzerland or something.

FundMECFS@lemmy.blahaj.zone · 1 year ago

Yep.

I wish they also could implement a decentralised hosting protocol, though I know currently that technology is in it’s infancy.

Jumuta@sh.itjust.works · 1 year ago

isn’t that just a torrent?

FundMECFS@lemmy.blahaj.zone · 1 year ago

There are different protocols that attempt to work for things like web hosting, but yes, the BitTorrent protocol is a decentralised file sharing protocol.

⛓️‍💥@sh.itjust.works · 1 year ago

Would be best if there were several mirrors in several countries. It’s unfortunately too large to realistically host via crowd sourcing. The best you could do is something ala Storj where fragments are redundantly distributed across various hosts.

Lka1988@lemmy.dbzer0.com · 1 year ago

As long as money still means something after Elon is through with the Treasury…

brucethemoose@lemmy.world · 1 year ago

We are screwed if the Internet Archive goes down, right?

Seems like a huge point of failure for one entity.

kautau@lemmy.world · 1 year ago

Agreed, I think the biggest issue though is just scale. It’s over 100 petabytes of data. Not outside the realm of big cloud providers to mirror, but they don’t really give a shit. It would require some sort of significant distributed software solution for the community to work with. Not impossible, but as far as I know, nobody’s taken up the mantle yet as I think it would need custom software just to begin the solution of how to distribute it as a sharded set of community mirrors, different people just mirroring individual pieces.

Enceladus@lemmy.ca · 1 year ago

HexOS has a plan for shared encrypted data. With the simplicity of installation and management it could take off mainstream as personal NAS are gaining popularity, but its still in early development.

AngryPancake@sh.itjust.works · 1 year ago

Interplanetary File System can do it

MonkderVierte@lemmy.ml · 1 year ago

IPFS, GnuNet?

Swedneck@discuss.tchncs.de · 1 year ago

IPFS is the way to go IMO, it’s so perfect for archival that it pains me that it’s still pretty unknown

the fact that you don’t need any sort of central organization for everyone to help seed data is amazing, no more duplicate torrents splitting seeders, so long as you have identical data the network just figures it out.
If you have the hash for a piece of data you can just set a computer to watch for someone to start seeding it, even if the last time anyone saw the data was decades ago and a dude just found a CD in their recently passed dad’s basement, if that dude seeds it overnight and then their computer explodes, you’ve now downloaded it and it’ll remain available. It’s so fucking good.

Taalnazi@lemmy.world · edit-2 1 year ago

So about 104,857,600 GB? You’d need 105,000 people with 1 TB each to save that. Or…

Assuming you bought 30 TB SSDs, you’d need about 3,500 of those, costing €80 each.

That’d be €280k, but let’s round it to €300k.

If every person spent €960 (or €80 per month), then each person could get 12 of those SSDs. You’d need 8,750 people to do that.

Should be doable if crowdfunded by a community, or if you had some big donor. Then you’d need to connect it.

dil@lemmy.ml · 1 year ago

Looking at diskprices.com, lowest prices for storage are around $8 (used) or $15 (new). I didn’t look too hard, but a 30TB SSD for $80 (~$2.5/TB) seems wrong?

100K TB * $15/TB = $1.5 million

Assuming 100PB is the amount of data, we’d also need redundancy. Idk what best practices would be, but I’ll say 3ish copies, so 300PB total.

So a grand total of ~$5 million.

Which is crazy cheap, all things considered. Like, it would be no problem for a single rich person to handle that.

Hell, subsidize/give away cheap little computers that you just plug power and an Ethernet cable into. Raspberry pi + 4TB drive ($60) + casing would be like… $100? Though I guess you’d need 75K of them, and the cost per TB is pretty bad.

This guy is 20TB for $280: https://a.co/d/17UOtFi

If we stick with $40 of overhead for rpi etc, that’s $320 for 20TB ($16/TB), and we’d need 300PB/(20TB/unit) = 15K units. And at $320 each, all in would be $4.8 million.

The software seems to exist for connecting them all… So idk seems like it would be absolutely feasible? Would be interested to learn if I’m missing a major cost.

Taalnazi@lemmy.world · 1 year ago

For the 30 TB SSD i looked at sites like Luntek.

Jorunn (she/her)@lemmy.blahaj.zone · 1 year ago

well and truly based

kate · edit-2 1 year ago

hi spujb. Only 98gb? I can mirror that 🤷‍♀️

e: https://kate.fail/cdc_2025_01_28/archive.org/download/20250128-cdc-datasets/

fossilesque@mander.xyz · 1 year ago

I suggest also mirroring on https://academictorrents.com/

kate · 1 year ago

sry i dont know what that is but once i have all the data ill post a link here. im hosting in france and i am also outside the us so i will not take down the data at tronald dumps request tyvm.

AngryCommieKender@lemmy.world · edit-2 1 year ago

Use his original last name. ~~Drumph~~ Drumpf. It pisses him off as much as being told that he has baby hands.

His father or grandfather changed it.

TheOakTree@lemm.ee · 1 year ago

I believe it was Drumpf

AngryCommieKender@lemmy.world · 1 year ago

I believe you are correct. Edited.

spicehoarder@lemm.ee · 1 year ago

Incredible 🫡

kate · 1 year ago

posted the link, i think there are a few files missing, not sure why. but the folder reads as 95GB

chiliedogg@lemmy.world · 1 year ago

I’m gonna download it when I get home and put on a few USBs. They won’t be connected to any device and will be stored in safes.

Can’t remote wipe data that’s not connected.

The more backups of important information we have the better.

TheOakTree@lemm.ee · 1 year ago

Based kate

Valmond@lemmy.world · 1 year ago

The best Kate

AllNewTypeFace@leminal.space · 1 year ago

Good thing they’re based far from the US in… oh.

some_guy@lemmy.sdf.org · 1 year ago

I will grab this torrent when I get home and make it a permanent seed, alongside the one outing nazis in Patriot Front.

LemmyFeed@lemmy.dbzer0.com · 1 year ago

Shit good idea, didn’t even know you could do this.

What else should we seed? I’ve got a homelab and am eager to put some storage to use for something like this.

waterSticksToMyBalls@lemmy.world · 1 year ago

Was there a mailing list or other identifying docs in that pf leak or was it just chats and stuff?

comfy@lemmy.ml · 1 year ago

Not sure if it’s the same leak, but if it’s PatriotFail, it’s even got videos.

Watch the marching drill one for a good laugh. https://xcancel.com/alt_uscis/status/1549969687999553539

some_guy@lemmy.sdf.org · 1 year ago

Here’s the Patriot Front link that I’ve been sharing:

Patriot Front Fascist Leak Exposes Nationwide Racist Campaigns

You can download it at the following torrent address:

magnet:?xt=urn:btih:2c87816e4c81990fb25bbca43dd8d578eaa55886&dn=patriotfront&tr=udp%3A%2F%2F9.rarbg.to%3A2920&tr=udp%3A%2F%2Ftracker.opentrackr.org%3A1337&tr=udp%3A%2F%2Fexodus.desync.com%3A6969

Maiq@lemy.lol · 1 year ago

I appreciate you doing this!

some_guy@lemmy.sdf.org · 1 year ago

Glad you said so. I completely forgot that I needed to grab this once home. It is now active. Once it’s done downloading, I’ll keep it seeding forever, since I have no bandwidth cap. Cheers.

guillem@aussie.zone · 1 year ago

deleted by creator

spujb@lemmy.cafe · 1 year ago

from the linked page

Excludes corrupt datasets and data not publicly accessible.

JusticeForPorygon@lemmy.blahaj.zone · 1 year ago

Because the feds didn’t already have it out for IA.

Leaf (she/her)@lemmy.blahaj.zone · 1 year ago

Time to download all of it!

shikitohno@lemm.ee · 1 year ago

Same, especially before the inevitable attacks on the Internet Archive to come. Who knows what nonsense will be in the works to try and get this removed, or the whole project shut down in the coming years.

AdrianTheFrog@lemmy.world · 1 year ago

it sounds like it’s only stuff that was already publicly available tho

vortic@lemmy.world · 1 year ago

Some of the publicly available data is disappearing under the new administration. Most notably information about COVID, long COVID, vaccines, and bird flu is disappearing. Presumably, this data dump contains the missing data.

MutilationWave@lemmy.world · 1 year ago

Importantly they are also removing all mentions of climate change. I imagine they’ll be deleting data on that front as well.

Twista713@lemmy.world · 1 year ago

That happened last admin, so def wouldn’t be a shocker.

AdrianTheFrog@lemmy.world · 1 year ago

Oh I hadn’t heard of that, I thought it was just stopping new data

gravitas_deficiency@sh.itjust.works · 1 year ago

Nope - they’re literally destroying data if it doesn’t align with their super regressive views on sexual identity stuff, amongst other things

grue@lemmy.world · 1 year ago

Actual image of DOGE employee deleting CDC data

spoiler

It’s funny because the Nazis themselves (the 1930s ones, not the 2020s ones) also started their book burning on literally the exact same topic.

spoiler

And by “funny” I mean “not funny at all in the slightest, holy fucking shit!”

spoiler

It’s kinda neat that you can nest spoiler tags, by the way.

spujb@lemmy.cafe · 1 year ago

key word was

jherazob@beehaw.org · 1 year ago

Okay, given how things are going, do we know if the Internet Archive has a backup plan for when these fucks attack it in earnest?

meowmeowbeanz@sh.itjust.works · 1 year ago

🚨 BIG NEWS Y’ALL! 🚨

Someone just saved ALL the CDC’s public data before it could disappear! 🦅

What’s the Deal?

Some mystery hero downloaded everything from the CDC’s website (that’s 98 GIGABYTES of health info!) and uploaded it to the Internet Archive on Jan 28th. Think of it like making a backup copy of your phone before it breaks!

Why Should You Care?

This is YOUR health data - stuff about vaccines, diseases, and public health that your tax dollars paid for! 🏥
Once this info is gone from CDC’s website, it could be really hard for your doctor to get important updates
Researchers need this to keep studying ways to keep Americans healthy 💪

What’s Next?

Smart folks at places like Harvard are making sure this data stays safe by keeping copies. It’s like having multiple backups of your family photos - can’t be too careful!

Remember folks: Knowledge is power, and someone just made sure we didn’t lose a whole bunch of it! 🎯

#SaveTheData #PublicHealth #AmericanRight2Know

Source: Internet Archive upload by anonymous user on Jan 28, 2025 Post by Ed Summers (@edsu@social.coop) - Feb 3, 2025

spujb@lemmy.cafe · 1 year ago

As a reminder, AI generated content is against the rules in this community—see the sidebar. I appreciate your instinct to bring some quality content to this space, but let’s please keep in mind that genuine interaction with diverse voices is what makes this community beautiful. :)

My reasoning:

You have personally admitted to writing AI comments in the past: https://sh.itjust.works/comment/16482371
Heavy use of markdown headings, bullets, and section dividers is a common pattern in LLM output
Use of “it’s like” or “it’s about” phrases as the conclusion to a paragraph are very common in LLM models like ChatGPT
Verbatim replication of content from my original post that is common in LLM output and highly indicates an LLM was instructed to create something based on the text of the original post
Use of 🎯 emoji does not match context
“100% AI generated” response on multiple AI detection websites (GPTZero, Quillbot)

Any single one of these facts would not lead me to comment, but with all of it combined it makes a pretty strong case. Thank you for your contribution to this community but please let’s keep it genuine in the future! We love and appreciate the real you :)

spujb@lemmy.cafe · edit-2 1 year ago

Removed by mod

meowmeowbeanz@sh.itjust.works · 1 year ago

Removed by mod

spujb@lemmy.cafe · edit-2 1 year ago

Removed by mod

RangerJosey@lemmy.ml · 1 year ago

Removed by mod

meowmeowbeanz@sh.itjust.works · 1 year ago

Removed by mod

Pika@sh.itjust.works · 1 year ago

it’s weird that I learned of this through this community and not a security or health community. something to look into tomorrow

LaunchesKayaks@lemmy.world · 1 year ago

How would you recommend someone go about archiving important parts of the IA? Just external drives?

paris@lemmy.blahaj.zone · 1 year ago

The Internet Archive is, and I really want to emphasize this, Fucking Huge. If you want to help archive it, every upload has an associated torrent you can download and help seed. Torrenting itself isn’t illegal, only torrenting illegal stuff like copyrighted movies. You can buy a relatively cheap refurbished HDD of whatever size you want, set up qBittorrent, and torrent the uploads that you want to make sure are available even if the Internet Archive has to take them down or has a critical data loss failure.

LaunchesKayaks@lemmy.world · 1 year ago

Thank you so much for the advice! I want to preserve important documents like the bill of rights and the constitution, as well as sexual education material, especially stuff pertaining to women and reproductive health. Also banned books. Things the facists are trying to purge and things that are important to me.

paris@lemmy.blahaj.zone · 1 year ago

In the case of books, Anna’s Archive is looking for help seeding their enormous collection of books and research papers. Consider reading that page and helping them as well!

garbagebagel@lemmy.world · 1 year ago

I know what you’re talking about is important and a necessary comment but something about your comment hit me hard. It’s just so absolutely insane that it has to be said/done.

LaunchesKayaks@lemmy.world · 1 year ago

Ikr? It’s wild that all of this is happening.

iheartneopets@lemm.ee · 1 year ago

If anyone is looking for something specific to preserve, consider Our Bodies, Ourselves. It’s a seminal feminist work that seeks to educate women on their bodies. It’s extremely comprehensive, thicker than most textbooks.

spujb@lemmy.cafe · 1 year ago

i’m not smart enough for this but maybe look to communities like r/DataHoarder to get started

LiveLM@lemmy.zip · 1 year ago

Inb4 it gets DDoS’d again

P4ulin_Kbana@lemmy.eco.br · 1 year ago

What’s CDC?

Enkrod@feddit.org · 1 year ago

Center for Disease Control

P4ulin_Kbana@lemmy.eco.br · 1 year ago

What a weird name…

NostraDavid@programming.dev · 1 year ago

It’s where America gathers its telepaths to control COVID :3

chuymatt@startrek.website · 1 year ago

I mean, not really. It is a research and policy center for controlling diseases.