• spujb@lemmy.cafe (OP)
      arrow-up
      1
      ·
      38 minutes ago

      i’m not smart enough for this but maybe look to communities like r/DataHoarder to get started

  • kateA
    English
    arrow-up
    39
    ·
    9 hours ago

    hi spujb. Only 98gb? I can mirror that 🤷‍♀️

  • jherazob@beehaw.org
    English
    arrow-up
    20
    ·
    8 hours ago

    Okay, given how things are going, do we know if the Internet Archive has a backup plan for when these fucks attack it in earnest?

  • brucethemoose@lemmy.world
    arrow-up
    91
    ·
    edit-2
    14 hours ago

    We are screwed if the Internet Archive goes down, right?

    Seems like a huge point of failure for one entity.

    • kautau@lemmy.world
      arrow-up
      42
      ·
      12 hours ago

      Agreed, though I think the biggest issue is just scale. It’s over 100 petabytes of data. That’s not outside the realm of what big cloud providers could mirror, but they don’t really give a shit. The community would need some sort of significant distributed software solution to work with. Not impossible, but as far as I know nobody’s taken up the mantle yet. I think it would need custom software just to begin solving how to distribute the archive as a sharded set of community mirrors, with different people each mirroring individual pieces.
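
To make the "sharded set of community mirrors" idea concrete, here is a minimal sketch of one way to decide who holds which piece without a central coordinator. It uses rendezvous hashing, which is my choice for illustration and not something any existing archive project is known to use. The mirror names, the item IDs, and the `assign_mirrors` function are all hypothetical.

```python
import hashlib


def assign_mirrors(item_id: str, mirrors: list[str], replicas: int = 3) -> list[str]:
    """Pick `replicas` mirrors for one item via rendezvous hashing.

    Every participant can compute the same answer locally from the
    item ID and the list of known mirrors (both hypothetical here).
    """
    def score(mirror: str) -> int:
        # Hash the (mirror, item) pair; the highest scores win the item.
        digest = hashlib.sha256(f"{mirror}:{item_id}".encode()).digest()
        return int.from_bytes(digest[:8], "big")

    return sorted(mirrors, key=score, reverse=True)[:replicas]


if __name__ == "__main__":
    volunteers = ["alice-nas", "bob-homelab", "carol-vps", "dave-pi", "erin-rack"]
    for item in ["dataset-0001", "dataset-0002"]:
        print(item, "->", assign_mirrors(item, volunteers))
```

The useful property is that when a volunteer who was not holding an item drops out, that item's assignment does not change, so only the pieces the departed mirror actually held need to be copied again.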

        • Swedneck@discuss.tchncs.de
          arrow-up
          9
          ·
          4 hours ago

          IPFS is the way to go IMO, it’s so perfect for archival that it pains me that it’s still pretty unknown

          the fact that you don’t need any sort of central organization for everyone to help seed data is amazing. No more duplicate torrents splitting seeders: so long as you have identical data, the network just figures it out.
          If you have the hash for a piece of data, you can just set a computer to watch for someone to start seeding it, even if the last time anyone saw the data was decades ago and a dude just found a CD in their recently passed dad’s basement. If that dude seeds it overnight and then their computer explodes, you’ve already downloaded it and it’ll remain available. It’s so fucking good.
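
A rough sketch of why identical data "just figures it out": if the address of a blob is derived from its bytes, two people who have never coordinated end up publishing under the same address. This is a simplified illustration using a plain SHA-256 hex digest. Real IPFS addresses are CIDs built on multihash, and large files are chunked first, so actual addresses will not match these. The `ContentStore` class is hypothetical.

```python
import hashlib
from typing import Optional


class ContentStore:
    """Toy content-addressed store: the key is the hash of the bytes."""

    def __init__(self) -> None:
        self._blocks: dict[str, bytes] = {}

    def put(self, data: bytes) -> str:
        # The address depends only on the content, not on who stores it.
        address = hashlib.sha256(data).hexdigest()
        self._blocks[address] = data
        return address

    def get(self, address: str) -> Optional[bytes]:
        data = self._blocks.get(address)
        # Anyone can verify a block by re-hashing it against the address.
        if data is not None and hashlib.sha256(data).hexdigest() != address:
            raise ValueError("stored block does not match its address")
        return data


if __name__ == "__main__":
    old_cd = ContentStore()      # the dude with the CD
    my_machine = ContentStore()  # someone who only kept the hash
    rip = b"contents of a long-lost dataset"
    wanted = hashlib.sha256(rip).hexdigest()
    assert old_cd.put(rip) == wanted   # same bytes, same address
    my_machine.put(old_cd.get(wanted))
    assert my_machine.get(wanted) == rip
```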

      • Enceladus@lemmy.ca
        arrow-up
        11
        arrow-down
        1
        ·
        12 hours ago

        HexOS has a plan for shared encrypted data. With its simple installation and management, it could take off in the mainstream as personal NAS devices gain popularity, but it’s still in early development.

    • spujb@lemmy.cafe (OP)
      arrow-up
      45
      ·
      edit-2
      14 hours ago

      from the linked page

      Excludes corrupt datasets and data not publicly accessible.

    • grue@lemmy.world
      English
      arrow-up
      48
      ·
      14 hours ago

      It is long overdue for the Internet Archive to move to the EU or Switzerland or something.

      • ⛓️‍💥@sh.itjust.works
        English
        arrow-up
        2
        ·
        edit-2
        3 hours ago

        Would be best if there were several mirrors in several countries. It’s unfortunately too large to realistically host via crowdsourcing. The best you could do is something à la Storj, where fragments are redundantly distributed across various hosts.
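
For a feel of what "fragments redundantly distributed" buys you, here is a toy sketch: split the data into k fragments plus one XOR parity fragment, hand each to a different host, and rebuild any single lost fragment from the rest. This is a deliberate simplification. Storj uses Reed-Solomon erasure coding, which survives many simultaneous losses, and this sketch survives only one. Both function names are hypothetical.

```python
from typing import Optional


def _xor(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))


def split_with_parity(data: bytes, k: int) -> list[bytes]:
    """Split data into k equal fragments and append one XOR parity fragment."""
    size = -(-len(data) // k)                  # ceiling division
    padded = data.ljust(size * k, b"\0")       # zero-pad the final fragment
    fragments = [padded[i * size:(i + 1) * size] for i in range(k)]
    parity = bytes(size)
    for fragment in fragments:
        parity = _xor(parity, fragment)
    return fragments + [parity]


def recover(fragments: list[Optional[bytes]]) -> list[bytes]:
    """Rebuild at most one missing fragment (marked None) from the others."""
    missing = [i for i, f in enumerate(fragments) if f is None]
    if len(missing) > 1:
        raise ValueError("XOR parity can only repair a single lost fragment")
    repaired = list(fragments)
    if missing:
        present = [f for f in fragments if f is not None]
        rebuilt = bytes(len(present[0]))
        for fragment in present:
            rebuilt = _xor(rebuilt, fragment)
        repaired[missing[0]] = rebuilt
    return repaired


if __name__ == "__main__":
    pieces = split_with_parity(b"hello world!", 3)  # 3 data hosts + 1 parity host
    pieces[1] = None                                # one host disappears
    print(b"".join(recover(pieces)[:3]))
```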

    • shikitohno@lemm.ee
      arrow-up
      2
      ·
      8 hours ago

      Same, especially before the inevitable attacks on the Internet Archive to come. Who knows what nonsense will be in the works to try and get this removed, or the whole project shut down in the coming years.