Dropsitenews published a list of websites Facebook uses to train its AI on. Multiple Lemmy instances are on the list as noticed by user BlueAEther

Hexbear is on there too. Also Facebook is very interested in people uploading their massive dongs to lemmynsfw.

Full article here.

Link to the full leaked list download: Meta leaked list pdf

  • FaceDeer@fedia.io
    link
    fedilink
    arrow-up
    2
    ·
    1 month ago

    It means that people should not be surprised that AI companies are training on their data. They’re deliberately putting their content out into the world where AI trainers can read it in an uncontrolled manner, and reading it is all that’s needed for AI training.

    There have already been a number of lawsuits about AI training and thus far nothing seems to indicate that it’s something that copyright restricts. If you know of any cases that have established otherwise I suppose feel free to link them, but until then there’s nothing illegal going on here.

    If you just want to be angry about it then I suppose there’s nothing stopping you on that count. Go ahead.

    • Feyd@programming.dev
      link
      fedilink
      arrow-up
      1
      ·
      1 month ago

      It isn’t about what is currently legal under the law! People can discuss how they would prefer society works, and should! This is what was happening in this thread and that’s why you trying to shove your “well actually this system is federated and it’s not illegal” is pointless and unwanted. You’re not bringing anything to the conversation because you can’t even tell what the conversation is about, apparently.

      • FaceDeer@fedia.io
        link
        fedilink
        arrow-up
        1
        ·
        1 month ago

        There was someone else in this thread responding to me that didn’t understand how ActivityPub or the law worked, my explanations certainly were not “pointless” for them. They could have learned some things from what I said. Whether they did or not, who knows, that’s up to them.

        You’re not bringing anything to the conversation because you can’t even tell what the conversation is about, apparently.

        You don’t get to decide what the conversation is about, it’s a collaborative thing. All that OP opened with is “look, Facebook is training AIs off of Fediverse content” and I responded to that with my own take on what this meant. My comments have been on-topic and haven’t broken any instance or community rules that I can see.

        Feel free to not respond to my comments, or even to block me if you really prefer not to see what I have to say. User blocks are better implemented on the Fediverse than back on Reddit, they don’t wreck the flow of conversation for everyone else so they’re a better option here.

        • Feyd@programming.dev
          link
          fedilink
          arrow-up
          1
          ·
          1 month ago

          You opened by saying that somehow, using a federated social media site naturally means someone also supports using that site to train AI. My whole point in this entire thread is that you are drawing a false conclusion, clearly, because there are plenty of people that clearly don’t agree.

          You just spew the same unrelated junk over and over because you can’t back up your ridiculous assertion.

          I don’t see any point in continuing because you’re clearly tripling down, but you really should actually respond to what is actually being said to you if you’re going to respond at all.

          • FaceDeer@fedia.io
            link
            fedilink
            arrow-up
            1
            ·
            1 month ago

            I was being sarcastic. It just boggles me that people are surprised by this, it should be obvious that the Fediverse is an even better source of training material (in practical terms if not in volume) than Reddit and such because there are no API restrictions or big corporations willing to throw lawsuits around.

            If you don’t want your posts and comments to be used to train AI then posting on the Fediverse is the very last thing you should be doing.