An anticapitalist tech blog. Embrace the technology that liberates us. Smash that which does not.

  • hypnicjerk@lemmy.world
    link
    fedilink
    English
    arrow-up
    11
    ·
    8 months ago

    are there copyrighted texts that have such distinctive patterns that they would be particularly easy to spot in an LLM’s output? say, would replacing every comment with a page from moby dick or wuthering heights be more or less infringing than using harry potter? hypothetically.

    • dual_sport_dork 🐧🗡️@lemmy.world
      link
      fedilink
      English
      arrow-up
      16
      ·
      8 months ago

      Well, I’m pretty sure Moby Dick is in the public domain by now. If I were you I’d go for something from Disney which is mathematically certain to get somebody sued although I can’t predict who.