• NeatNit@discuss.tchncs.de
    link
    fedilink
    arrow-up
    157
    ·
    1 month ago

    Love this webcomic. Original source: https://www.peppercarrot.com/en/miniFantasyTheater/014.html

    It’s an open-source comic. The Krita project with all the original layers is free to download, and the SVG and text for the speech bubbles is hosted on git so it can be translated to many other languages. I’ve never seen a webcomic do this before, it’s really neat. And of course, the artwork is just beautiful!

    It’s an awesome comic.

    • NeatNit@discuss.tchncs.de
      link
      fedilink
      arrow-up
      70
      arrow-down
      1
      ·
      1 month ago

      Well, sort of. They’re not for secutiry, that’s for sure. They were originally about making it harder for automated bot requests to go through and overload the server. ReCAPTCHA then started turning it around to make OCR better using machine learning, which is commonly agreed to be a Good Thing since it helped digitize old books and things like that. But of course, this in turn made it possible for bots to get past the CAPTCHA, and everything spiraled from there.

      At some point everyone kind of forgot the real point of a CAPTCHA, and it’s now much more of a free training data generator and much less of an obstacle for bots. But it still can prevent complete rookies from making thousands of requests per second with a simple python script, so it does serve a little bit of that original purpose.

      • ulterno@programming.dev
        link
        fedilink
        English
        arrow-up
        26
        arrow-down
        1
        ·
        1 month ago

        commonly agreed to be a Good Thing

        So, did they open access their trained model weights?

        it still can prevent complete rookies from making thousands of requests per second with a simple python script

        So now you can know that if you are getting DOS’ed, it is actually malicious.

        • edinbruh@feddit.it
          link
          fedilink
          English
          arrow-up
          11
          ·
          1 month ago

          did they open access their model weights?

          In that instance it wasn’t really training, it was crowdsourcing the transcription. Rechapta would pull out a word from their book archive that the OCR failed to recognise, and if many people identified it as the same word, it would be archived. Now that rechapta has been purchased by Google, the archive and the transcriptions are available on Google books.

          They stopped doing this once ai became more effective than rechapta for book transcriptions.

          Modern chapta actually is about training models. But old, classic rechapta was really just about book transcriptions, and those are available.

    • WanderingThoughts@europe.pub
      link
      fedilink
      arrow-up
      16
      ·
      1 month ago

      The last one is popping up as age verification. And it’s done with a camera now because otherwise anyone just sends in an AI generated picture of a face, and people don’t trust companies with government id.