Gemini seems to have "solved" my duck river crossing, lol.

diz@awful.systems · 8 months ago

Gemini seems to have "solved" my duck river crossing, lol.

HedyL@awful.systems · 8 months ago

It’s also worth noting that your new variation of this “puzzle” may be the first one that describes a real-world use case. This kind of problem is probably being solved all over the world all the time (with boats, cars and many other means of transportation). Many people who don’t know any logic puzzles at all would come up with the right answer straight away. Of course, AI also fails at this because it generates its answers from training data, where physical reality doesn’t exist.

diz@awful.systems · 8 months ago

Yeah, I think the best examples are everyday problems that people solve all the time but never explicitly write out step-by-step solutions for, or at least not in puzzle-answer form.

It’s not even a novel problem at all, I’m sure there are even plenty of descriptions of solutions to it as part of stories and such. Just not as “logic puzzles”, because it is so trivial.

What really annoys me is when they claim high performance on benchmarks consisting of fairly difficult problems. This is basically fraud, since they know full well it is still entirely “knowledge” reliant, and even take steps to augment it with generated problems and solutions.

I guess the big sell is that it could use bits and pieces of logic gleaned from other solutions to solve a “new” problem. Except it cannot.

diz@awful.systems · edit-2 8 months ago

It’s Google though, if nobody uses their shit they just put it inside their search.

It’s only gonna go away when they run out of cash.

edit: whoops replied to the wrong comment