Zerush@lemmy.ml to Technology@lemmy.ml · 1 month agoAgentic Misalignment: How LLMs could be insider threatswww.anthropic.comexternal-linkmessage-square2linkfedilinkarrow-up19arrow-down13cross-posted to: technology@lemmy.ziptechnology@lemmy.worldhackernews@lemmy.bestiver.se
arrow-up16arrow-down1external-linkAgentic Misalignment: How LLMs could be insider threatswww.anthropic.comZerush@lemmy.ml to Technology@lemmy.ml · 1 month agomessage-square2linkfedilinkcross-posted to: technology@lemmy.ziptechnology@lemmy.worldhackernews@lemmy.bestiver.se
minus-squarecaptainastronaut@seattlelunarsociety.orglinkfedilinkEnglisharrow-up2·1 month agoI love how these people think “we told it not to break the rules” and think somehow the stochastic parrot has understood them and will obey.
I love how these people think “we told it not to break the rules” and think somehow the stochastic parrot has understood them and will obey.