AI agents wrong ~70% of time: Carnegie Mellon study (www.theregister.com)
Posted by Jaden Norman@lemmy.world to Technology@lemmy.world · English · 15 hours ago · 87 comments · cross-posted to: technology@lemmy.zip, technology@beehaw.org
Punkie@lemmy.world · English · 12 hours ago
I’d compare LLMs to a junior executive. They probably get the basic stuff right, but check and verify anything important or complicated. Break tasks down into easier steps.