ylai@lemmy.ml to AI@lemmy.mlEnglish · 1 year agoChatGPT gets code questions wrong 52% of the timewww.theregister.comexternal-linkmessage-square7fedilinkarrow-up174arrow-down12
arrow-up172arrow-down1external-linkChatGPT gets code questions wrong 52% of the timewww.theregister.comylai@lemmy.ml to AI@lemmy.mlEnglish · 1 year agomessage-square7fedilink
minus-squareKuvwert@lemm.eelinkfedilinkarrow-up0·1 year ago52% In the first year is pretty cool, excited to see how it will evolve.
minus-squareSirGolan@lemmy.sdf.orglinkfedilinkarrow-up1arrow-down1·1 year agoGPT4 with reflexion prompting gets 90% correct (for HumanEval coding benchmark). The paper this is based on is misleading at best.
52% In the first year is pretty cool, excited to see how it will evolve.
GPT4 with reflexion prompting gets 90% correct (for HumanEval coding benchmark). The paper this is based on is misleading at best.