Critical thinking (image post, slrpnk.net)
ByteOnBikes@slrpnk.net to Microblog Memes@lemmy.world · English · 2 days ago · 211 comments

TheTechnician27@lemmy.world · 2 days ago

> It’s a two-pass solution, but it makes it a lot more reliable.

So your technique to “make it a lot more reliable” is to ask an LLM a question, then run the LLM’s answer through an equally unreliable LLM to “verify” the answer?

We’re so doomed.

Apepollo11@lemmy.world · 2 days ago (edited)

Give it a try.

The key is in the different prompts. I don’t think I should really have to explain this, but different prompts produce different results.

Ask it to create something, it creates something.

Ask it to check something, it checks something.

Is it flawless? No. But it’s pretty reliable.

It’s literally free to try it now, using ChatGPT.

TheTechnician27@lemmy.world · 2 days ago

> I don’t think I should really have to explain this, but different prompts produce different results.

Apepollo11@lemmy.world · 2 days ago

Hey, maybe you do.

But I’m not arguing anything contentious here. Everything I’ve said is easily testable and verifiable.
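
For anyone who wants to test the claim rather than argue about it, the two-pass idea from this thread (one prompt to create, a different prompt to check) can be sketched roughly as below. This is only an illustration under assumptions: `two_pass`, the `llm` callable, the prompt wording, and the PASS/FAIL convention are all hypothetical and not tied to any real API. Note that the checker is the same kind of model as the generator, so a PASS is a second opinion and not a proof of correctness.

```python
from typing import Callable, Tuple

# Hypothetical type: any function that takes a prompt string and returns
# the model's reply as text. Plug in whatever client you actually use.
LLM = Callable[[str], str]


def two_pass(question: str, llm: LLM, max_attempts: int = 3) -> Tuple[str, bool]:
    """Generate an answer, then ask the model to check it with a different prompt.

    Returns the last answer and whether the checking pass accepted it.
    """
    answer = ""
    for _ in range(max_attempts):
        # Pass 1: "ask it to create something".
        answer = llm(f"Answer the following question.\n\nQuestion: {question}")

        # Pass 2: "ask it to check something", using a different prompt.
        verdict = llm(
            "Check the answer below for errors. "
            "Reply with exactly PASS or FAIL.\n\n"
            f"Question: {question}\nAnswer: {answer}"
        )
        if verdict.strip().upper().startswith("PASS"):
            return answer, True

    # No attempt was accepted by the checker; report that honestly.
    return answer, False
```

To measure whether this is actually "a lot more reliable", one option is to run it over questions with known answers and compare how often the accepted answers are wrong against a single-pass baseline.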