In this thread it was noted how an AI badly got an answer wrong (and I was the one who posted its wrong answer). That wrong answer was a while ago and I tried it again and the answer was spot on. Some claim AI is unreliable enough that it is mostly worthless.
This is me…not saying anyone else said this but I think I am in the ballpark of the skeptics with this (correct me if I am wrong): AI may get 9 of 10 right but a 10% error rate is still too unreliable to be used in a serious manner.
I make no claim AI gives perfect answers. Indeed, we know it does not. But how bad is it? Let’s see!
A few rules:
-
The question needs to be answerable. If it gave us the answers to The Goldbach conjecture or the Riemann hypothesis which of us gets the Fields Medal? (Me…definitely me)
-
Kinda along the lines of #1…don’t ask what Busy Beaver 20 is (I think BB5 is the biggest). Maybe it is answerable in principle but not before the heat death of the universe.
-
If it is a math question (that has an answer) I expect the poster to know the answer and understand the question asked so they can assess how the AI did. I certainly cannot do that.
-
Philosophy questions are ok but they do not really have an answer so…not sure but go for it…see what happens.
-
Note that my AI’s database stops in January 2025. So, anything current it will not know. Other AIs may be more recent but this is the one I am using and it told me that’s its limit to recent events.
-
Coding questions are fine but keep it short. Mainly because I want to post answers here and long lines of code on the forum…not nice (can we do collapsible text on here…really asking, that might help).
-
If you asked the question please make a post of your assessment of the response. Good, bad, ugly, nasty…let us know.
-
Catch-all…rules may change. Seeing how it goes. I will certainly note any additions or modifications. Mods, of course, can intervene as they like (they don’t need my permission)
-
Don’t use this as a way to get me to use my AI for your work. I doubt anyone would…needs to be said.
-
If you asked the question please post a response of how you think the AI did.
One more thing…I want this to be fun, I am not an AI evangelist at all. I have found it useful but I am wary. Also, I can only do so many queries per day so if I can’t answer everything it is due to limits (and I use it for some work which takes priority).
Let’s try it for a few days and see how it goes. Have at it!
FTR: I am using Claude Opus 4