This is a great visualization of what LLMs are about. The purpose of an LLM is to produce something that looks right. Just as this math sheet looks great at first glance while the actual content is garbage, LLMs create output that sounds right, and it doesn’t really matter whether it is. And improving LLMs is not about making the answers more reliable, it’s about making them sound even more convincing.
But the worst thing is that some of the things are correct:
a²-b²=(a-b)(a+b)
a²+b²=c²
And these are common knowledge. This makes it even worse, since the correct parts seem to “prove” that the rest is correct too.
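For what it's worth, those two are easy to check yourself. Here is a rough Python sketch (the function names are made up for illustration, and a random spot-check is not a proof): the first identity holds for any integers, while the second is the Pythagorean relation and only holds when a, b, c are the sides of a right triangle.

```python
import random


def difference_of_squares_holds(trials=1000, seed=0):
    """Spot-check a^2 - b^2 == (a - b)(a + b) on random integers."""
    rng = random.Random(seed)
    for _ in range(trials):
        a = rng.randint(-10**6, 10**6)
        b = rng.randint(-10**6, 10**6)
        if a * a - b * b != (a - b) * (a + b):
            return False
    return True


def is_right_triangle(a, b, c):
    """a^2 + b^2 == c^2 is only true for right-triangle side lengths."""
    return a * a + b * b == c * c


if __name__ == "__main__":
    print(difference_of_squares_holds())
    print(is_right_triangle(3, 4, 5))
    print(is_right_triangle(2, 3, 4))
```

The point is just that the familiar formulas survive a check like this, which says nothing about the rest of the sheet.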
It’s creative writing, about the same as what I’d expect a random artist to produce as an illustration for some genius math. I’d be much more worried about AI if it got everything right already lol.
And there are efforts, even simple methods, to verify LLM output, like asking it to provide sources for its claims or to find examples of the formulas on the web or in textbooks.