News

OpenAI's reasoning AI models are getting better, but their tendency to hallucinate is not, according to benchmark results.
OpenAI’s newest reasoning models, o3 and o4‑mini, produce made‑up answers more often than the company’s earlier models, as ...
Specifically, o3 tends to make more claims overall, leading to both more accurate claims and more inaccurate/hallucinated ...