News

OpenAI’s newest reasoning models, o3 and o4‑mini, produce made‑up answers more often than the company’s earlier models, as ...
Specifically, o3 tends to make more claims overall, leading to more accurate claims as well as more inaccurate/hallucinated ...
OpenAI's reasoning AI models are getting better, but their hallucination problem isn't, according to benchmark results.
When the user contacted Cursor support, an agent named "Sam" told them it was expected behavior under a new policy. But no such policy existed, and Sam was a bot. The AI model made the policy up, ...
AI evaluation startup Patronus AI did the math to see how much one mistake made by an AI agent costs a company.
Samsung’s round robot, Ballie, is now using Google Gemini alongside “proprietary” AI models. We still need to see it to ...
A man appearing before a New York court got a scolding from a judge after he tried to use an avatar generated by artificial ...
That's the unsettling takeaway from a new study by Anthropic, the makers of the Claude AI model. They decided to test whether ...
And he felt the avatar would be able to deliver the presentation without his own usual mumbling, stumbling and tripping over his words ... but it was also capable of so-called AI hallucinations.