
OpenAI wants chatbots to guess less, admit more


Why Do Advanced Language Models Like GPT-5 Still Fabricate Information?

Even the most sophisticated AI chatbots occasionally generate false information with unwavering confidence. This phenomenon, often referred to as “hallucination,” remains a significant challenge in the development of large language models (LLMs) such as GPT-5.

Understanding the Root Cause of AI Hallucinations

At their core, these AI systems are designed to predict the next word in a sequence based on patterns learned from vast datasets. During the initial training phase, known as pretraining, LLMs are not explicitly taught to distinguish fact from falsehood. Instead, they optimize for the likelihood of word sequences, which works well for grammar and syntax but falters when precise factual knowledge is required.
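A minimal, purely illustrative sketch in Python of what that objective implies (the prompt and the probabilities below are invented, not drawn from any real model): a system that simply picks the most likely continuation can state a falsehood just as fluently as a fact.

# Toy sketch, not a real model: a language model ranks continuations by how
# likely they look, not by whether they are true. All probabilities are invented.
next_word_probs = {
    "The capital of Australia is": {
        "Sydney": 0.55,    # frequent in text, but factually wrong
        "Canberra": 0.40,  # correct, yet written less often
        "Melbourne": 0.05,
    }
}

prompt = "The capital of Australia is"
candidates = next_word_probs[prompt]
# Picking the single most likely word can yield a confident falsehood.
print(max(candidates, key=candidates.get))  # -> "Sydney"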

For example, when researchers queried a popular language model about the title of Adam Tauman Kalai’s PhD dissertation, the AI confidently provided three different, entirely fabricated titles. Similarly, it offered multiple incorrect birthdates for the same individual. This illustrates how, despite processing billions of data points, the model can still fail on straightforward factual queries.

The Role of Evaluation Metrics in Encouraging AI Bluffing

Current evaluation methods for LLMs resemble multiple-choice tests where only correct answers receive credit. Guessing can sometimes yield points, while abstaining from answering results in zero. This scoring system inadvertently incentivizes the AI to “guess” rather than admit uncertainty, leading to confidently stated inaccuracies.
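A rough back-of-the-envelope sketch in Python (hypothetical numbers, not any benchmark’s actual grading code) shows why: under accuracy-only scoring, even a low-confidence guess has a higher expected score than saying nothing.

# Hypothetical accuracy-only grading: a correct answer earns 1 point,
# while a wrong answer and an abstention both earn 0.
def expected_score(p_correct: float, abstain: bool) -> float:
    return 0.0 if abstain else p_correct

# Even a 20%-confidence guess beats abstaining in expectation,
# so the grading scheme quietly rewards bluffing.
print(expected_score(0.20, abstain=False))  # 0.2
print(expected_score(0.20, abstain=True))   # 0.0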

To address this, some experts suggest revamping the evaluation framework. One proposal draws a parallel to standardized tests like the SAT, where incorrect answers carry penalties and leaving a question blank earns partial credit; a similar scoring scheme could encourage AI models to express doubt rather than fabricate information.
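A sketch of how such a penalty might shift the incentive, assuming a hypothetical 0.5-point deduction for wrong answers (the figure is illustrative, not a published benchmark rule): below roughly 33% confidence, abstaining now beats guessing.

# Hypothetical SAT-style grading: wrong answers cost a penalty, abstentions score 0.
def penalized_score(p_correct: float, abstain: bool = False, penalty: float = 0.5) -> float:
    if abstain:
        return 0.0
    return p_correct - (1.0 - p_correct) * penalty

# With a 0.5-point penalty, guessing only pays off above ~33% confidence;
# below that threshold, “I don’t know” is the better strategy.
print(penalized_score(0.20))  # -0.20  (worse than abstaining)
print(penalized_score(0.50))  #  0.25  (better than abstaining)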

Can Adjusted Scoring Systems Improve AI Reliability?

Rewarding honesty and penalizing confident errors could reduce how often AI systems hallucinate. However, this shift could also lead to more responses like “I don’t know,” which might frustrate users accustomed to definitive answers.

Despite these challenges, hallucinations are unlikely to disappear entirely. The goal is to minimize them and foster AI that is more transparent about its limitations.

Practical Advice for Users Interacting with AI Chatbots

Until AI models become more reliable, it’s wise to treat chatbots as engaging conversational partners who may occasionally provide incorrect information with great assurance. Always verify critical facts through trusted sources.

Join the Conversation

Should AI developers prioritize designing chatbots that openly acknowledge uncertainty over those that provide confident but potentially false answers? Will penalizing hallucinations in evaluation systems enhance AI trustworthiness, or will it lead to user dissatisfaction due to more frequent admissions of ignorance? Share your thoughts in the comments or connect with us through our contact channels.
