OpenAI cracks AI’s hallucination code

Good morning, AI enthusiasts. Every user of large language models (LLMs) has run into it: a confident chatbot answer that turns out to be entirely fabricated. OpenAI now believes it has uncovered the root cause behind these persistent AI hallucinations.

Their newest research indicates that the key to reducing hallucinations might be surprisingly straightforward – training AI models to accept and express uncertainty by saying “I don’t know” when appropriate.


Today’s AI Highlights:

  • OpenAI identifies the cause of chatbot hallucinations
  • Anthropic agrees to a $1.5 billion settlement with authors
  • How to automate web monitoring using AI agents
  • OpenAI partners with Broadcom to produce custom AI chips
  • New AI tools, community workflows, and more updates

AI INSIGHTS

OpenAI’s Breakthrough on AI Hallucinations

OpenAI’s latest study proposes that AI hallucinations stem from conventional training methods that reward models for confidently guessing answers rather than admitting when they lack knowledge. This insight could pave the way for more reliable AI systems.

  • Models tend to fabricate information because standard training and evaluation give credit for a lucky guess but none for admitting uncertainty.
  • This creates a dilemma where AI is incentivized to always provide an answer, even if it’s a guess.
  • Experiments showed that when asked for specific details like birthdays or dissertation titles, models confidently gave varying incorrect answers.
  • The researchers suggest revising evaluation metrics to penalize confident mistakes more heavily than expressions of uncertainty.

Addressing this could shift AI development towards systems that recognize their limitations, prioritizing truthful responses over mere accuracy scores – a crucial step for applications where reliability is paramount.
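
The scoring change the researchers suggest can be sketched in a few lines of Python. This is a minimal illustration, not OpenAI's actual evaluation code: the function names, the `ABSTAIN` marker, and the penalty value of 2.0 are all assumptions chosen for the example.

```python
from typing import Optional

ABSTAIN = None  # hypothetical marker for an "I don't know" response


def binary_score(answer: Optional[str], truth: str) -> float:
    """Accuracy-only grading: a wrong answer and an abstention both score 0."""
    return 1.0 if answer == truth else 0.0


def penalized_score(answer: Optional[str], truth: str,
                    wrong_penalty: float = 2.0) -> float:
    """Grading that charges for confident mistakes but not for abstaining."""
    if answer is ABSTAIN:
        return 0.0
    return 1.0 if answer == truth else -wrong_penalty


def expected_guess_score(p_correct: float, wrong_penalty: float) -> float:
    """Expected score of guessing when the guess is right with probability p_correct."""
    return p_correct * 1.0 - (1.0 - p_correct) * wrong_penalty


# A model that is only 20% sure of someone's birthday (assumed figure):
p = 0.2
# Accuracy-only grading: guessing has a positive expected score, abstaining scores 0.
print(expected_guess_score(p, wrong_penalty=0.0))
# Penalized grading: guessing has a negative expected score, abstaining still scores 0.
print(expected_guess_score(p, wrong_penalty=2.0))
```

Under this sketch, guessing only pays off when the model's chance of being right exceeds `wrong_penalty / (1 + wrong_penalty)`, which is two in three for a penalty of 2.0. With no penalty, that threshold drops to zero, so a model is always better off guessing.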

SPONSORED: Concierge AI

Today’s SaaS customers expect instant, precise answers without endless searching. Concierge is an AI-powered answer engine tailored to your company’s data, delivering personalized, accurate responses directly on your website.

  • Handles complex buyer inquiries using advanced retrieval-augmented generation (RAG) techniques.
  • Offers full control and monitoring of conversations, including sentiment analysis and safety guardrails.
  • Builds trust with visitors by providing instant, reliable information before demo commitments.

Transform every visitor question into meaningful engagement and boost your conversion rates.

ANTHROPIC LEGAL UPDATE

Anthropic Agrees to $1.5 Billion Copyright Settlement

Anthropic has agreed to a historic $1.5 billion settlement resolving a class-action lawsuit filed by authors whose copyrighted works were used without permission to train its Claude AI models.

  • The lawsuit revealed Anthropic sourced over 7 million unauthorized books from shadow libraries like LibGen.
  • A federal court ruled that while training on legally purchased books is fair use, pirated copies infringe copyright.
  • The settlement pays authors roughly $3,000 per pirated book across about 500,000 works (500,000 × $3,000 = $1.5 billion), with provisions for additional payments if more infringing works are identified.
  • Anthropic must also delete all pirated materials and is barred from using them in future training.

This landmark case sets a precedent in the ongoing debate over AI training data legality, emphasizing the importance of respecting copyright in AI development. Despite the large payout, Anthropic’s recent $13 billion funding round at a $183 billion valuation may cushion the financial impact.

AI MONITORING TOOLS

Streamline Web Updates with Yutori Scouts

Yutori Scouts is an AI-powered agent designed to track specific online content changes and notify you instantly via email, eliminating the need for manual page refreshes.

  • Input your monitoring criteria, such as “Latest releases from OpenAI, Anthropic, Gemini, or xAI.”
  • Select alert frequency: immediate, daily, or weekly, then activate your Scout.
  • Manage all active Scouts through a centralized dashboard, with options to edit, pause, or delete.
  • Receive detailed reports with direct links to sources for quick follow-up.

Pro tip: Use Scouts for time-sensitive alerts like product restocks, reservation openings, or breaking industry news, and integrate with automation tools for seamless workflows.
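
For readers curious what change tracking looks like at its simplest, here is a generic Python sketch. It is not how Yutori Scouts works internally, which is not described here. The function names and the example URL are hypothetical, and fetching pages and sending email are left out.

```python
import hashlib
from typing import Dict, List


def fingerprint(content: str) -> str:
    """Hash page text with whitespace normalized, so cosmetic reflows do not trigger alerts."""
    normalized = " ".join(content.split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def changed_pages(snapshots: Dict[str, str], seen: Dict[str, str]) -> List[str]:
    """Return URLs whose content differs from the last stored fingerprint, updating `seen`."""
    changed = []
    for url, content in snapshots.items():
        digest = fingerprint(content)
        if seen.get(url) != digest:
            changed.append(url)
            seen[url] = digest
    return changed


seen: Dict[str, str] = {}
url = "https://example.com/releases"  # placeholder URL for illustration
first = changed_pages({url: "Model v1 released"}, seen)    # new page, reported
second = changed_pages({url: "Model  v1 released"}, seen)  # whitespace only, ignored
third = changed_pages({url: "Model v2 released"}, seen)    # real change, reported
print(first, second, third)
```

A plain hash comparison like this flags any textual change at all. Judging whether a change actually matches a natural-language criterion is the part an AI agent adds on top.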

FEATURED TOOL: Popcorn AI

Popcorn revolutionizes film creation by generating complete 1-3 minute movies from a single conceptual prompt, including narrative structure, dialogue, lip-sync, soundtrack, and sound effects.

  • Manages the entire creative pipeline from idea generation to automated editing.
  • Produces full narrative films rather than isolated scenes.
  • Allows instant changes in style, genre, and story arcs.
  • Maintains consistent libraries of characters, props, and settings.

Start your first movie for free using the code AGENTIC.

OPENAI HARDWARE INITIATIVE

OpenAI to Manufacture Custom AI Chips with Broadcom

According to recent reports, OpenAI is set to begin mass production of proprietary AI chips next year through a collaboration with Broadcom, aiming to reduce reliance on Nvidia’s GPUs amid soaring demand.

  • Broadcom disclosed a $10 billion chip order from a confidential client, confirmed to be OpenAI for internal use.
  • These custom chips will enable OpenAI to double its computational capacity within five months, supporting demand from models like GPT-5.
  • The partnership was initiated last year, with production timelines clarified only recently.
  • Other tech giants such as Google, Amazon, and Meta have also developed in-house AI chips, signaling a trend away from Nvidia dominance.

This move highlights the strategic importance of owning hardware infrastructure to manage costs and scale AI capabilities efficiently.

QUICK UPDATES

  • Google launches an open-source, on-device embedding model enhancing privacy and speed.
  • New voice-command tools enable coding and app development hands-free.
  • Alibaba unveils Qwen3-Max, a trillion-parameter model outperforming previous versions and competitors.
  • Innovative AI-driven product placement ads now create hyper-realistic custom content.

Upcoming Event: MongoDB.local NYC on September 17 – explore AI-driven data innovations. Use code SOCIAL50 for 50% off registration.

Additional news: OpenAI projects $115 billion in expenditures over the next four years due to data center expansion and compute needs; French startup Mistral raises $1.7 billion in Series C, valuing it at $11.7 billion; and a new class-action lawsuit targets Apple for allegedly training LLMs on pirated book datasets.

COMMUNITY SPOTLIGHT

AI in Everyday Life

Each issue, we highlight how readers leverage AI to enhance productivity and simplify tasks. Today, Avi F. from Portland, ME shares:

“I uploaded my 100+ page health insurance policy into a custom GPT and asked about coverage details and costs. I discovered I have unlimited free telehealth therapy, information buried in complex language. Previously, I’d spend over an hour navigating customer service calls with uncertain answers. This AI solution took minutes to set up and query.”

How are you using AI? Share your story with us.

Until next time,

Rowan, Joey, Zach, Shubham, and Jennifer – your team behind The Rundown
