OpenAI

OpenAI’s Deep Research is more accurate than you in fact-finding, but...

AI Observer
News

OpenAI releases new simulated reason models with full access to tools

AI Observer
News

xAI adds a memory feature to Grok

AI Observer
News

Claude has just acquired superpowers. Anthropic AI can now search through...

AI Observer
News

OpenAI names new nonprofit ‘advisors’

AI Observer
News

ChatGPT 4.1 Early Benchmarks compared to Google Gemini

AI Observer
News

OpenAI launches its flagship AI model, the GPT-4.1

AI Observer
News

OpenAI plans to phase-out GPT-4.5 from its API

AI Observer
News

OpenAI’s new GPT-4.1 AI models focus on coding

AI Observer
News

Netflix is testing out a new OpenAI powered search

AI Observer
News

How AI is interacting our creative human processes.

AI Observer

Featured

News

OpenAI’s Deep Research is more accurate than you in fact-finding, but...

AI Observer
News

OpenAI releases new simulated reason models with full access to tools

AI Observer
News

xAI adds a memory feature to Grok

AI Observer
AI Hardware

Congress wants to know if Nvidia superchips slipped through Singapore to...

AI Observer
AI Observer

OpenAI’s Deep Research is more accurate than you in fact-finding, but...

Wei and team don't directly offer any hypothesis about why Deep Research fails almost half the time, but the implicit answer is in the scaling of its ability with more compute. As they run more parallel tasks, and ask the model to evaluate multiple answers, the accuracy scales...