OpenAI

OpenAI’s Deep Research is more accurate than you in fact-finding, but...

AI Observer
News

ChatGPT’s memory enhancement might just be the greatest AI improvement we...

AI Observer
News

ChatGPT can now remember all conversations, not only what you tell...

AI Observer
News

OpenAI wants ChatGPT ‘to know you over your lifetime’ with new...

AI Observer
News

Claude copies ChatGPT’s $200 Max Plan, but users aren’t happy

AI Observer
News

ChatGPT now remembers and references all of your previous chats.

AI Observer
News

Anthropic’s Max Plan provides nearly unlimited Claude usage at $200 per...

AI Observer
News

Microsoft celebrates 50 years by adding familiar AI features

AI Observer
News

Google is allegedly paying AI staff to do absolutely nothing for...

AI Observer
News

Llama 4, Meta’s answer to DeepSeek, is here! Long-context Scout and...

AI Observer
News

OpenAI tests watermarking of ChatGPT-4o image generation model

AI Observer

Featured

News

OpenAI releases new simulated reasoning models with full access to tools

AI Observer
News

xAI adds a memory feature to Grok

AI Observer
AI Hardware

Congress wants to know if Nvidia superchips slipped through Singapore to...

AI Observer

OpenAI’s Deep Research is more accurate than you in fact-finding, but...

Wei and team don't directly offer a hypothesis about why Deep Research fails almost half the time, but the implicit answer lies in how its ability scales with more compute. As they run more parallel tasks and ask the model to evaluate multiple answers, the accuracy scales...
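
The idea of running several attempts in parallel and then choosing among the answers can be illustrated with a toy simulation. The sketch below is not the method Wei and team used, which the excerpt does not specify. It assumes simple majority voting, and every name and number in it (`majority_vote`, `simulate_accuracy`, a 50% per-attempt success rate, 50 possible wrong answers) is hypothetical and chosen only for illustration.

```python
import random
from collections import Counter


def majority_vote(answers):
    """Return the answer that appears most often among the attempts."""
    return Counter(answers).most_common(1)[0][0]


def simulate_accuracy(n_samples, p_correct=0.5, n_wrong=50, trials=2000, seed=0):
    """Estimate how often majority voting over n_samples attempts is right.

    Assumptions (illustrative, not from the article):
    - each attempt is independently correct with probability p_correct;
    - a wrong attempt picks one of n_wrong distinct wrong answers at random,
      so wrong answers rarely agree with each other.
    """
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        answers = []
        for _ in range(n_samples):
            if rng.random() < p_correct:
                answers.append("correct")
            else:
                answers.append(f"wrong-{rng.randrange(n_wrong)}")
        if majority_vote(answers) == "correct":
            hits += 1
    return hits / trials


if __name__ == "__main__":
    # More parallel attempts means more compute spent per question.
    for n in (1, 2, 4, 8, 16):
        print(n, simulate_accuracy(n))
```

Under these assumptions, correct answers tend to agree while wrong ones scatter, so the estimated accuracy rises as more attempts are sampled. Real systems may behave differently, for example when wrong answers are correlated.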