Technology

Opera Mini launches AI-powered update to compete with Google and Microsoft...

AI Observer
News

ChatGPT 4.1 Early Benchmarks compared to Google Gemini

AI Observer
Anthropic

Kick’s cofounder discusses the creator push and growing-pains

AI Observer
Anthropic

Marketing Briefing: “Expecting Chaos”: With tariff uncertainty as a new constant...

AI Observer
Anthropic

What next for TikTok creators following the latest ban delay? Alyssa...

AI Observer
Anthropic

Why brands and agencies are putting AI Chiefs in their C...

AI Observer
News

Nvidia joins the Made-in-America Party, hoping to flog $500B worth of...

AI Observer
News

Nvidia Says It Started Making Chips for AI in the US

AI Observer
News

OpenAI launches its flagship AI model, the GPT-4.1

AI Observer
News

OpenAI plans to phase-out GPT-4.5 from its API

AI Observer
News

OpenAI’s new GPT-4.1 AI models focus on coding

AI Observer

Featured

News

OpenAI’s Deep Research is more accurate than you in fact-finding, but...

AI Observer
News

OpenAI releases new simulated reason models with full access to tools

AI Observer
News

xAI adds a memory feature to Grok

AI Observer
AI Hardware

Congress wants to know if Nvidia superchips slipped through Singapore to...

AI Observer
AI Observer

OpenAI’s Deep Research is more accurate than you in fact-finding, but...

Wei and team don't directly offer any hypothesis about why Deep Research fails almost half the time, but the implicit answer is in the scaling of its ability with more compute. As they run more parallel tasks, and ask the model to evaluate multiple answers, the accuracy scales...