Gemini vs ChatGPT Deep Research destroyed my expectations
If you value guides on AI, please see my new section at the top of my Newsletter website. On the right side margin of the web view, I also have a trending works section highlighting articles that have done well. As we’re waiting for the next splash of new models, there’s a lot of noise about reasoning models and their agents. Today’s guide will be by How should we think of Deep Research, I mean really? We’ve been waiting so long for OpenAI’s GPT-5, it feels like we’ve been cheated somehow. 🤷🏻♂️ Disclosure, I don’t think Grok 3 is the answer. Deep Research is supposed to be the start of an AI agent era of utility, but is it? Google Gemini’s version of Deep Research is notably faster in conducting research, often completing tasks within 5-15 minutes. In contrast, OpenAI's Deep Research can take anywhere from 5 to 30 minutes to finish similar tasks.
The Comparison
Since DeepSeek R1, I’ve been wondering at OpenAI’s product timeline. And no sooner had I analyzed it that OpenAI mentioned they will make their Deep Research model available to users on ChatGPT’s free tier soon, according to an X post by CEO Sam Altman, yesterday on February 12th. Now Grok 3 by xAI also has its own Deep Search. But is Deep Search more Deep Research or more Perplexity for X? xAI's Deep Search is a new tool introduced as part of the Grok 3 AI model announced recently, which serves as a next-generation search engine. Note also that the xAI Super Grok subscription is a premium tier offering designed to provide users with early access to the latest advancements in AI technology, including the Grok 3 model. DeepSearch by xAI (or Research) is an agentic feature, which means it deputes the AI chatbot to act more like an assistant for searching, sorting, and compiling all the information you need in the form of a comprehensive report, complete with citations. Full disclosure, I mostly use the free tier of Genspark that also has its own agent.
OpenAI's Spin
Consensus so far is that OpenAI’s spin on “Deep Research” is the best in class. Is this really a Generational Tool or just part of the future of Search that we will soon take for granted? As Google brings their version of Deep Research to their app on mobile, OpenAI is suddenly downgrading it from Pro to all users. Meanwhile, I’m seeing a lot of noise about OpenAI’s version of it. Personally, I liked the virality of DeepSeek; I’m less enamored with Deep Research clones. You might have noticed that Tencent has integrated DeepSeek with its search on WeChat and Baidu has integrated it with advanced Search with ERNIE. Agentic AI is not a mature ecosystem with scalable frameworks or much consolidation yet. But the application layer innovation is starting to take place a bit more rapidly so far in 2025. Note: During the time this draft was in the queue, even Perplexity announced its own hallucinating-prone version of Deep Research. But let’s try to understand it deeper. So is this changing the future of science like Nathan Lambert suggests, or another Meta Galactica like Gary Marcus suggests? Pragmatic as I am, I turn to my trusty guide maker Is this a promising intern or a moment in the evolution of so-called reasoning models? If usually sane people like Nathan are this bullish, am I missing something? Apparently I’m missing a lot. But let’s get to the point already. When to use Google Gemini’s version vs. OpenAI’s version? Keep in mind that OpenAI’s deep research is supposed to be better, it primarily utilizes a specialized version of their upcoming "o3" model. Read the FAQ. We should really mention that both You.com and genspark have had agents like this for quite some time, though obviously not as sophisticated. We’re supposed to get GPT-4.5 in weeks, but we got access to GPT-4 in March 2023 - nearly two full years ago. 🕭 Many bells and whistles later, we got another tool supposed to keep us busy in an onslaught of agents from every direction.
Industry Insights
The WSJ recently interviewed OpenAI’s VP of Engineering, Srinivas Narayanan, where he tried to validate OpenAI’s need for so much compute like Stargate and scaling laws. Credit to the Journal’s Belle Lin. So let’s get into our guide: What follows is a 19-min video guideArticle comparing OpenAI’s Deep Research step by step with that of Google’s. 🦜 From Rio de Janeiro, Brazil - Alex is one of my favorite AI tool guide makers. Subscribe to AI Supremacy to keep reading this post and get 7 days of free access to the full post archives.

The Comparison continued...