Good Idea, Bad Execution

Lots of AI companies have great ideas, but their products just aren't quite there yet.

Welcome back, agent.

This week's mission: Testing Gamma against a winning pitch deck, and answering the question everyone keeps asking: ChatGPT, Claude, Gemini, or Grok?

AGENT #1: Gamma

Forgettable Presentations. Unforgettable Show

Gamma promises AI-generated presentations in seconds. I just won $20K pitching at SF Tech Week, so naturally I had to see if AI could compete with my winning slides.

  1. 9/10 on Time: Quick and easy to develop 10 slides based on your ideas.

  2. 4/10 on Design: The slides have a nice, clean look, but I dislike the tiny font, the wordiness, and the varying slide height (seriously... why is the slide so long?).

  3. 8/10 on Graphics: Cool AI images that are relevant to the topic (way better than stock images, and all the images share the same theme).

Presentation made by Gamma on Fast Fashion pitch.

  4. 5/10 on Uniqueness: It feels basic. There is no storyline to the presentation, just words formatted nicely.

How You Can Exploit Gamma & Make It Useful:

  • Use the image generation as a tool: AI images that share a consistent theme are hard to find, and stock images are boring. Instead, use the images from the presentations Gamma produces.

  • Great for brainstorming: Stuck and don’t know where to start? Gamma will help you start ideating.

Bottom Line: Gamma gets you from zero to presentation quickly, but won't impress anyone. If you're okay blending in, it works. If you need to win, use Canva instead.

With so many leading generative AI chatbots out there, it's useful to know which ones are best at what.

Task: Best Model

  • Coding & Software Development: Claude 4

  • Math & Complex Reasoning: Grok 4

  • Creative Writing: Claude 4

  • Deep Research & Analysis: ChatGPT o3

  • Multimodal Tasks (e.g., Image/Video Generation): Gemini 2.5 Pro

  • General Conversation & Everyday Queries: ChatGPT o3

  • Long Documents & Context Handling: Gemini 2.5 Pro

  • Agentic Behavior & Tool Use: Grok 4

  • Up-to-Date Knowledge & Real-Time Info: Grok 4

Bottom Line: All of these chatbots are incredibly powerful. Playing to each one's strengths can give you a slight advantage. However, the AI race moves fast, and these companies can leapfrog one another at any time.

AI Intel Briefing

On October 10, JPMorgan’s new AI trading model hit 85% accuracy in predicting short-term stock movements, analyzing 1B+ data points daily to guide $500M in trades, outpacing human analysts.

Meta has poached Andrew Tulloch, co‑founder of the startup Thinking Machines Lab, with a compensation package rumored to reach $1.5 billion over six years.

A new study has revealed that a growing number of Americans are forming romantic attachments with AI chatbots. Many participants said they preferred bot interactions over messy real-life relationships, praising the consistency, availability, and nonjudgmental nature of AI pals.

Google's October 11 update equips Gemini 2.5 agents to navigate real apps via screenshots and actions like clicks or typing, looping over goals and action history for autonomous desktop operation, no code required.

Mission Debrief

Human creativity has not yet been cloned by AI. If you want to win, don’t rely on AI for the ideas; rely on your brain. Instead, use AI to get your ideas from mind to prototype quickly.

A riddle for my puzzle lovers:

Suppose you have the ability to randomly generate a number in [0, 1] and then create a block of that height. Keep generating these blocks and stack them on top of each other until the height exceeds 1.

What is the average number of blocks needed?
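
For agents who would rather check their answer empirically than on paper, here is a minimal Monte Carlo sketch of the riddle in Python. It assumes the blocks are independent and uniformly distributed on [0, 1]; the function names, trial count, and seed are illustrative choices, not part of the riddle. It only gives an estimate, so the exact answer is still yours to derive.

```python
import random


def blocks_needed(rng):
    """Stack uniform(0, 1) blocks until the total height exceeds 1.

    Returns the number of blocks used in this single run.
    """
    height = 0.0
    count = 0
    while height <= 1.0:
        height += rng.random()  # one block with a random height in [0, 1)
        count += 1
    return count


def average_blocks(trials=100_000, seed=0):
    """Estimate the average number of blocks over many independent runs."""
    rng = random.Random(seed)  # seeded so the estimate is reproducible
    total = sum(blocks_needed(rng) for _ in range(trials))
    return total / trials


if __name__ == "__main__":
    print(f"Estimated average number of blocks: {average_blocks():.3f}")
```

Raising `trials` tightens the estimate; compare what it prints against your pencil-and-paper answer.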

Agent Check In

If you got this far: Hit reply and tell me which AI chatbot you use most. Is it ChatGPT, Grok, Gemini, or Claude?

Stay Undercover,

Ashna Jain

Enjoying these intelligence reports? Forward this to a fellow agent.