Andrew Mayne’s blog

October 27, 2025

Will AI displace humans in the economy and culture?

The question of how much compute it takes to replace humans in the workforce is unnecessarily reductive. Creating efficiencies does not guarantee humans will prefer a fully automated solution. More so, it does not mean it can’t be outcompeted by an automated solution wielded by a well-networked human. While AI will likely be able to…
Read more
October 8, 2025

Apps Inside ChatGPT: A Glimpse into the Future of AI-Native Development

At this year’s OpenAI Dev Day, I think we saw the next chapter of software unfold. The big headline wasn’t just about new models—it was about an entirely new way of building and interacting with apps. OpenAI’s introduction of AppKit hints at a world where chat itself becomes the operating system. During my time at OpenAI there…
Read more
October 4, 2025

Sora: The ChatGPT Moment for Generative Video

Seventy-two hours ago, OpenAI launched Sora, an invite-only app that has already climbed to the top of Apple’s App Store. In just three days, it’s changed how I—and a lot of others—spend time online. Speaking personally, I’ve logged more hours inside Sora this week than I’ve spent on TikTok across my entire life. More than Instagram…
Read more
September 21, 2025

Thoughts about OpenAI GPT-5-Codex from my conversation with Greg Brockman and Thibault Sottiaux

For the latest episode of the OpenAI Podcast I sat down with OpenAI president and co-founder Greg Brockman and Code engineering lead Thibault Sottiaux to talk about the release of OpenAI’s new GPT-5-Codex model.
Read more
May 2, 2025

Inside the Launch of GPT-4

With GPT-4 now stepping back from its starring role in ChatGPT, I want to share a few of my favorite memories from its launch. I originally joined OpenAI as an engineer on the Applied team, but later moved into a hybrid role as OpenAI’s “science communicator.” That shift let me dive deep into technical work…
Read more
January 8, 2025

Understanding AI Hallucinations: How AI and Humans Make Similar Mistakes

In this video, we delve into the concept of AI hallucinations by comparing them to human cognitive errors. We explore how both humans and AI can make false statements due to incorrect information or reasoning errors. The video explains different scenarios where AI, like humans, can generate incorrect conclusions based on faulty data or assumptions.…
Read more
October 18, 2024

Can You Dramatically Improve Results on the Latest Large Language Model Reasoning Benchmark With a Simple Prompt?

tldr: Yes. A recent paper, GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models, makes the claim that their new benchmark demonstrates that state of the art reasoning models have critical limitations when it comes to reasoning. Putting aside the fact that they grouped together tiny 2.7 B parameter models (that could fit…
Read more
March 21, 2024

How to use GPT-4 with Vision for Robotics and Other Applications

TLDR: There are multiple ways you can use GPT-4 with Vision to power robotics and other systems. I included several sample apps you can download to experiment with including a robot simulator. Multimodal AI models like GPT-4 with Vision have allowed for entirely new kinds of applications that go far beyond just text comprehension. A…
Read more
March 12, 2024

Improving GPT-4’s Visual Reasoning with Prompting

TLDR: How to boost GPT-4 with Vision’s capabilities with a simple prompt addition. A recent paper How Far Are We from Intelligent Visual Deductive Reasoning? points out the limitations in visual reasoning in image models like GPT-4V. Like other related papers, I think the investigators are directionally correct (these models don’t have human-level reasoning and…
Read more
November 14, 2023

Is the Reversal Curse Real?

A recent paper The Reversal Curse points out an apparent failure in large large language models like GPT-4. From the abstract: We expose a surprising failure of generalization in auto-regressive large language models (LLMs). If a model is trained on a sentence of the form “A is B”, it will not automatically generalize to the reverse direction…
Read more