New models in the API: GPT-5 Pro, Sora 2, and a voice model (View Highlight)
Overall, it was a solid day on a few fronts, but it was missing some of the mind-blowing moments for developers of DevDays past. Last year’s DevDay, for example, came right after the launch of OpenAI’s first reasoning model, o1, and featured the launch of the Realtime API. In 2023, the company launched GPT-4 Turbo and custom GPTs (more on the latter below).
Maybe it’s a sign of how much the goalposts have moved, but this year’s DevDay felt a little more like OpenAI doubling down on existing opportunities than it was pushing the frontier of the future. (View Highlight)
What it is: AppsSDK, a set of tools (or software developer kit) for developers to build apps inside of ChatGPT instead of as stand-alone apps.
What it does: Users can chat with apps—like Figma, Zillow, or Spotify—without ever leaving the ChatGPT interface. (View Highlight)
Our take: This is the third time that OpenAI has launched an app store concept. In March 2023, it was ChatGPT plugins, which allowed ChatGPT to call third-party apps from inside the app. They were exciting at the time, but adoption was poor.
In November 2023, it was custom GPTs, and a few months later a custom GPT app store (basically a directory of custom GPTs for anyone to use). This was significantly more popular than plugins, but OpenAI didn’t continue to update custom GPTs. (View Highlight)
Now, it’s apps in ChatGPT, which looks like a move back to the original plugins concept. Both are effectively attempts to allow you to use third-party apps inside of the ChatGPT interface. It is the closest they’ve been to getting the app store right, but the key question is going to be performance. (View Highlight)
As a day-to-day user, I will only use an app inside of ChatGPT if it is at least as fast as using that app in a separate interface. I know this because we’ve been experimenting with experiences like this for some of our products. For example, we built an alpha feature for Sparkle that allows you to search the files on your computer from within Claude. You might input, “Grab the contract I just downloaded and summarize what it says,” without switching out of Claude.
This seems like a good idea in theory, but in practice, it’s so much faster to use the Sparkle native app to find a file. No one used the feature, so we didn’t ship it. (View Highlight)
Another problem is the company’s launch strategy. Apps in ChatGPT are currently limited to only a few big OpenAI partners, like Canva, Coursera, Figma, and Spotify. OpenAI is promising to open up the tools for other approved developers later this year, but I worry that it’s slow-rolling the discovery of good use cases by limiting access to big companies that are not AI-first. (View Highlight)
AgentKit: A feast for AI operations
What it is: A suite of tools that allows you to make agent workflows in a visual no-code builder and embed them directly in your own site.
What it does: If you’re an AI operations pro looking to streamline processes in your business—from customer service to accounts receivable—AgentKit lets you do that from within the OpenAI ecosystem.
Our take: OpenAI is coming for all of the developer-adjacent use cases that are becoming popular with the growing importance of AI operations. Cora general manager Kieran Klaassen observed that AgentKit isn’t so much an agent builder—which allows AI to go down many different paths to accomplish a goal—as it is a workflow builder, where a business user can define a specific end-to-end process for the AI to follow. (View Highlight)
OpenAI has also built evaluations—aka evals, or benchmarks—directly into AgentKit. Evals embed trust in your AI system. Having them built in by default makes them essential and easy(er) to rely on, instead of an afterthought.
One of the tools within AgentKit is Agent Builder, which allows you to use chat to easily create a visual process workflow, such as to handle customer refund requests:
AgentKit. Source: OpenAI (https://openai.com/index/introducing-agentkit/).
This is a great move by OpenAI, and it’s one we’re likely to use immediately. We’re already building these kinds of workflows for customer service using a bunch of hacked-together solutions from companies like Zapier and Gumloop.
Now that OpenAI has a version of this, we can’t not consider switching. It’s much easier to have everything in the same ecosystem. (View Highlight)
Nothing big from Codex
What it is: Codex, OpenAI’s coding agent, is out of research preview and now generally available. You can now call it from Slack, in addition to a few other small features for enterprise companies.
Our take: This is a snooze. The Codex team has been shipping fast and hard ever since the GPT-5 launch to catch up to Claude Code. It seems like there wasn’t much held in reserve for DevDay.
Codex CLI is my (Dan’s) daily driver for programming, so it was disappointing that there wasn’t more here. (View Highlight)
PT-5 Pro, Sora 2, and a new voice model in the API
What it is: Developers can call OpenAI’s most powerful model, GPT-5 Pro; its new video creation tool, Sora 2 (stay tuned for a forthcoming vibe check); and a cheaper voice model—gpt-realtime-mini—in the API.
What it does: Its most powerful new models, which were previously only available in OpenAI-powered apps like ChatGPT and Sora, are available to developers.
Our take: This was the biggest gift for developers from the whole day. GPT-5 Pro is a blessing for complex applications in programming, legal, healthcare, and more. And Sora 2 is a breakthrough for developers building creative tools or new social apps. Everything else at DevDay felt more targeted at AI operations and developer-adjacent users. (View Highlight)
DevDay was a little boring for developers but very exciting if you’re an AI operations pro. I’m glad OpenAI is still pushing on the app store front—if Every had access to the AppsSDK, we’d build an app for ChatGPT.
I’m not convinced that this is yet the app store moment, but I like that they’re continuing to try. One thing that’s been lost in all of the feature announcements: OpenAI announced that ChatGPT went from 100 million weekly active users last year to 800 million this year—a truly astounding number. (View Highlight)
Where is the vision? And who is it for?
Generative AI is the closest thing we have to magic. But today felt more like pulling back the curtain to explain how it’s done, instead of putting on a show and looking forward to what could be. (View Highlight)
By contrast, DevDay felt like we were looking backwards at what’s been done, and what a few partners with early access did with a small team. It was, essentially, a coding demo. It may be because OpenAI is trying to build AI for everyone, but the company hasn’t yet found the problem that everyone has that it uniquely solves. Compare that to Anthropic’s new “thinking” brand campaign, which seems to have a way to inspire people from all walks of life.
OpenAI is shipping incredible work and has laid a great foundation that will compound: The primitives for “apps integrated into ChatGPT” and “agents with built-in evaluations” have been built; now the focus can shift to building with them to solve real problems. But today OpenAI seems stretched thin, and I yearn for more. I yearn for inspiration. A picture of the dream they’re chasing. (View Highlight)