
📺 Today’s recommended deep-dive video: https://www.youtube.com/watch?v=hS1YqcewH0c
Beyond the Chatbot: OpenAI DevDay Unveils Native Apps, Autonomous Agents, and “Vibe Coding”
OpenAI is moving beyond simple chat interfaces to a world where AI creates, deploys, and manages software natively. From building apps inside ChatGPT to agents that control physical hardware, the barrier between idea and execution has never been thinner.
Core Question: How is OpenAI transforming from a model provider into a comprehensive operating system for AI-native applications and autonomous agents?
Highlights
- Apps SDK: Developers can now build interactive, full-stack applications directly inside the ChatGPT interface.
- Agent Kit: A visual building-block system designed to take autonomous agents from prototype to production with built-in evals.
- GPT-5 Codex: A specialized model that automates software engineering, enabling “vibe coding” where voice and sketches turn into functional code.
- Sora 2 API: A major update to the video generation model featuring high controllability and perfectly synchronized soundscapes.
⏱️ Reading time: approx. 6 minutes · Saves you about 46 minutes vs. watching.
Want to take notes while watching? Click the image below and let AI Notebook capture the key points for you 👇
The ChatGPT “Operating System”: Introducing the Apps SDK
Native Interaction and Global Distribution
OpenAI is reinventing the interface by allowing developers to build real, interactive apps inside ChatGPT. This isn’t just a plugin; it’s a full-stack environment where you can connect data, trigger actions, and render a fully interactive UI using the Model Context Protocol (MCP).
The Apps SDK gives you complete control over your backend logic and frontend UI while providing instant access to ChatGPT’s hundreds of millions of weekly users. If a user is already subscribed to your external service, they can log in directly within the conversation. Looking ahead, OpenAI plans to support diverse monetization strategies, including a new Agentic Commerce Protocol for instant checkouts.
During the keynote, live demos showcased the sheer versatility of this new platform. A user asked Coursera to teach them machine learning, and the app rendered a video player that pinned to the top of the screen. Because the app shares context with the model, the user could ask, “What is the instructor saying right now?” and ChatGPT could explain the video content in real-time without further prompting.

💡 Digging Deeper
Q: How does the app maintain state during a conversation?
A: The Apps SDK uses an API to expose real-time context back to ChatGPT, allowing the model to “see” exactly what the user is interacting with in the app UI.
Q: What is the underlying architecture?
A: Everything is built on MCP (Model Context Protocol), an open standard that makes it easy to integrate existing tools into the ChatGPT ecosystem.
Q: How do users find these new apps?
A: Users can invoke them by name, but ChatGPT will also proactively suggest relevant apps based on the needs of the conversation, such as suggesting Spotify for a party playlist.
Agent Kit: Standardizing Autonomous Workflows
From Prototype to Production
AI has evolved from systems you can ask anything to systems you can ask to do anything. However, building agents remains notoriously difficult due to the complexity of orchestration, evaluation loops, and tool connections. Agent Kit is OpenAI’s answer to this friction, providing a visual canvas to design logic steps and ship agentic ideas.
The toolkit is comprised of several core pillars: the Agent Builder for visual workflow design, Chat Kit for embedding customizable chat interfaces into external apps, and specialized Evals. These evaluation tools are critical for production, offering trace grading to understand agent decisions step-by-step and automated prompt optimization to refine performance over time.

💡 Digging Deeper
Q: Can I use Agent Kit with external models?
A: Yes, the platform now allows you to run evaluations on external models directly from the OpenAI dashboard to compare performance.
Q: How are these agents secured?
A: Agent Kit includes a Connector Registry for secure internal tool access and pre-built Guardrail nodes that block PII or prevent hallucinations.
Q: Is this strictly for OpenAI’s platform?
A: While designed for the OpenAI ecosystem, the builder includes a “Code Export” feature, allowing you to run the generated logic on your own servers.
The Future of Software: GPT-5 Codex and “Vibe Coding”
Turning Sketches into Systems
Software engineering is undergoing its most significant shift in decades as OpenAI moves Codex out of research preview and into general availability. Powered by the new GPT-5 Codex model, this system is specifically trained for agentic coding, refactoring, and deep architectural reviews. It now lives everywhere a developer works: the IDE, the terminal, GitHub, and Slack.
The power of this new model was demonstrated through “vibe coding”—the ability to build complex systems through natural interaction. In one demo, a developer controlled a professional Sony camera and venue lighting using only voice commands and an Xbox controller. Codex figured out the legacy 30-year-old Visca protocols and wrote the UDP packet logic entirely on its own.
Beyond simple scripts, Codex is now a teammate capable of seven-hour refactoring tasks. OpenAI reported that its own engineers complete 70% more pull requests per week using Codex. By integrating with the new Codex SDK, teams can now automate entire engineering workflows, allowing AI to not just write code, but to iterate on it based on live user feedback.

Key Takeaways
OpenAI’s DevDay 2024 marks the transition from AI as a “knowledge assistant” to AI as an “execution engine.” The introduction of the Apps SDK and Agent Kit suggests that the future of software isn’t just about chat bubbles; it’s about persistent, interactive tools that live within a unified intelligence layer.
The heavy emphasis on GPT-5 Codex and the Sora 2 API indicates a push toward multi-modal mastery. Whether it’s controlling physical hardware via Visca protocols or generating cinematic video with synchronized audio, OpenAI is providing the tools for developers to build apps that are aware of, and can interact with, the physical and digital worlds simultaneously.
Ultimately, the goal is to collapse the time between “idea” and “product” from months to minutes. As Sam Altman noted, you no longer need a massive engineering team to build complex systems; you need a good idea and the ability to steer the models effectively.
Q&A
Q1: What are the main model updates announced?
A1: OpenAI launched GPT-5 Pro in the API, Realtime Mini for faster voice interactions, and a preview of the Sora 2 API for video generation.
Q2: Can I build my own UI for agents?
A2: Yes, Chat Kit allows you to bring your own branding and workflows into a simple, embeddable chat interface for your specific product.
Q3: What makes Sora 2 different from the first version?
A3: Sora 2 offers significantly more controllability over state and composition, and most importantly, it can generate synchronized soundscapes and ambient audio grounded in the visuals.
Q4: How does the new Slack integration work for Codex?
A4: Teams can ask Codex to write code, perform reviews, or answer technical questions directly within their Slack conversations, making it a persistent teammate.
Q5: When can developers publish their ChatGPT apps?
A5: The Apps SDK is currently in preview. Developers can start building today, with a public directory and submission process opening later this year.
Q6: What is “trace grading” in Agent Kit?
A6: Trace grading allows developers to inspect the step-by-step decision-making process of an agent to identify exactly where a logic flow might be failing.
Q7: Does Sora 2 support image-to-video?
A7: Yes, the demo showed Sora 2 taking a static photo of a dog and generating a video of multiple dogs interacting while maintaining the original dog’s likeness.
