6 Sources
6 Sources
[1]
OpenAI launches new MacOS app for agentic coding | TechCrunch
AI is already having a seismic impact on how software is written, with much of the grunt work of programming now performed by swarms of agents and subagents. But as developers experiment with new interfaces and form factors for human-AI collaboration, it's become hard for even the most advanced AI labs to keep up. The current trend is for agentic software development -- systems where AI agents can work independently on coding tasks -- epitomized by the Claude Code and Cowork apps. In the meantime, OpenAI has been gradually building out its Codex tool, which launched as a command line tool last April and expanded to a web interface one month later. Now, OpenAI is taking a major step towards catching up. On Monday, the company launched a new MacOS app for Codex, integrating many of the agentic practices that have become popular in the past year. The new app is designed to work with multiple agents in parallel, integrating agent skills and other state-of-the-art workflows. The launch also comes less than two months after the launch of GPT-5.2-Codex, OpenAI's most powerful coding model, which the company hopes will be enough to tempt over Claude Code users. "If you really want to do sophisticated work on something complex, 5.2 is the strongest model by far," CEO Sam Altman told reporters on a press call. "However, it's been harder to use, so taking that level of model capability and putting it in a more flexible interface, we think is going to matter quite a bit." While Altman's confidence in GPT-5.2 is understandable, coding benchmarks tell a more complicated story. GPT-5.2 does hold the top spot on TerminalBench (a test measuring how well AI handles command-line programming tasks), at least as of press time. But agents from Gemini 3 and Claude Opus have logged roughly equivalent scores -- lower, but within the margin of error of the benchmark. Results from SWE-bench, another coding benchmark that tests AI's ability to fix real-world software bugs, are similar, showing no clear advantage for GPT-5.2. However, agentic use cases have been difficult to benchmark effectively, and state-of-the-art models can vary significantly in user experience. The Codex app also comes with a range of new features that OpenAI says will help it achieve parity or, in some cases, outpace the various Claude apps. The Codex app will allow for automations that can be set to run in the background on an automatic schedule, with results placed in a queue to be reviewed when the user returns. Users can also select different personalities for the agent -- from pragmatic to empathetic -- depending on their working style. But for the company, the biggest selling point is the sheer speed of development that's made possible by AI. "You can use this from a clean sheet of paper, brand new, to make a really quite sophisticated piece of software in a few hours," Altman said. "As fast as I can type in new ideas, that is the limit of what can get built."
[2]
OpenAI's Codex just got its own Mac app - and anyone can try it for free now
Switching between IDE, terminal, and app keeps context across tools. OpenAI today announced a new Mac app dedicated to working with its Codex AI coding agent. This is different from the general-purpose ChatGPT app that OpenAI has been shipping for a while. The new coding app is intended to be something of a command center, not only for directing coding agents, but also for managing multiple coding agents across projects and tasks that run for long periods of time. Also: I got 4 years of product development done in 4 days for $200, and I'm still stunned In a briefing I attended with OpenAI executives a few days before the launch, OpenAI cofounder and CEO Sam Altman said, "There's obviously been a huge shift in software agents over the last few months as the models have crossed a threshold of real utility. 5.2, in particular, is a model that many of us have found can do extremely complex things, and we realized we started to feel limited by the interface." The new Mac app is intended to help mitigate that limitation. OpenAI described Codex as expanding from single-agent coding operations to being at the core of multi-agent software lifecycles. Rather than pairing with a developer on a single edit, OpenAI sees programmers as coordinating teams of agents across all of the design, build, ship, and ongoing maintenance stages of work. (Disclosure: Ziff Davis, ZDNET's parent company, filed an April 2025 lawsuit against OpenAI, alleging it infringed Ziff Davis copyrights in training and operating its AI systems.) The latest version of Codex, GPT-5.2-Codex, only launched in mid-December. Yet, according to the company, Codex usage has nearly doubled since that time, with more than a million developers using it in the last month. I used it. In fact, with just the $20-per-month ChatGPT Plus plan, I found a mystery bug. Subsequently, I also used GPT-5.2-Codex on that same Plus plan to add two fairly significant features to my security product and ship a major update. Also: I used GPT-5.2-Codex to find a mystery bug and hosting nightmare - it was beyond fast In the briefing, Altman stated that GPT-5.2-Codex "is the fastest adopted model that we have ever made." He also reported on an extreme level of momentum using this tool, with usage growing more than 20 times since last August. In addition to independent developers like me, the company reported that major customers include Cisco, Ramp, Virgin Atlantic, Vanta, Duolingo, and Gap. The new app is designed to run multiple tasks in parallel, and help users supervise agents across longer end-to-end work. The company is also introducing skills, similar to those in Claude Code. These are meant to allow developers to define repeatable workflows like fetching logs and fixing tests, along with coding-adjacent tasks like summarizing threads and completing tickets. Right now, the app is Mac only. However, I expect that there will be a Windows app eventually. OpenAI shipped the ChatGPT app in Mac-only form, but some months later launched a Windows version. In addition to the app itself, OpenAI announced a new plan mode for Codex that allows for a read-only review (meaning the AI won't muck with your code) and selectable personalities. Personally, I've had just about enough personality from the human programmers I've managed, so I'd prefer a nice, personality-free personality in my coding agent. Recently, OpenAI also announced that Codex has an IDE extension for use in the JetBrains IDEs. Readers may recall that back in June I moved off of PhpStorm, my favorite JetBrains development environment. I moved to VS Code simply because the AI tools were more available for that environment. It's nice to see JetBrains IDE availability for those of us who prefer it over VS Code. Also: Stop using ChatGPT for everything: My go-to AI models for research, coding, and more (and which I avoid) During the briefing, I asked about the relationship between the various coding options now offered with Codex. We can code in our favorite IDE, in the terminal, and now in this new app. I wanted to know whether it would be possible to get the same richness of experience in the app that we have in the IDE, with our entire codebase in front of us. Sam told me, "I was astonished by this, but I did a fairly big project in a few days earlier this week and over the weekend." He said, "I did not open an IDE during the process. Not a single time. I did look at some code, but I was not doing it the old-fashioned way." He further told me, speaking about this level of coding ability from the AI, "I did not think that was going to be happening by now. I never had that experience before." I was given access to a preview build of the Mac app. I found I was easily able to work in my codebase. In fact, it knew about my most recent work in the IDE. It just automatically picked up from where I last left off. It seems that we'll be able to switch between whatever coding tools we want, and Codex will maintain context. Codex does have a cloud tool, which lives in GitHub. In fact, that's how Codex initially launched. When I first tested it in that context, I didn't like it. But once I started working with Codex on my own computer using the IDE extensions, I found that my productivity skyrocketed. The problem with running an AI on a local computer, especially with write access, is that it can be a security concern. In the briefing, one of OpenAI's Codex developers posed the worry from a design perspective, saying, "If the agent runs on the user's computer, how do we make that as safe and secure as possible?" Also: How to learn ChatGPT in an hour - for free The key is limiting sandbox access to only approved folders. In my case, that's my project folder. The tool also has permissioned network access and remembers approvals over time. The new Mac app adds a sandbox mode and lets developers set approval levels, including Untrusted, On failure, On request, and Never (meaning the app is never permitted to ask for elevated permissions). The OpenAI development team described its current coding agents as now reliable enough to let it push entire workflows into the new app. A number of the OpenAI developers said that, in just the last few weeks, they've started working almost entirely in the Mac app. They haven't even needed to use their IDEs or terminal windows. OpenAI currently makes Codex available with limited usage in the $20-per-month ChatGPT Plus plan. It offers much more extensive resource availability in its $200-per-month ChatGPT Pro plan. While I ran into roadblocks building new products (full days of coding) in the $20-per-month plan, I never ran into a roadblock when I was using the $200-per-month plan. Also: Is ChatGPT Plus still worth your $20? I compared it to the Free, Go, and Pro plans - here's my advice But companies have asked OpenAI for something more. It's not just about how many tokens can be allocated. For those companies, it's about speed of computation. You can wait quite a few minutes after each major AI command. OpenAI said that it's looking at creating higher compute-intensive, speed-intensive tiers for companies willing to pay for pure speed. These would allow for very long context coding work and include high-speed infrastructure options. To encourage users to give Codex a try, the company announced, "For a limited time we're including Codex with ChatGPT Free and Go, and we're doubling the rate limits on Plus, Pro, Business, Enterprise, and Edu plans." So feel free to give Codex a try. Also: 10 ChatGPT Codex secrets I only learned after 60 hours of pair programming with it What about you? Are you planning to try the new Codex app when it launches, or do you prefer sticking with an IDE extension like VS Code or JetBrains? Have you used GPT-5.2-Codex (or other coding agents) for real features and bug hunts yet? Did AI tools actually save you time, or did they create cleanup and tech-debt headaches later? Comment below.
[3]
OpenAI brings its Codex coding app to Mac, with new multi-agent abilities included
Since last spring, OpenAI has offered Codex. What started life as the company's response to Claude Code is becoming something more sophisticated with the release of a new dedicated macOS app. At its most basic form, Codex is a programming agent capable of writing code for users, but now it can also manage multiple AI assistants that can work together to complete more complex tasks. OpenAI gives an example of how this could work in practice. The company used Codex to create a Mario Kart-like racing game, complete with a selection of different playable cars, eight tracks and a collection of powerups players can use against the competition. For a single AI agent, generating a game from scratch, with all the needed visual assets, would be a tough ask, but Codex was able to complete the task because it could delegate the work of making the game to different models with complementary capabilities. For example, it turned to GPT Image for the visual assets, while a separate model simultaneously coded the web game. "It took on the roles of designer, game developer and QA tester to validate its work by actually playing the game," OpenAI says of the process. If that sounds complicated, OpenAI has tried to make it more approachable with a section of the app titled Skills. The feature bundles "instructions, resources, and scripts so Codex can reliably connect to tools, run workflows, and complete tasks according to your team's preferences," the company explains. "The Codex app includes a dedicated interface to create and manage skills. You can explicitly ask Codex to use specific skills, or let it automatically use them based on the task at hand." As you might imagine, Codex can also automate repetitive tasks. A dedicated Automations section of the app allows you to schedule tasks, which the software will complete in the background. "At OpenAI, we've been using Automations to handle the repetitive but important tasks, like daily issue triage, finding and summarizing CI failures, generating daily release briefs, checking for bugs, and more," the company said. The release of the Codex macOS app comes as AI startups explore what a group of AI agents working in parallel can accomplish. At the start of the year, Anysphere, the company behind Cursor, found it was possible to build a working web browser from scratch using such an approach, though it did encounter problems along the way. For a limited time, OpenAI is making Codex available to ChatGPT Free and Go users so they can see what's possible with this new software. At the same time, the company is doubling rates for Plus and Pro subscribers.
[4]
OpenAI launches standalone Codex app for Apple computers
The new app has a simple interface and is designed to serve as a "command center" that makes it easy for software developers to manage multiple AI agents at once, OpenAI said. An AI agent is a tool that can independently complete tasks, like writing code, on behalf of a user. AI coding assistants have exploded in popularity over the last year, and OpenAI said more than 1 million developers have used Codex in the past month. The company initially launched Codex in April, and it made the product generally available in October. The new Codex app is part of OpenAI's ongoing effort to lure users and market share away from rivals like Anthropic and Cursor, which have their own buzzy offerings for developers. OpenAI CEO Sam Altman said the Codex app is "the most loved internal product we've ever had" during a briefing with reporters on Friday. "It's been totally an amazing thing for us to be using recently at OpenAI," Altman said. "'I've been staying up late at night with excitement, building all sorts of things myself."
[5]
OpenAI launches a Codex desktop app for macOS to run multiple AI coding agents in parallel
OpenAI on Monday released a new desktop application for its Codex artificial intelligence coding system, a tool the company says transforms software development from a collaborative exercise with a single AI assistant into something more akin to managing a team of autonomous workers. The Codex app for macOS functions as what OpenAI executives describe as a "command center for agents," allowing developers to delegate multiple coding tasks simultaneously, automate repetitive work, and supervise AI systems that can run for up to 30 minutes independently before returning completed code. "This is the most loved internal product we've ever had," Sam Altman, OpenAI's chief executive, told VentureBeat in a press briefing ahead of Monday's launch. "It's been totally an amazing thing for us to be using recently at OpenAI." The release arrives at a pivotal moment for the enterprise AI market. According to a survey of 100 Global 2000 companies published last week by venture capital firm Andreessen Horowitz, 78% of enterprise CIOs now use OpenAI models in production, though competitors Anthropic and Google are gaining ground rapidly. Anthropic posted the largest share increase of any frontier lab since May 2025, growing 25% in enterprise penetration, with 44% of enterprises now using Anthropic in production. The timing of OpenAI's Codex app launch -- with its focus on professional software engineering workflows -- appears designed to defend the company's position in what has become the most contested segment of the AI market: coding tools. Why developers are abandoning their IDEs for AI agent management The Codex app introduces a fundamentally different approach to AI-assisted coding. While previous tools like GitHub Copilot focused on autocompleting lines of code in real-time, the new application enables developers to "effortlessly manage multiple agents at once, run work in parallel, and collaborate with agents over long-running tasks." Alexander Embiricos, the product lead for Codex, explained the evolution during the press briefing by tracing the product's lineage back to 2021, when OpenAI first introduced a model called Codex that powered GitHub Copilot. "Back then, people were using AI to write small chunks of code in their IDEs," Embiricos said. "GPT-5 in August last year was a big jump, and then 5.2 in December was another massive jump, where people started doing longer and longer tasks, asking models to do work end to end. So what we saw is that developers, instead of working closely with the model, pair coding, they started delegating entire features." The shift has been so profound that Altman said he recently completed a substantial coding project without ever opening a traditional integrated development environment. "I was astonished by this...I did this fairly big project in a few days earlier this week and over the weekend. I did not open an IDE during the process. Not a single time," Altman said. "I did look at some code, but I was not doing it the old-fashioned way, and I did not think that was going to be happening by now." How skills and automations extend AI coding beyond simple code generation The Codex app introduces several new capabilities designed to extend AI coding beyond writing lines of code. Chief among these are "Skills," which bundle instructions, resources, and scripts so that Codex can "reliably connect to tools, run workflows, and complete tasks according to your team's preferences." The app includes a dedicated interface for creating and managing skills, and users can explicitly invoke specific skills or allow the system to automatically select them based on the task at hand. OpenAI has published a library of skills for common workflows, including tools to fetch design context from Figma, manage projects in Linear, deploy web applications to cloud hosts like Cloudflare and Vercel, generate images using GPT Image, and create professional documents in PDF, spreadsheet, and Word formats. To demonstrate the system's capabilities, OpenAI asked Codex to build a racing game from a single prompt. Using an image generation skill and a web game development skill, Codex built the game by working independently using more than 7 million tokens with just one initial user prompt, taking on "the roles of designer, game developer, and QA tester to validate its work by actually playing the game." The company has also introduced "Automations," which allow developers to schedule Codex to work in the background on an automatic schedule. "When an Automation finishes, the results land in a review queue so you can jump back in and continue working if needed." Thibault Sottiaux, who leads the Codex team at OpenAI, described how the company uses these automations internally: "We've been using Automations to handle the repetitive but important tasks, like daily issue triage, finding and summarizing CI failures, generating daily release briefs, checking for bugs, and more." The app also includes built-in support for "worktrees," allowing multiple agents to work on the same repository without conflicts. "Each agent works on an isolated copy of your code, allowing you to explore different paths without needing to track how they impact your codebase." OpenAI battles Anthropic and Google for control of enterprise AI spending The launch comes as enterprise spending on AI coding tools accelerates dramatically. According to the Andreessen Horowitz survey, average enterprise AI spend on large language models has risen from approximately $4.5 million to $7 million over the last two years, with enterprises expecting growth of another 65% this year to approximately $11.6 million. Leadership in the enterprise AI market varies significantly by use case. OpenAI dominates "early, horizontal use cases like general purpose chatbots, enterprise knowledge management and customer support," while Anthropic leads in "software development and data analysis, where CIOs consistently cite rapid capability gains since the second half of 2024." When asked during the press briefing how Codex differentiates from Anthropic's Claude Code, which has been described as having its "ChatGPT moment," Sottiaux emphasized OpenAI's focus on model capability for long-running tasks. "One of the things that our models are extremely good at -- they really sit at the frontier of intelligence and doing reliable work for long periods of time," Sottiaux said. "This is also what we're optimizing this new surface to be very good at, so that you can start many parallel agents and coordinate them over long periods of time and not get lost." Altman added that while many tools can handle "vibe coding front ends," OpenAI's 5.2 model remains "the strongest model by far" for sophisticated work on complex systems. "Taking that level of model capability and putting it in an interface where you can do what Thibault was saying, we think is going to matter quite a bit," Altman said. "That's probably the, at least listening to users and sort of looking at the chatter on social that's that's the single biggest differentiator." The surprising satisfies on AI progress: how fast humans can type The philosophical underpinning of the Codex app reflects a view that OpenAI executives have been articulating for months: that human limitations -- not AI capabilities -- now constitute the primary constraint on productivity. In a December appearance on Lenny's Podcast, Embiricos described human typing speed as "the current underappreciated limiting factor" to achieving artificial general intelligence. The logic: if AI can perform complex coding tasks but humans can't write prompts or review outputs fast enough, progress stalls. The Codex app attempts to address this by enabling what the team calls an "abundance mindset" -- running multiple tasks in parallel rather than perfecting single requests. During the briefing, Embiricos described how power users at OpenAI work with the tool. "Last night, I was working on the app, and I was making a few changes, and all of these changes are able to run in parallel together. And I was just sort of going between them, managing them," Embiricos said. "Behind the scenes, all these tasks are running on something called gate work trees, which means that the agents are running independently, and you don't have to manage them." In the Sequoia Capital podcast "Training Data," Embiricos elaborated on this mindset shift: "The mindset that works really well for Codex is, like, kind of like this abundance mindset and, like, hey, let's try anything. Let's try anything even multiple times and see what works." He noted that when users run 20 or more tasks in a day or an hour, "they've probably understood basically how to use the tool." Building trust through sandboxes: how OpenAI secures autonomous coding agents OpenAI has built security measures into the Codex architecture from the ground up. The app uses "native, open-source and configurable system-level sandboxing," and by default, "Codex agents are limited to editing files in the folder or branch where they're working and using cached web search, then asking for permission to run commands that require elevated permissions like network access." Embiricos elaborated on the security approach during the briefing, noting that OpenAI has open-sourced its sandbox technology. "Codex has this sandbox that we're actually incredibly proud of, and it's open source, so you can go check it out," Embiricos said. The sandbox "basically ensures that when the agent is working on your computer, it can only make writes in a specific folder that you want it to make rights into, and it doesn't access network without information." The system also includes a granular permission model that allows users to configure persistent approvals for specific actions, avoiding the need to repeatedly authorize routine operations. "If the agent wants to do something and you find yourself annoyed that you're constantly having to approve it, instead of just saying, 'All right, you can do everything,' you can just say, 'Hey, remember this one thing -- I'm actually okay with you doing this going forward,'" Embiricos explained. Altman emphasized that the permission architecture signals a broader philosophy about AI safety in agentic systems. "I think this is going to be really important. I mean, it's been so clear to us using this, how much you want it to have control of your computer, and how much you need it," Altman said. "And the way the team built Codex such that you can sensibly limit what's happening and also pick the level of control you're comfortable with is important." He also acknowledged the dual-use nature of the technology. "We do expect to get to our internal cybersecurity high moment of our models very soon. We've been preparing for this. We've talked about our mitigation plan," Altman said. "A real thing for the world to contend with is going to be defending against a lot of capable cybersecurity threats using these models very quickly." The same capabilities that make Codex valuable for fixing bugs and refactoring code could, in the wrong hands, be used to discover vulnerabilities or write malicious software -- a tension that will only intensify as AI coding agents become more capable. From Android apps to research breakthroughs: how Codex transformed OpenAI's own operations Perhaps the most compelling evidence for Codex's capabilities comes from OpenAI's own use of the tool. Sottiaux described how the system has accelerated internal development. "A Sora Android app is an example of that where four engineers shipped in only 18 days internally, and then within the month we give access to the world," Sottiaux said. "I had never noticed such speed at this scale before." Beyond product development, Sottiaux described how Codex has become integral to OpenAI's research operations. "Codex is really involved in all parts of the research -- making new data sets, investigating its own screening runs," he said. "When I sit in meetings with researchers, they all send Codex off to do an investigation while we're having a chat, and then it will come back with useful information, and we're able to debug much faster." The tool has also begun contributing to its own development. "Codex also is starting to build itself," Sottiaux noted. "There's no screen within the Codex engineering team that doesn't have Codex running on multiple, six, eight, ten, tasks at a time." When asked whether this constitutes evidence of "recursive self-improvement" -- a concept that has long concerned AI safety researchers -- Sottiaux was measured in his response. "There is a human in the loop at all times," he said. "I wouldn't necessarily call it recursive self-improvement, a glimpse into the future there." Altman offered a more expansive view of the research implications. "There's two parts of what people talk about when they talk about automating research to a degree where you can imagine that happening," Altman said. "One is, can you write software, extremely complex infrastructure, software to run training jobs across hundreds of thousands of GPUs and babysit them. And the second is, can you come up with the new scientific ideas that make algorithms more efficient." He noted that OpenAI is "seeing early but promising signs on both of those." The end of technical debt? AI agents take on the work engineers hate most One of the more unexpected applications of Codex has been addressing technical debt -- the accumulated maintenance burden that plagues most software projects. Altman described how AI coding agents excel at the unglamorous work that human engineers typically avoid. "The kind of work that human engineers hate to do -- go refactor this, clean up this code base, rewrite this, write this test -- this is where the model doesn't care. The model will do anything, whether it's fun or not," Altman said. He reported that some infrastructure teams at OpenAI that "had sort of like, given up hope that you were ever really going to long term win the war against tech debt, are now like, we're going to win this, because the model is going to constantly be working behind us, making sure we have great test coverage, making sure that we refactor when we're supposed to." The observation speaks to a broader theme that emerged repeatedly during the briefing: AI coding agents don't experience the motivational fluctuations that affect human programmers. As Altman noted, a team member recently observed that "the hardest mental adjustment to make about working with these sort of like aI coding teammates, unlike a human, is the models just don't run out of dopamine. They keep trying. They don't run out of motivation. They don't get, you know, they don't lose energy when something's not working. They just keep going and, you know, they figure out how to get it done." What the Codex app costs and who can use it starting today The Codex app launches today on macOS and is available to anyone with a ChatGPT Plus, Pro, Business, Enterprise, or Edu subscription. Usage is included in ChatGPT subscriptions, with the option to purchase additional credits if needed. In a promotional push, OpenAI is temporarily making Codex available to ChatGPT Free and Go users "to help more people try agentic workflows." The company is also doubling rate limits for existing Codex users across all paid plans during this promotional period. The pricing strategy reflects OpenAI's determination to establish Codex as the default tool for AI-assisted development before competitors can gain further traction. More than a million developers have used Codex in the past month, and usage has nearly doubled since the launch of GPT-5.2-Codex in mid-December, building on more than 20x usage growth since August 2025. Customers using Codex include large enterprises like Cisco, Ramp, Virgin Atlantic, Vanta, Duolingo, and Gap, as well as startups like Harvey, Sierra, and Wonderful. Individual developers have also embraced the tool: Peter Steinberger, creator of OpenClaw, built the project entirely with Codex and reports that since fully switching to the tool, his productivity has roughly doubled across more than 82,000 GitHub contributions. OpenAI's ambitious roadmap: Windows support, cloud triggers, and continuous background agents OpenAI outlined an aggressive development roadmap for Codex. The company plans to make the app available on Windows, continue pushing "the frontier of model capabilities," and roll out faster inference. Within the app, OpenAI will "keep refining multi-agent workflows based on real-world feedback" and is "building out Automations with support for cloud-based triggers, so Codex can run continuously in the background -- not just when your computer is open." The company also announced a new "plan mode" feature that allows Codex to read through complex changes in read-only mode, then discuss with the user before executing. "This means that it lets you build a lot of confidence before, again, sending it to do a lot of work by itself, independently, in parallel to you," Embiricos explained. Additionally, OpenAI is introducing customizable personalities for Codex. "The default personality for Codex has been quite terse. A lot of people love it, but some people want something more engaging," Embiricos said. Users can access the new personalities using the /personality command. Altman also hinted at future integration with ChatGPT's broader ecosystem. "There will be all kinds of cool things we can do over time to connect people's ChatGPT accounts and leverage sort of all the history they've built up there," Altman said. Microsoft still dominates enterprise AI, but the window for disruption is open The Codex launch occurs as most enterprises have moved beyond single-vendor strategies. According to the Andreessen Horowitz survey, "81% now use three or more model families in testing or production, up from 68% less than a year ago." Despite the proliferation of AI coding tools, Microsoft continues to dominate enterprise adoption through its existing relationships. "Microsoft 365 Copilot leads enterprise chat though ChatGPT has closed the gap meaningfully," and "Github Copilot is still the coding leader for enterprises." The survey found that "65% of enterprises noted they preferred to go with incumbent solutions when available," citing trust, integration, and procurement simplicity. However, the survey also suggests significant opportunity for challengers: "Enterprises consistently say they value faster innovation, deeper AI focus, and greater flexibility paired with cutting edge capabilities that AI native startups bring." OpenAI appears to be positioning Codex as a bridge between these worlds. "Codex is built on a simple premise: everything is controlled by code," the company stated. "The better an agent is at reasoning about and producing code, the more capable it becomes across all forms of technical and knowledge work." The company's ambition extends beyond coding. "We've focused on making Codex the best coding agent, which has also laid the foundation for it to become a strong agent for a broad range of knowledge work tasks that extend beyond writing code." When asked whether AI coding tools could eventually move beyond early adopters to become mainstream, Altman suggested the transition may be closer than many expect. "Can it go from vibe coding to serious software engineering? That's what this is about," Altman said. "I think we are over the bar on that. I think this will be the way that most serious coders do their job -- and very rapidly from now." He then pivoted to an even bolder prediction: that code itself could become the universal interface for all computer-based work. "Code is a universal language to get computers to do what you want. And it's gotten so good that I think, very quickly, we can go not just from vibe coding silly apps but to doing all the non-coding knowledge work," Altman said. At the close of the briefing, Altman urged journalists to try the product themselves: "Please try the app. There's no way to get this across just by talking about it. It's a crazy amount of power." For developers who have spent careers learning to write code, the message was clear: the future belongs to those who learn to manage the machines that write it for them.
[6]
OpenAI launches Codex App to bring its coding models, used to build OpenClaw, to more users | Fortune
OpenAI's move follows rival Anthropic's debut of a similar application for its popular Claude coding assistant, called Code Cowork, in January. Like the new Codex App, Claude Cowork is designed for non-software developers to start using Claude to build AI agents that work across the software they already have installed on their computer. OpenAI's Codex tool has already been credited with some productivity gains. Independent developer Peter Steinberger, creator of the now viral AI tool OpenClaw (which spawned "social network for AI agents" Moltbook), said in a post on X that his productivity had roughly doubled since switching to OpenAI's Codex. Steinberger has said he built the entire OpenClaw tool with OpenAI's Codex, despite publicly saying Anthropic's Claude Opus is the "best general-purpose agent" and the model recommended for use in OpenClaw. Codex is also already being used within OpenAI to build new features and products. According to the company, a four-person engineering team built and shipped the Sora for Android app in just 28 days using Codex. With the Codex App, OpenAI appears to be trying to shore up users in the developer tools market. The company says that over a million developers have used Codex in the past month, including teams at startups like Harvey and Sierra as well as large enterprises like Cisco. But OpenAI is in an increasingly cutthroat fight with Anthropic and Google to be the platform on which software developers -- and increasingly others too -- build new software applications. Anthropic has reportedly been outpacing OpenAI in enterprise adoption -- though OpenAI disputes some of the data -- and has positioned itself as an enterprise AI company first and a consumer one second. Claude Code is now used by major companies including Uber, Netflix, Spotify, Salesforce, Accenture, and Snowflake, according to Anthropic. OpenAI is betting that developers want centralized control over agent workflows. The new app lets users run multiple AI agents at once across different projects, automate repetitive tasks, and monitor what the agents are doing. The company currently recommends using the GPT-5.2-Codex model for coding projects and GPT-5.2 for analysis and writing tasks, with adjustable thinking intensity and personality options.
Share
Share
Copy Link
OpenAI released a dedicated MacOS app for its Codex AI coding tool, transforming how developers work with AI agents. The new command center for agents allows software development teams to run multiple AI coding agents in parallel, automate workflows, and delegate complex tasks. With over 1 million developers using Codex in the past month, the app introduces Skills and Automations to handle everything from bug fixing to building complete applications autonomously.
OpenAI launched a new MacOS app for its Codex AI coding tool on Monday, marking a shift from single-agent assistance to managing teams of AI agents working simultaneously on software development projects
1
. The dedicated application functions as a command center for agents, allowing developers to delegate multiple coding tasks at once, run AI agents in parallel, and supervise autonomous systems that can work independently for up to 30 minutes5
. This represents OpenAI's most aggressive move yet to compete with rivals like Anthropic and Cursor in the increasingly competitive AI coding tool market4
.
Source: TechCrunch
The new app arrives less than two months after OpenAI released GPT-5.2-Codex in mid-December, its most powerful coding model to date
1
. According to Sam Altman, OpenAI's CEO, usage has grown more than 20 times since last August, with more than 1 million developers using Codex in the past month2
. Codex usage has nearly doubled since GPT-5.2-Codex launched, making it "the fastest adopted model that we have ever made," Altman stated during a press briefing2
.
Source: ZDNet
The Codex app introduces a fundamentally different approach to AI-assisted software development. Rather than pair programming with a single assistant, developers now coordinate teams of AI agents across design, build, ship, and maintenance stages
2
. The app is designed to work with multiple agents in parallel, integrating agent skills and state-of-the-art workflows that have become popular over the past year1
.
Source: VentureBeat
Alexander Embiricos, the product lead for Codex, explained that developers have shifted from writing small code chunks in their IDE to delegating entire features. "What we saw is that developers, instead of working closely with the model, pair programming, they started delegating entire features," Embiricos said
5
. Sam Altman shared his own experience, revealing he completed a substantial project over a weekend without opening an IDE once. "I did not open an IDE during the process. Not a single time," Altman told reporters, adding that he didn't expect this capability to exist yet2
.The app seamlessly switches between IDE, terminal, and app while maintaining context across tools
2
. This allows developers to work wherever they prefer, with Codex automatically picking up from where they last left off.The Codex app introduces two major features: Skills and Automations. Skills bundle instructions, resources, and scripts so Codex can reliably connect to tools, run workflows, and complete coding tasks according to team preferences
3
. The app includes a dedicated user interface to create and manage skills, with users able to explicitly invoke specific skills or let the system automatically select them based on the task3
.OpenAI has published a library of skills for common workflows, including tools to fetch design context from Figma, manage projects in Linear, deploy web applications to Cloudflare and Vercel, generate images using GPT Image, and create documents in PDF, spreadsheet, and Word formats
5
. To demonstrate multi-agent capabilities, OpenAI used Codex to build a Mario Kart-like racing game from a single prompt, complete with eight tracks, multiple playable cars, and powerups3
. The system delegated work to different models—GPT Image for visual assets while another coded the web game—taking on roles of designer, game developer, and QA tester while using over 7 million tokens5
.Automations allow developers to schedule Codex to work in the background on an automatic schedule, with results placed in a review queue
1
. Thibault Sottiaux, who leads the Codex team, described how OpenAI uses automations internally for daily issue triage, finding and summarizing CI failures, generating release briefs, and bug fixing5
. Users can also select different personalities for agents—from pragmatic to empathetic—depending on their working style1
.Related Stories
The launch positions OpenAI to compete more directly with Anthropic's Claude Code and Cowork apps, which have epitomized the trend toward agentic software development
1
. While GPT-5.2-Codex holds the top spot on TerminalBench for command-line programming tasks, agents from Gemini 3 and Claude Opus have logged roughly equivalent scores within the margin of error1
. Results from SWE-bench, which tests AI's ability to fix real-world software bugs, show no clear advantage for GPT-5.2, though agentic use cases remain difficult to benchmark effectively1
.According to an Andreessen Horowitz survey of 100 Global 2000 companies, 78% of enterprise CIOs now use OpenAI models in production, though Anthropic posted the largest share increase of any frontier lab since May 2025, growing 25% in enterprise penetration to reach 44% of enterprises
5
. Major customers using Codex include Cisco, Ramp, Virgin Atlantic, Vanta, Duolingo, and Gap2
.For a limited time, OpenAI is making the Codex app available to ChatGPT Free and Go users, while doubling rates for $20-per-month Plus and Pro subscribers
3
2
. While currently Mac-only, expectations point to a Windows version eventually following the same pattern as the ChatGPT app2
. Codex also now supports JetBrains IDEs through an extension, expanding beyond VS Code2
.Summarized by
Navi
1
Policy and Regulation

2
Technology

3
Technology
