

2 Sources
2 Sources
[1]

OpenAI's Atlas ushers in the era of AI browsing. Here's what it means for media
The Atlas-vs.-Comet fight may be moot, though, since Google Chrome is the incumbent browser for most people (it has 74% market share worldwide), and it has AI features, too. Chrome's large user base, however, also means Google can't move as fast: Since the whole idea of AI agents taking control of your browser to perform tasks is fraught with security concerns, Google's Gemini assistant in Chrome is relatively feeble; if you ask it to, say, shop for you on Amazon, it'll give you the digital equivalent of a shrug. So Chrome's continued dominance in the AI era isn't assured. But the question of who will win the AI browser war doesn't matter so much as whether AI browsing will take off at all. I've been using Comet heavily for a few months, and although I find the idea of an agent doing all my tedious internet tasks compelling, I've found the actual set of things it can do to be quite narrow. Generally, the task needs to be something that doesn't require a lot of specialized context (since the AI can't read your mind) or complex prompting (since spending several minutes crafting a prompt is time you could use to just do the task yourself). Nonetheless, OpenAI imagines a future where most of the activity online is done via AI agents in browsers like Atlas. In its announcement, it says, "This launch marks a step toward a future where most web use happens through agentic systems -- where you can delegate the routine and stay focused on what matters most." OpenAI could be right. Those narrow use cases for agentic browsing could be expanded greatly with more elegant and comprehensive merging of personal context and the browsing experience. If the agent understands the entire background of what you're doing -- the why -- and gets better at navigating the web (as it inevitably will), AI browsing might even burst through to the mainstream.
[2]

The AI Browser Power Play
Earlier this month, OpenAI announced ChatGPT Atlas, "a new web browser built with ChatGPT at its core," which it is promoting to the service's hundreds of millions of users. Unusual for OpenAI, which has a history of rushing to beat competitors to market, Atlas launched into a crowded marketplace for AI-assisted browsers. Heavyweights Google and Microsoft are already integrating chatbots and other LLM features into Chrome and Edge. Opera has been building AI features into its apps since 2023, while AI search-engine Perplexity released its own browser, Comet, promoting "agentic" features earlier this year. In late 2024, the Browser Company, the maker of Arc, announced plans to shift focus to a new AI-centric browser called Dia. Even Firefox has chatbot integration now, leaving Apple's Safari, with its few scattered Apple Intelligence features, as the last major browser without a new chat interface growing in or on top of it. Each of these browsers has a different take on what it means to integrate -- or build a browser around -- generative AI, but a few common features have emerged. You've got easily accessible summarization, editing, and lookup capabilities, of course, as well as different abilities to (borrowing Dia's words) "chat with your tabs." In interface terms, they mostly seem to have settled on a right-hand sidebar for chatting, which may end up being this era's version of the "omnibox," or tabbed browsing, and which can accommodate the wide range of relationships people have developed with services like ChatGPT, from occasional search replacement or word generator to consultative companion to a layer through which their entire digital world is interpreted. Finally, in architectural terms, we're mostly talking about browsers built on top of Chromium, the Google-supported open-source project. If there are major features in Atlas that weren't already teased or tested in one of these browsers, I haven't had much luck finding them. In the demo above, even Sam Altman struggled to muster enthusiasm as his employees showed off text summarization, web search, and the automatic chat sidebar. Most of the time, it feels, at least in its current state, like a browser made by OpenAI staffers who were tired of opening a competitor's product to use their own. In regular usage, it's less of a new take on browsing than it is a fuller interface for using ChatGPT. That makes sense: OpenAI is attempting to do in 2025 something like what Google did with Chrome's launch in 2008, guiding users into a relationship with the company deeper than a single product, and much harder to leave. In doing so, it's also targeting Google more directly than ever. When you open ChatGPT in Chrome (or any other browser), you'll now get a prompt to try Atlas. When you open Atlas, you'll see an interface that's far Googlier than your typical ChatGPT window, with default tabs for Search, Images, Videos, and News. In its launch demo, ChatGPT talked about how popular "search" behaviors are for ChatGPT users, and spent some time showing how the browser can be used to work within Google docs. It wasn't subtle. They're coming for Chrome. This is, again, pretty straightforward tech-firm behavior with plenty of precedent: A company with a major web-based product and ambitious omnidirectional plans wants to own its distribution. This sort of thinking isn't just how Google ended up with a browser but also with a mobile operating system; it has inspired less successful adventures, too, like Facebook's failed attempt to make a phone. As a solid browser with a few perks for heavy ChatGPT users, Atlas would be yet another example of OpenAI's transition from a slippery, speculative nonprofit to a more conventional sort of major consumer and enterprise tech company -- from a firm that talks about ASI to a firm that also has lots of KPIs -- except for one thing: Agent Mode. OpenAI's announcement leaned heavily on Agent Mode, which the company says can help with "researching and analyzing, automating tasks, and planning events or booking appointments while you browse," and with good reason: It's pretty wild to watch. Like previous "computer use" features, including OpenAI's Operator and ChatGPT Agent, Atlas's Agent Mode effectively takes control of the browser interface, navigating pages, moving a mouse cursor, filling out forms, and narrating the entire process in a sidebar. As a demo, it's powerful: Look, the computer is using itself! When I asked it to buy me some wool socks in a certain size and color, it navigated to REI's website and added something pretty close to what I asked for to the cart before asking me to log in; same for a specific type of floss, although it took the liberty of upping the order to four boxes to meet Walmart's free-shipping threshold. When I asked it to find a flight for this weekend, it spent a few minutes browsing through travel aggregators, describing its strange process in a sidebar in deep detail: "I need to consider the best way to handle the flight search and booking, ensuring I find the most relevant and convenient options for the user. Let's get this just right," it printed before creating a Python script to figure out what day it was and calculate what I meant by "this coming Saturday." It got there in the end, sort of, in that it found a flight that matched my request. Neat! And a task I will not be assigning it for real. As a performance, computer use is impressive. For many Atlas users, it will also be their first encounter with an agentic tool that can plausibly execute multistep tasks on their behalf. (Programmers will be more familiar with the sensation if they've used AI coding assistants, and anyone who has used Perplexity's Comet or Opera's Neon will have tried something nearly identical already.) As a feature, it still sits somewhere in AI limbo, where the suggestion that such a tool could one day be useful -- if it were faster, smarter, knew more about you, etc. -- remains far more powerful than the tool itself. In other words, like a lot of AI products and companies, Agent Mode benefits from the tailwind of belief in inevitable, continual, exponential improvement. For now, it's something many users will test, play with, and perhaps incorporate into their general models of the future, if not their actual workflow. (OpenAI currently describes it as a "preview" feature, and it's only available to paid users.) Browser Company founder Josh Miller, who experimented early on with similar features in Dia, recalled agentic browsing feeling initially dazzling but eventually like a bit of a dead end, or at the very least a strange and cumbersome way to interact with the web and AI. "Training an AI model to click around a computer like a human is akin to putting a print magazine on the early web," he told me. "It feels futuristic, but it's unimaginative and not native to the technology." As I spent time watching ChatGPT bonk its way through various web interfaces, I also found myself thinking of self-driving cars. A browser that pretends to be a person at the input level -- moving a cursor, scrolling a human GUI -- felt less like a Waymo, in which an unattended steering wheel turns as a result of actions taken by systems closer to the road, than a regular car with a humanoid robot sitting in the driver's seat. Again, it's pretty interesting to watch! But it also makes you wonder: Why are we doing this that way? Aren't there better ways for machines to talk to each other? The answer, Miller suggests, comes down to the "incentives of their makers more than the intrinsic value of the technology." (The Browser Company was recently acquired by software-maker Atlassian.) For OpenAI, building systems that can execute complex commands on behalf of users is the whole ball game -- it's the path to wide-ranging automation and/or AGI, depending on which definition of the term the company is going with that day. An AI model that can provide useful information in a chat window, or handle tasks clearly outlined by the user, is a useful product, but the prospect of an AI model that can productively and proactively interact with the world around it in ways comparable to a human -- or, more specifically, an employee -- is where trillion-dollar valuations come from. To get where it wants to go, though, OpenAI has a number of challenges. Some are widely discussed and frustratingly hard to pin down, revolving around benchmarks, varying definitions of model capability, and predictions about scaling. Others are more banal: To answer questions more usefully, for example, chatbots tend to do better if they have more data about users; likewise, to execute tasks on their behalf, they need to operate in an environment where users are logged in to the various services they use to work and live their lives. They need a breathtaking amount of access and permission, in other words. ChatGPT isolated in a chat window doesn't have that, and it takes a long time to draw out of users, if they're willing to offer it at all. ChatGPT as a default browser -- authenticated in dozens of different sites, payment methods at the ready, or perhaps even logged into a work environment -- does. (Such agents also create, as many in the AI space have pointed out, a potential security nightmare.) That a given service or piece of software could be more powerful with access to all the other software you use is sort of a tech truism -- look no further than the thousands of times you've been asked by one company for "permission" or "access" to data from another for evidence. But it is usually managed through official partnerships and software APIs. By installing itself at the browser level, OpenAI is looking for the mother of all loopholes: That's where people who use computers, particularly for work, spend all their time, and through which vast quantities of valuable information flow in and out. Also, if you're a company hoping to train your models to replicate a bunch of white-collar work, millions of browser sessions would be a pretty valuable source of data. It would be amazing for OpenAI, in other words, if lots of people found Agent Mode useful and used it at scale, but this is far from a sure thing; likewise, it would depend on a bunch of other companies tolerating the gradual botification of their users without taking the sorts of countermeasures that, say, Ticketmaster does against automated snipers (also, you can't make money showing ads to a ChatGPT agent!). Instead, I think, Atlas can be understood as a statement of intention, or as a preemptive power play. Before launching its browser, OpenAI announced that it was partnering with a number of companies to build their services directly into ChatGPT, so users could, for example, use the chatbot to create a playlist that would then show up in Spotify, or transition from a chat session directly into a productivity tool like Canva. As a way of working with chatbots, it was far less clunky than the self-driving browser. But it also requires willing partners, and the lineup at launch was fairly thin, consisting largely of companies with nothing to fear from OpenAI, little to lose, or that are already major OpenAI customers. In OpenAI's ideal world -- much like Google's, Meta's, and Apple's, to be fair -- every other tech product exists as a subordinate app inside its own platform, letting ChatGPT users seamlessly navigate their digital lives through the company's interface, its agentic capabilities unencumbered by security concerns, privacy, competition, or interfaces designed for humans rather than machines. Plenty of companies will be eager to partner with a popular service like ChatGPT. Others -- particularly those who see OpenAI, or AI in general, as a potential threat to their businesses or their proprietary data as a protective moat -- will be more cautious. OpenAI's sources of leverage are its large user base and its narrative of inevitability. Agentic browsing is an attempt to fuse the two: A feature that may soon be available to and used by lots of people, that may soon be better than it is. If building an ecosystem for other companies is a polite invitation into such a future, in other words, Atlas functions as something more like a threat, setting up an easy-way-or-hard-way scenario for potential partners or competitors: If you'd rather not work with OpenAI for now, fine. In the meantime, it will be trying to take over your relationship with your users or customers, deceptively and without your consent. Capisce?
Share
Share
Copy Link
OpenAI launches ChatGPT Atlas, a new AI-powered browser featuring Agent Mode capabilities, joining a competitive landscape of AI-enhanced browsers from Google, Microsoft, and others as companies race to control web distribution.

OpenAI has officially launched ChatGPT Atlas, a new web browser built with ChatGPT at its core, marking the company's entry into an increasingly competitive landscape of AI-powered browsing experiences
2
. The announcement comes as major tech companies race to integrate artificial intelligence capabilities into web browsers, fundamentally reshaping how users interact with the internet.Atlas joins a crowded marketplace where heavyweights Google and Microsoft are already integrating chatbots and large language model features into Chrome and Edge respectively
2
. Opera has been building AI features since 2023, while AI search engine Perplexity released its own browser called Comet with "agentic" features earlier this year. The Browser Company, maker of Arc, has announced plans for a new AI-centric browser called Dia, and even Firefox now includes chatbot integration.What distinguishes Atlas from its competitors is its Agent Mode capability, which OpenAI heavily emphasized in its announcement. This feature can help with "researching and analyzing, automating tasks, and planning events or booking appointments while you browse"
2
. Agent Mode effectively takes control of the browser interface, navigating pages, moving the mouse cursor, filling out forms, and narrating the entire process in a sidebar.Demonstrations show the system's impressive capabilities: when asked to purchase wool socks in specific sizes and colors, it navigated to REI's website and added appropriate items to the cart before requesting user login credentials. Similarly, when tasked with finding specific dental floss, it browsed to Walmart's website and even made the decision to increase the order quantity to meet free shipping thresholds
2
.Despite the innovation, Atlas faces significant challenges in a market dominated by Google Chrome, which holds 74% of global market share
1
. However, Chrome's massive user base also constrains Google's ability to move quickly with AI features due to security concerns. Google's Gemini assistant in Chrome remains relatively limited compared to more aggressive AI implementations.The competitive landscape reveals different approaches to AI integration. Most browsers have settled on common features including easily accessible summarization, editing, and lookup capabilities, along with the ability to "chat with your tabs." Interface-wise, many have adopted a right-hand sidebar for chatting, which may become this era's equivalent of the omnibox or tabbed browsing
2
.Related Stories
Atlas represents OpenAI's attempt to replicate what Google accomplished with Chrome's 2008 launch - guiding users into a deeper relationship with the company that extends beyond a single product
2
. The browser directly targets Google with features like default tabs for Search, Images, Videos, and News, and integration capabilities with Google Docs.This move reflects OpenAI's transformation from a speculative nonprofit to a conventional major consumer and enterprise technology company. The company envisions a future where "most web use happens through agentic systems" where users can "delegate the routine and stay focused on what matters most"
1
.Despite the promising technology, current AI browsing capabilities remain narrow in scope. Tasks generally need to be straightforward without requiring specialized context or complex prompting, since spending time crafting detailed prompts often negates the time-saving benefits
1
.However, these limitations could be overcome through more elegant integration of personal context with the browsing experience. If AI agents can better understand the complete background and purpose behind user tasks, and continue improving at web navigation, AI browsing could potentially achieve mainstream adoption
1
.Summarized by

Navi
[2]