© 2025 TheOutpost.AI All rights reserved
Curated by THEOUTPOST
On Sat, 25 Jan, 12:05 AM UTC
2 Sources
[1]
ChatGPT Operator isn't worth its $200-per-month subscription yet - here's why
This week, OpenAI is introducing a research preview called Operator. I initially wanted to do a hands-on, but once I found out that you need a Pro account (which costs $200 per month), I decided to watch the various OpenAI demos, share them with you, and then share my thoughts. Altman did say that users of the $20-per-month Plus plan would eventually be able to use Operator.
Operator is an AI agent. Fundamentally, it simulates keyboard input and mouse clicks in a browser, reading the screen and performing actions.
I have a fairly long history of building this kind of app, using mostly algorithmic programming along with a little machine learning to identify the location of certain images on the screen. My most recent project was an auto-posting tool that would make my social media posts for me. Yes, there are a plethora of subscription services that will do that for you, but I decided to see what it would take to build my own. My code used a combination of the DOM (document object model) for individual social media service pages, along with image recognizers that were able to find buttons (like the + or Post buttons).
I used the tool I built for about a year but ran into a very annoying snag. About every two weeks, one of the six sites I was navigating made a small change to the screen interface, which proceeded to break my code. So every two weeks, instead of posting my social media posts normally, I had to spend a few hours fixing whatever had broken. The fact that the web is constantly changing (for example, a blue "Post" button might turn into a red "Post / Subscribe at 30% off" button during a promotion) might knock the AI off its game.
The model OpenAI is using is called CUA, or Computer-Using Agent. This model dictates how Operator talks to the websites it's supposed to navigate.
In their introduction video, Sam Altman and OpenAI team members Yash Kumar, Casey Chu, and Reiichiro Nakano explained that Operator doesn't use APIs and isn't working off of extracted text pulled from the DOM. Instead, it's "viewing" an actual web page in a live browser running in the cloud, reading the context directly off the screen.
They were very clear that the control mechanism for the web pages was mouse and keyboard simulation, and the input that the AI reads is the visual representation of the actual web page that we see as humans. The OpenAI team did say that Operator will work just like a human using a web browser -- searching, clicking, and visiting websites.
But there is a contradiction that I haven't fully figured out yet, which is that OpenAI has partnered with a bunch of sites (Instacart, DoorDash, Etsy, OpenTable, Tripadvisor, AP, Priceline, StubHub, Thumbtack, Target, Uber, and more). What do these partnerships do for Operator? Are they affiliate deals where OpenAI gets a kickback on any sales? Do they have an agreement to let Operator know if the website format has changed? Did OpenAI do additional modeling for those sites? Does it have some level of API access to the data those sites display on the web?
Until we have a better understanding of those answers, we won't really know the scope of what Operator can do. All the demos shown were conducted using sites the company has partnered with, so it's not clear, for example, that it could go into ZDNET and construct a list of my last 10 articles and email that to me using Gmail.
Right now, I get the impression that Operator is fairly shallow in what it can accomplish. One demo, for example, was able to look up a recipe on one site and then populate an Instacart shopping cart with the ingredient list.
There were demos that showed making a restaurant reservation, buying tickets to a basketball game, and so on. Each of these was a one- or two-site process where data was found on one site and then applied to another.
OpenAI does appear to have given some serious consideration to issues of privacy and guardrails. For example, one demo showed the booking of four basketball tickets for a total of more than $1,000. It's unlikely any of us would feel comfortable just letting the AI go ahead and spend that kind of cash on our behalf unsupervised. Operator knows when to pause and ask for human intervention. Or at least, it's supposed to. It's still in beta, so it's possible that it could run amok, just because it's not quite finished.
But the key idea is simple: when the operations on a website are about to get sensitive (logging in, spending money, making reservations, checking out, etc.), Operator asks its human to confirm the operation. Additionally, the human user can take control of the cloud-based browser window. According to OpenAI, when the human is controlling the browser, it acts like a private session, and nothing that takes place while the human is in control is fed back to the AI. You can also opt out of allowing your website interactions to be used as training data for the AI.
Operator allows you to create site-specific custom instructions. In one example from OpenAI's video, the demonstrator wants to make sure that bookings on Priceline are fully refundable and have a free breakfast. By placing that custom instruction in the site's preferences, the AI agent will always consider that when performing a task on Priceline.
Additionally, Operator will allow you to save a task so you can rerun it or schedule it later. If you have a regular activity you'd like Operator to do for you, this is a quick way to ensure you can rerun your work when you want.
Operator feels very much like baby steps to me at this time. For example, I'd love to tell an AI to go through my inbox, find all the press releases, and assign them to one label (I'm using Gmail). Or find all the AI-related press releases and give them one label, while the rest of the press releases get another. This is both a complex task and one that's got quite a long runtime (I have 51,000 marketing pieces in my Promotions tab). As such, it's way beyond the scope of what Operator can do. But someday? Maybe.
I'm also trying to avoid the science fiction horror interpretation of all of this. There's a little part of my brain yelling, "They're letting the AI surf the Internet? Are they nuts?" And yeah, tools like Operator (and even all the AIs that are trained on the Internet as a whole) are probably opening doors to some really bad things, especially if we ever do create sentient AIs. But for now, it's an interesting exercise to see how well an AI succeeds at reading a recipe and ordering the ingredients from Instacart.
What do you think? When the price comes down to the $20-per-month range, do you see tasks you might assign to Operator? Does it worry you? Let us know your thoughts in the comments below.
[2]
Operator isn't worth its $200-per-month ChatGPT Pro subscription yet - here's why
OpenAI introduces Operator, an AI agent that automates web browsing tasks, but its $200 monthly subscription and limited capabilities raise questions about its current value and functionality.
OpenAI has introduced Operator, a new AI agent designed to automate web browsing tasks. Currently available as a research preview, Operator is accessible only to users with a $200-per-month Pro account, though OpenAI CEO Sam Altman has indicated that subscribers to the $20-per-month Plus plan will eventually gain access [1][2].
Operator functions as an AI agent that simulates keyboard input and mouse clicks in a browser, reading the screen and performing actions. Unlike traditional web automation tools, Operator doesn't rely on APIs or DOM extraction. Instead, it "views" live web pages in a cloud-based browser, interpreting the visual context directly from the screen [1][2].
The AI uses a model called CUA (Computer-Using Agent) to interact with websites. In OpenAI's introduction video, Sam Altman and team members Yash Kumar, Casey Chu, and Reiichiro Nakano emphasized that Operator mimics human browsing behavior by searching, clicking, and visiting websites [1][2].
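The observe-then-act cycle described above can be illustrated with a toy loop. This is only a sketch under stated assumptions, not OpenAI's implementation: `FakeBrowser`, `locate`, and `run_task` are hypothetical names, and the "screenshot" is a list of labeled positions rather than pixels so the example runs without a browser or a vision model.

```python
from dataclasses import dataclass, field


@dataclass
class FakeBrowser:
    """Hypothetical stand-in for the cloud browser: it exposes only a
    screenshot and simulated mouse/keyboard input, never the DOM."""
    elements: dict  # visible label -> (x, y) screen position
    log: list = field(default_factory=list)

    def screenshot(self):
        # A real agent would receive pixels; labeled positions keep the
        # sketch runnable without a browser.
        return [(label, x, y) for label, (x, y) in self.elements.items()]

    def click(self, x, y):
        self.log.append(("click", x, y))

    def type_text(self, text):
        self.log.append(("type", text))


def locate(screen, target):
    """Toy 'vision' step: first visible label that mentions the target."""
    for label, x, y in screen:
        if target.lower() in label.lower():
            return x, y
    return None


def run_task(browser, steps):
    """Observe, decide, act: take a fresh screenshot before every step."""
    for target, text in steps:
        position = locate(browser.screenshot(), target)
        if position is None:
            return False  # the agent cannot see what it needs, so it stops
        browser.click(*position)
        if text is not None:
            browser.type_text(text)
    return True


browser = FakeBrowser({"Search recipes": (120, 40), "Add to cart": (300, 500)})
ok = run_task(browser, [("search", "lasagna"), ("add to cart", None)])
print(ok, browser.log)
```

The point of the sketch is the interface boundary: the loop only ever sees what is on screen and only ever emits clicks and keystrokes, which is the property the OpenAI team stressed in its video.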
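Current demonstrations of Operator showcase relatively simple tasks: looking up a recipe and filling an Instacart cart with the ingredients, making a restaurant reservation, and buying tickets to a basketball game.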
These demos primarily involve one- or two-site processes, where data is found on one site and applied to another. This suggests that Operator's current capabilities may be somewhat limited [1][2].
OpenAI has partnered with several companies for Operator, including Instacart, DoorDash, Etsy, OpenTable, Tripadvisor, AP, Priceline, StubHub, Thumbtack, Target, and Uber. However, the nature and extent of these partnerships remain unclear, raising questions about potential affiliate deals, API access, or specialized modeling for partner sites [1][2].
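OpenAI has implemented several privacy and control features for Operator: it pauses and asks the user to confirm sensitive operations such as logging in, spending money, making reservations, or checking out; the user can take over the cloud-based browser in a private session whose activity is not fed back to the AI; users can opt out of having their website interactions used as training data; and site-specific custom instructions and saved tasks let users shape and rerun what the agent does [1][2].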
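The pause-and-confirm behavior shown in the demos can be sketched as a simple gate in front of each action. The names below (`SENSITIVE_KINDS`, `execute`, `run`) and the spending rule in the usage example are assumptions made for illustration; how Operator actually classifies an operation as sensitive has not been published.

```python
# Hypothetical action kinds, loosely based on the sensitive operations
# named in the demos: logging in, spending money, reserving, checking out.
SENSITIVE_KINDS = {"log_in", "purchase", "reserve", "check_out"}


def execute(action, confirm):
    """Carry out an action unless it is sensitive and the human declines.

    `confirm` is a callback standing in for the human in the loop.
    """
    if action["kind"] in SENSITIVE_KINDS and not confirm(action):
        return "skipped"
    return "executed"


def run(actions, confirm):
    return [(a["kind"], execute(a, confirm)) for a in actions]


def approve_under_500(action):
    # Stand-in for a person who approves anything costing less than $500.
    return action.get("total_usd", 0) < 500


results = run(
    [
        {"kind": "search"},
        {"kind": "purchase", "total_usd": 1000},  # e.g. four basketball tickets
        {"kind": "reserve"},
    ],
    approve_under_500,
)
print(results)
# [('search', 'executed'), ('purchase', 'skipped'), ('reserve', 'executed')]
```

Passing the confirmation step in as a callback keeps the decision with the human: the agent code never decides on its own that a sensitive action is acceptable.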
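Despite its potential, Operator faces several challenges: websites change frequently, which can trip up automation that depends on how a page looks; every demo shown used a partner site, so its behavior elsewhere is unproven; long, complex jobs such as labeling tens of thousands of emails are beyond its current scope; and access currently requires the $200-per-month Pro plan [1][2].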
The author of the source articles, who has experience building similar automation tools, expresses skepticism about Operator's current value. He notes that maintaining such tools is difficult because websites change frequently (his own auto-posting tool broke roughly every two weeks) and suggests that Operator may face similar issues [1][2].
While Operator represents an interesting development in AI-driven web automation, its current $200-per-month price tag and limited capabilities make it difficult to justify for most users. As the technology evolves and becomes more accessible, it may offer greater value in the future [1][2].