Delivering high-quality applications at scale is a constant challenge. Traditional test automation, while powerful, often struggles with dynamic user interfaces, flaky tests, and time-consuming script maintenance.
This blog explores how generative AI (GenAI) and Playwright MCP (Model Context Protocol) work together to streamline QA processes, boost efficiency, and empower testers to focus on strategic tasks.
Large language models (LLMs) like ChatGPT, Gemini, Claude, and DeepSeek are powerful tools that can process complex queries, generate code, write emails, and even simulate conversations -- all using natural language. But there's a catch:
LLMs can think, but they can't act.
LLMs are designed to understand and generate human-like text, but they lack the ability to directly interact with external resources.
LLMs handle the "thinking" (e.g., generating prompts, code, or logic).
MCPs handle the "doing" (e.g., executing actions, connecting to resources, and automating workflows).
The ability to interact with the web programmatically is becoming increasingly crucial. This is where GenAI steps in: by leveraging large language models (LLMs) like Claude or custom AI frameworks, it introduces intelligence into test automation, enabling natural-language test creation, self-healing scripts, and dynamic adaptability.
The bridge that makes this synergy possible is the Model Context Protocol (MCP), a standardized interface that connects GenAI's cognitive power with Playwright's automation prowess.
MCPs bridge the gap between LLMs and real-world applications by providing a framework to integrate multiple components, including browsers, databases, APIs, and more. Unlike LLMs, MCPs are designed to orchestrate complex workflows that involve external resources.
For instance, an MCP server can let an LLM query a database, call an API, or drive a browser session -- actions the model cannot perform on its own.
The Model Context Protocol (MCP) is an open-source protocol developed by Anthropic that gives large language models (like Claude) a consistent way to interact with external systems such as databases, APIs, or tools.
By standardizing this communication, MCP ensures that LLMs can seamlessly integrate with diverse external resources without requiring custom solutions for each combination of model and system.
At its core, MCP follows a client-server architecture: a host application (such as Claude Desktop or an IDE) runs one MCP client per connection, and each client talks to an MCP server that exposes tools, data sources, or services to the model. A single host can connect to multiple servers at once.
Let's look at how MCP works in practice, using Claude Desktop as the host application.
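As a concrete example, registering a server with Claude Desktop is a matter of adding an entry to its claude_desktop_config.json. The snippet below is a minimal sketch that registers the Playwright MCP server discussed later in this blog; check the server's README for the current package name and options.

```json
{
  "mcpServers": {
    "playwright": {
      "command": "npx",
      "args": ["@playwright/mcp@latest"]
    }
  }
}
```

Once Claude Desktop restarts with this configuration, the model can call the tools the server exposes instead of merely describing what it would do.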
Well-known MCP servers include reference implementations for file systems, GitHub, Slack, and databases such as PostgreSQL -- and, most relevant to this blog, the Playwright MCP server.
Playwright MCP is a server that acts as a bridge between large language models (LLMs) or other agents and Playwright-managed browsers. It enables structured command execution, allowing AI to control web interactions like navigation, form filling, or assertions. What sets MCP apart is its reliance on the browser's accessibility tree -- a semantic, hierarchical representation of UI elements -- rather than screenshot-based visual interpretation.
In Snapshot Mode, MCP provides real-time accessibility snapshots, detailing roles (e.g., button), labels (e.g., "Submit"), and states (e.g., disabled). This approach is lightweight and precise, unlike Vision Mode, which uses screenshots for custom UIs but is slower and less reliable. By prioritizing the accessibility tree, MCP delivers unparalleled speed, reliability, and resource efficiency.
The Accessibility Tree is how assistive technologies "see" your web application. It includes information about each element, such as its role (e.g., button, textbox), its accessible name or label (e.g., "Submit"), its state (e.g., disabled, checked, expanded), and its current value.
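To make that concrete, here is a simplified, illustrative sketch of the kind of snapshot Snapshot Mode works from for a small login form (the exact output format differs, but the role/name/state structure is the point):

```yaml
# Simplified, illustrative accessibility snapshot of a login form
- heading "Sign in" [level=1]
- textbox "Email"
- textbox "Password"
- checkbox "Remember me" [checked]
- button "Sign in" [disabled]
```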
In the context of Playwright, MCP acts as a server that sits between the AI model and the browser, translating high-level test instructions into executable scripts while handling complexities like dynamic UIs or cross-browser nuances.
Combining GenAI with Playwright MCP unlocks a new paradigm for test automation, addressing pain points that have long plagued QA teams.
Here's how:
Imagine writing test cases in plain English without touching a line of code. With GenAI and MCP, testers can describe scenarios like, "Navigate to the login page, enter valid credentials, and verify the dashboard loads." The AI interprets this via MCP, generating Playwright scripts like:
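A sketch of what the generated test might look like -- the URL, labels, and credentials below are placeholders for your own application:

```typescript
import { test, expect } from '@playwright/test';

test('valid login shows the dashboard', async ({ page }) => {
  // Navigate to the login page (placeholder URL).
  await page.goto('https://example.com/login');

  // Enter valid credentials (placeholder values).
  await page.getByLabel('Username').fill('standard_user');
  await page.getByLabel('Password').fill('s3cret!');
  await page.getByRole('button', { name: 'Log in' }).click();

  // Verify the dashboard loads.
  await expect(page.getByRole('heading', { name: 'Dashboard' })).toBeVisible();
});
```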
UI changes -- like a renamed button or updated selector -- are a leading cause of test failures. GenAI, powered by MCP, analyzes the DOM in real-time and adapts scripts to these changes. For example, if a button's ID changes from submit-btn to login-btn, the AI detects the new context and updates the script, saving hours of manual maintenance.
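A minimal sketch of the outcome, with illustrative selectors: the regenerated step trades the brittle ID for a locator derived from the accessibility tree.

```typescript
import type { Page } from '@playwright/test';

// Before: breaks the moment the id changes from "submit-btn" to "login-btn".
async function submitBrittle(page: Page) {
  await page.locator('#submit-btn').click();
}

// After regeneration: keys off the role and visible label exposed by the
// accessibility tree, so it survives id and class renames.
async function submitResilient(page: Page) {
  await page.getByRole('button', { name: 'Log in' }).click();
}
```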
Modern applications often behave differently based on user context (e.g., logged-in vs. anonymous users). MCP enables GenAI to understand these variations and adjust test flows dynamically, ensuring comprehensive coverage without redundant scripts.
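A sketch of how that can look in Playwright terms, assuming a previously saved logged-in storage state at the placeholder path auth.json:

```typescript
import { test, expect } from '@playwright/test';

// The same journey exercised for two user contexts (URLs and labels are placeholders).
test.describe('pricing page - anonymous user', () => {
  test('shows the sign-up call to action', async ({ page }) => {
    await page.goto('https://example.com/pricing');
    await expect(page.getByRole('link', { name: 'Sign up' })).toBeVisible();
  });
});

test.describe('pricing page - logged-in user', () => {
  // Reuse a saved authenticated session instead of repeating the login steps.
  test.use({ storageState: 'auth.json' });

  test('shows the upgrade call to action', async ({ page }) => {
    await page.goto('https://example.com/pricing');
    await expect(page.getByRole('button', { name: 'Upgrade' })).toBeVisible();
  });
});
```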
GenAI can analyze an application's behavior and suggest test cases for edge cases or failure-prone areas. For instance, it might propose testing a form's error handling for invalid inputs, which MCP then converts into Playwright tests. This reduces the time to achieve high test coverage.
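For example, a suggested negative test for a sign-up form's e-mail validation might come back as something like this (URL, field labels, and error text are placeholders):

```typescript
import { test, expect } from '@playwright/test';

test('shows a validation error for an invalid e-mail address', async ({ page }) => {
  await page.goto('https://example.com/signup');

  // Submit an obviously invalid value for the e-mail field.
  await page.getByLabel('Email').fill('not-an-email');
  await page.getByRole('button', { name: 'Sign up' }).click();

  // The form should surface a validation message instead of proceeding.
  await expect(page.getByText('Please enter a valid email address')).toBeVisible();
});
```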
Playwright MCP integrates seamlessly with CI/CD pipelines (e.g., GitHub Actions, Jenkins) and tools like Claude Desktop or Cursor IDE. Projects like https://github.com/microsoft/playwright-mcp (the official Playwright MCP server) and the community tooling around it further enhance its capabilities, supporting API testing and containerized environments.
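A minimal sketch of a GitHub Actions job that runs the generated Playwright tests on every push (scripts and versions are assumptions to adapt to your project):

```yaml
name: e2e
on: [push]
jobs:
  test:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-node@v4
        with:
          node-version: 20
      - run: npm ci
      - run: npx playwright install --with-deps
      - run: npx playwright test
```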
To harness Playwright MCP's potential, you need to configure it within VS Code, allowing AI agents to communicate with Playwright-managed browsers. Below are two straightforward methods to install and configure MCP.
The fastest way to get started is by registering the Playwright MCP server through VS Code's terminal. This method is platform-agnostic and works for both stable and Insiders editions of VS Code.
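At the time of writing, that registration is a single CLI command; run code --help to confirm the --add-mcp flag exists in your build.

```bash
# Register the Playwright MCP server with VS Code (stable)
code --add-mcp '{"name":"playwright","command":"npx","args":["@playwright/mcp@latest"]}'

# Same registration for VS Code Insiders
code-insiders --add-mcp '{"name":"playwright","command":"npx","args":["@playwright/mcp@latest"]}'
```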
For more control or to tailor the setup, you can manually configure Playwright MCP in VS Code's settings.json file. This method is ideal for adding custom arguments or integrating with specific workflows.
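A minimal sketch of that configuration follows. The exact schema has shifted across VS Code releases (newer builds prefer a workspace-level .vscode/mcp.json), and the --headless argument is just one example of a custom option, so verify the keys against your version's documentation.

```jsonc
// settings.json (illustrative; verify the current schema for your VS Code version)
{
  "mcp": {
    "servers": {
      "playwright": {
        "command": "npx",
        "args": ["@playwright/mcp@latest", "--headless"]
      }
    }
  }
}
```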
The Playwright MCP server exposes a set of tools the model can call -- navigating to a URL, clicking, typing into fields, capturing accessibility snapshots, taking screenshots, and so on.
Once the Claude Desktop setup is complete, these tools become visible inside Claude Desktop.
Cursor, an AI-powered IDE, uses Playwright MCP to enhance test automation and UI development by providing real-time browser context to its Composer feature.
Let's take a simple example and execute it with the help of Claude Desktop.
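For instance, a prompt along these lines -- purely illustrative, substitute your own application's URL and credentials -- asks Claude to drive the browser through the Playwright tools:

```text
Using the Playwright browser tools:
1. Open https://example.com/login
2. Fill in the username and password fields with the test credentials
3. Click the "Log in" button
4. Confirm that the dashboard page is displayed and report the result
```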
Type the instructions above into Claude Desktop and execute them.