Software Development

Leveraging Generative AI in software testing

Yashaswi Saraswat • Aug 8, 2024

In the world of technology, innovation is the lifeblood of progress. As a tech leader collaborating with some of the industry’s most brilliant minds, Aubergine Solutions has had the privilege of witnessing and driving the transformative power of AI in various field.

Today, I want to share our journey of integrating generative AI into software testing, a breakthrough that’s redefining the landscape of software development.

The spark of innovation: Aubergine AI lab’s approach to software testing

At Aubergine AI Lab, our mission to transform groundbreaking ideas into cutting-edge AI solutions led us to a game-changing revelation in software testing.

During an intensive brainstorming session exploring ways to enhance software development processes, we conceived the idea of applying generative AI to testing. This wasn’t just about automating existing processes; we envisioned an AI system that could think like a tester, anticipating edge cases, generating comprehensive test scenarios, and predicting likely bug locations.

This vision perfectly aligned with our core mission of developing AI projects that drive scalability and impact. Leveraging our deep expertise in AI development, we rapidly moved from concept to prototype, pushing the boundaries of what’s possible in quality assurance.

The breakthrough came swiftly and decisively. Within weeks, our prototype AI generated a set of test cases that uncovered a critical bug in a client’s software, one that had eluded human testers for months.

This moment wasn’t just a proof of concept; it marked the beginning of a completely new, more optimal software testing process.

The benefits of generative AI in software testing

Our success with the prototype spurred us to refine and expand our AI testing system. We experimented with advanced generative AI testing models, including GPT-4 and our proprietary algorithms, training them on vast datasets of historical test cases, bug reports, and code repositories.

The result is a system that doesn’t just assist testers, it actively predicts and identifies potential issues before they manifest. Here’s how our AI-driven approach is transforming key areas of our software testing process.

Crafting Comprehensive Test Cases
The AI system excels in generating test cases across various formats, from traditional step-by-step instructions to Gherkin scenarios for Behavior Driven Development. This versatility ensures comprehensive coverage across different testing methodologies, adapting to the unique needs of each project.
Uncovering Edge Cases
One of the most impressive capabilities of AI powered testing is its ability to identify edge cases that often elude human testers. By analyzing patterns in codebases and historical data, the AI consistently pinpoints potential failure points, significantly enhancing the robustness of software applications.
Generating Detailed Bug Reports
Gone are the days of vague, hastily written bug reports. AI can now produce detailed, structured reports complete with reproduction steps, expected vs. actual results, and even suggestions for potential fixes. This level of detail dramatically reduces the time developers spend on bug triage and resolution.
Streamlining the Testing Process

The integration of AI has revolutionized the daily workflow of our QA department:

The AI analyzes overnight code commits and generates a prioritized list of areas requiring focused testing.
Testers review and refine AI-generated test cases, initiating automated test runs.
As results come in, the AI correlates failures with historical data, providing context and potential root causes.
For identified bugs, the AI drafts comprehensive reports, which testers enhance with their insights.
The system generates predictive analytics on test coverage, guiding the next day’s testing strategy.

This streamlined process has cut our testing cycle time by 40%, allowing for faster product updates and feature releases.

Generative AI testing: real-world impact

Integrating generative AI in software testing process at Aubergine has yielded transformative results. Examining the key metrics before and after implementation shows the impact across various facets of our testing workflow. Here’s a detailed look at the improvements we’ve observed.

Uncovering Hidden Bugs

Before integrating generative AI, our team typically discovered around 5 critical bugs each month. Manual testing, while thorough, often missed deeper issues.

With generative AI, we’re now uncovering about 15 critical bugs monthly. One memorable instance involved a persistent bug that evaded detection for months, which the AI pointed within days after deployment. This was a game-changer, highlighting how AI can delve into complexities that might slip past human eyes.

AI generated test cases in different formats

One of the most time-consuming tasks for testers is writing comprehensive test cases. Our AI-driven approach significantly alleviates this burden by creating test cases in various formats, such as Gherkin and classic methods, for different types of testing including regression, functional, UI, and performance testing.

Example: Let’s consider a test case for verifying the login functionality with valid credentials.

Classic Format

Title: Verify login functionality with valid credentials
Preconditions: User is on the login page

Steps:

Enter valid username
Enter valid password
Click on the “Login” button

Expected Result: User is redirected to the dashboard

Gherkin Format

Feature: Login functionality
Scenario: Successful login with valid credentials
Given the user is on the login page
When the user enters a valid username and password
And clicks on the “Login” button
Then the user should be redirected to the dashboard.

Accelerating Testing Cycles

Time is of the essence in software development. Our traditional testing cycles spanned around 10 days, often hampered by repetitive tasks and manual processes.

The introduction of AI decreased this to just 6 days. Imagine launching new features almost twice as fast! This acceleration not only boosts our productivity but also enhances our ability to swiftly respond to market demands and client needs.

Crafting Detailed Bug Reports

Bug reports are crucial for resolving issues efficiently. Pre-AI, our reports were often basic, requiring additional clarification and follow-ups.

Now, our AI-generated reports are comprehensive, including detailed reproduction steps, expected versus actual results, and even suggestions for fixes. Developers can dive straight into solving problems without the back-and-forth, significantly speeding up the resolution process.

Example Bug Report

Title: Application crashes when adding a new user with special characters in the username

Preconditions: User is on the “Add New User” page, application version 2.3.1
Steps to Reproduce:

Navigate to the “Add New User” page.
Enter a username containing special characters (e.g., “John@Doe!”).
Fill in the remaining mandatory fields.
Click on the “Submit” button.

Expected Result: The new user is added successfully, and a confirmation message is displayed.

Actual Result: The application crashes, and an error message “Unhandled exception: invalid input” is displayed.

Severity: Critical Additional Information: [Browser and OS details, attached log files]

Boosting Tester Productivity

Tester productivity has skyrocketed. Previously, our testers spent considerable time on mundane, repetitive tasks. With AI handling these, our testers can focus on more strategic, complex issues. This shift not only makes their work more engaging but also enhances the overall quality of our testing processes.

Identifying Edge Cases

Edge cases can be the bane of any software application, often revealing themselves at the worst possible times. Before AI, identifying these edge cases relied heavily on the experience and intuition of our testers.

Now, our AI system excels at spotting these elusive scenarios. For instance, in an e-commerce application, AI suggested testing with exceptionally high order quantities and unusual payment methods, uncovering vulnerabilities we hadn’t considered.

Leveraging AI tools for enhanced test coverage

To harness the full potential of AI in testing, we employ a variety of advanced tools that significantly increase our test coverage. Here are some key tools we use:

GenRocket: Utilizes AI to create diverse and complex data sets. It helps in testing edge cases by generating data that covers extreme, rare, or unexpected scenarios.
Applitools: Its Visual AI can catch edge cases related to UI that might not trigger functional errors but could impact user experience.
mabl: Adapts and generates tests based on changes in the application, ensuring that edge cases arising from new features or updates are tested.
Test.AI: Automatically detects new edge cases as the application evolves, reducing the need for manual edge case identification.
Testim: Analyzes test patterns and user flows, identifying potential edge cases that traditional scripts might miss.

The future of generative AI in software testing

Picture a future where predictive defect analysis proactively catches bugs, NLP seamlessly translates requirements into test cases, AI-driven environments optimize themselves, quantum computing handles the heaviest lifting, and emotion AI ensures users are delighted every step of the way.

The road ahead isn’t just about making testing faster or more efficient—it’s about fundamentally changing the game, making software development smarter, more intuitive, and incredibly responsive to user needs.

Predictive Defect Analysis

Imagine an AI system that can predict potential defects after adding new code. While AI cannot foresee exact future defects, it excels at highlighting common pitfalls and recurring issues based on historical data and past projects. By analyzing this data, predictive defect analysis identifies weak spots and suggests improvements, saving time and resources. This insight helps developers avoid frequent mistakes, enhancing code quality from the start. Additionally, AI assesses workflow efficiency, pinpointing process inefficiencies and recommending optimizations for a more streamlined development cycle.

A recent study found that organizations using predictive maintenance saw a 19% decrease in maintenance costs, leading to fewer bugs and smoother launches. Leveraging predictive defect analysis allows teams to preemptively address issues, reducing downtime and boosting user satisfaction. Moreover, AI prioritizes testing efforts based on historical defect data, making the testing phase more efficient and effective. This holistic approach significantly improves software quality and reliability.

Natural Language Processing for Requirements Analysis

Natural Language Processing (NLP) is set to redefine how we interpret project requirements. AI systems equipped with NLP can read and understand project documentation, automatically generating test suites.

This means no more manual interpretation errors and a faster, more accurate transition from requirements to testing. Imagine a world where your AI assistant can instantly create test cases as soon as you finalize your project specs—truly a game-changer.

AI-Driven Test Environment Management

Setting up and optimizing test environments can be a complex, time-consuming task. Enter AI-driven test environment management. These systems can dynamically configure environments based on project needs, ensuring optimal testing conditions.

For example, companies report an upto 40% reduction in setup times after implementing AI-driven management, leading to faster testing cycles and quicker time-to-market.

Emotion AI in User Experience Testing

Emotion AI adds an exciting new dimension to user experience (UX) testing. By analyzing real-time emotional responses during software interaction, this technology provides a profound understanding of how users truly feel about your product. Traditional metrics can only tell you so much, but Emotion AI dives deeper, capturing the nuances of user emotions. This allows developers to refine and personalize interfaces, ensuring they resonate better with user needs and expectations, ultimately creating a more engaging and delightful user experience.

Real-Time Adaptation and Self-Healing

As AI capabilities grow, we can envision self-healing test suites that automatically adapt to changes in the application under test.

In the case of an updated checkout process with a new payment option, Gen AI can:

Detect Changes: Identify that a new payment method has been added.
Update Test Cases: Automatically adjust existing test cases to include the new payment option.
Re-execute Tests: Run the updated test cases to ensure the new payment option functions correctly.

Continuous Learning and Improvement
Gen AI systems can continuously learn from the data generated during test executions. This learning process helps improve the accuracy and efficiency of future tests.

Predictive Analysis
Gen AI can analyze historical test data and user behavior to predict potential issues before they occur. For instance, if a specific user interaction frequently leads to errors, Gen AI can proactively generate test cases to target this area and prevent future problems.

Vision AI in Testing

Imagine an AI system capable of interpreting a screen’s layout, identifying elements like buttons, text fields, and images, and then automatically generating test cases based on this understanding. This could revolutionize test case creation.

For example, Vision AI could analyze a checkout screen on an e-commerce platform. It could recognize the input fields for shipping information, payment options, and the order summary. Based on this understanding, it can generate test cases to validate:

The presence and functionality of input fields.
The availability and working of different payment options.
The correctness of the order summary display.

This automated test case generation can drastically reduce the time QAs spend on writing and maintaining test cases while ensuring comprehensive test coverage.

Creating AI-powered testing assistants with Gen AI

Gen AI takes the capabilities of AI in testing a step further. By leveraging advanced machine learning models, Gen AI can simulate, test, and predict potential issues with unprecedented accuracy.

Consider a newly developed mobile app screen. An AI-powered testing assistant, using image recognition and natural language processing, could independently analyze the screen. It would identify UI elements, understand their functionalities, and automatically generate a comprehensive suite of test cases. For a login screen, it could generate test cases for:

Successful login with valid credentials.
Failed login with incorrect credentials.
Password recovery functionality.
Error handling for empty fields.
Password strength validation.

Gen AI can also predict potential user interactions and generate exploratory test cases, such as testing what happens when a user repeatedly clicks a button, enters invalid characters in a field, or tries to navigate to non-existent screens.

Combining Gen AI and Vision AI

The combined power of Gen AI and Vision AI brings a new level of sophistication to software testing, making the process more intuitive, comprehensive, and efficient.

Example: Testing a Food Delivery App Flow

Suppose you want to test the shopping cart UI of an e-commerce website across different devices—desktop, tablet, and mobile. Your goal is to ensure that all elements, like product images, prices, quantities, and the checkout button, display correctly and function seamlessly on every screen size.

Traditionally, this would involve a manual approach where a QA team meticulously checks each device, verifies the alignment, font size, and visibility, and then documents any inconsistencies. But now, with Vision AI, this process can be streamlined and significantly enhanced.Here’s a comparison of the Manual QA Approach versus the Vision AI Tool Approach:

Testing Task	Manual QA Approach	Vision AI Tool Approach	Time Required
Device-Specific Testing	QA manually checks alignment, font size, image display, and visibility on each device.	Vision AI automatically captures and compares the UI across devices.	Manual: 2-3 hours per deviceAI: 10-15 minutes for all devices
Visual Comparison	QA visually compares UI across devices and documents inconsistencies.	Vision AI provides an immediate side-by-side comparison, highlighting differences.	Manual: 1-2 hoursAI: A few minutes
Regression Testing	QA repeats the entire process after any UI update.	Vision AI automatically re-runs tests after any UI update.	Manual: Same as initial testingAI: 10-15 minutes, automated

Impact

Improved Efficiency: Automated generation and execution of test cases reduce manual testing efforts.

Enhanced Accuracy: AI-driven insights ensure subtle bugs are detected and addressed.

Comprehensive Coverage: Combining visual analysis with data-driven test generation ensures thorough testing of all application aspects.

The future of QA lies in AI-human collaboration. AI handles data processing and routine tasks, while humans bring critical thinking, creativity, and real-world insights to refine the process. Together, they ensure software is robust, reliable, and user-friendly.

While AI automates QA testing, human testers excel at exploratory testing, using creativity and intuition to find issues AI might overlook. They ensure data biases don’t skew results, aligning software with real-world expectations. In essence, AI handles the heavy lifting while humans provide critical thinking and real-world insights, creating a robust, user-friendly process.

In essence, AI handles the heavy lifting while humans bring critical thinking and real-world insights, creating a robust software testing process.

Leading the AI revolution in software testing

At Aubergine, we’re committed to staying at the forefront of these developments in AI and software testing. By continuously researching and adapting to new technologies, we aim to provide our clients with the most efficient, thorough, and innovative testing solutions possible.

The future of software testing is increasingly AI-driven, and at Aubergine AI Labs, we’re dedicated to exploring and leveraging these advancements to benefit our clients and the industry as a whole.

We invite you to join us on this exciting journey, where we’re not just imagining the future of software testing—we’re building it.

Yashaswi Saraswat

QA Engineer with a knack for finding the quirks in software. From the tiniest bugs to the most elusive edge cases, I’m committed to delivering software that not only works but delights users.