Hacker News with Generative AI: Software Testing

Functional Tests as a Tree of Continuations (2010) (evanmiller.org)
One of the most essential practices for maintaining the long-term quality of computer code is to write automated tests that ensure the program continues to act as expected, even when other people (including your future self) muck with it.
Revolutionizing software testing: Introducing LLM-powered bug catchers (engineering.fb.com)
Meta’s Automated Compliance Hardening (ACH) tool is a system for mutation-guided, LLM-based test generation. ACH hardens platforms against regressions by generating undetected faults (mutants) in source code that are specific to a given area of concern and using those same mutants to generate tests.
Testtrim: A testing tool that couldn't test itself (until now) (fenniak.net)
Today, we’re going to deep-dive into the kind of thing you can only “invest” time on if you’re a single engineer working on a project with no supervision. I just finished a crazy complicated development effort in my project, testtrim, and all I want to do is talk about how surprised I am that it actually worked.
Compiler Fuzzing in Continuous Integration: A Case Study on Dafny [pdf] (ic.ac.uk)
The Fuzzing Book (fuzzingbook.org)
Welcome to "The Fuzzing Book"! Software has bugs, and catching bugs can involve lots of effort. This book addresses this problem by automating software testing, specifically by generating tests automatically. Recent years have seen the development of novel techniques that lead to dramatic improvements in test generation and software testing. They now are mature enough to be assembled in a book – even with executable code.
State Space Explosion: The Reason We Can Never Test Software to Perfection(2021) (concerningquality.com)
Have you ever seen a test suite actually prevent 100% of bugs? With all of the time that we spend testing software, how do bugs still get through? Testing seems ostensibly simple – there are only so many branches in the code, only so many buttons in the UI, only so many edge cases to consider. So what is difficult about testing software?
Property-Based Testing for the People (repository.upenn.edu)
Faster CI with Selective Testing (mill-build.org)
Selective testing is a key technique necessary for working with any large codebase or monorepo: picking which tests to run to validate a change or pull-request, because running every test every time is costly and slow. This blog post will explore what selective testing is all about, the different approaches you can take with selective testing, based on my experience working on developer tooling and CI for the last decade at Dropbox and Databricks.
Why Playwright Is Dominating Browser Automation: A Deep Dive (leapcell.io)
Reliably Benchmarking Small Changes – Ankush Menat (ankush.dev)
I often have to benchmark web services to see if some small change has a meaningful impact on performance. Typically, you spawn a few web service workers and use another program (ideally on another machine in the same network) to hammer that service. During this time, the test program will keep track of how many requests were processed and the latency for each of them. If throughput goes up and/or latency goes down, your change was effective.
Show HN: Hercules – Open-Source software testing agent (testzeus.com)
Effortless, Autonomous Test Automation—Zero Coding, Zero Maintenance, Infinite Possibilities.
Using LLMs to enhance our testing practices (assembled.com)
At Assembled, engineering velocity is our competitive edge. We pride ourselves on delivering new features at a fast pace. But how do we maintain quality without slowing down? The answer lies in robust testing.
Launch HN: GPT Driver (YC S21) – End-to-end app testing in natural language (ycombinator.com)
Hey HN, we are Chris and Chris from MobileBoost (https://mobileboost.io/). We’re building GPT Driver, an AI-native approach to create and execute end-to-end (E2E) tests on mobile applications.
SMURF: Beyond the Test Pyramid (googleblog.com)
SMURF: Beyond the Test Pyramid
Webinar – How Generative AI Can Help Scale Your QA Processes (testrigor.com)
Join us for an insightful webinar hosted by testRigor, where we’ll delve into the transformative potential of scaling QA processes using Generative AI.
Accumulated Test Vectors (filippo.io)
I like tests. I especially like reusable test vector libraries. Sometimes test vectors are lovingly handcrafted to target obscure edge-cases. Those vectors belong in Wycheproof or with the upstream specification. Sometimes though vectors are produced by sheer brute force. Enumerate every possible input and check the output. Try a million random inputs and see what happens. Combine all possible input sizes for every parameter. Make one very, very large input.
I Am Tired of AI (ontestautomation.com)
Unless you have been living under a rock for the last few years, you probably have seen the same massive surge I’ve seen in the application of artificial intelligence (AI) to pretty much every problem out there, in software testing, in software development, and in life in general.
Show HN: Scandium: No-Code Test Automation Tool for Software Testing (getscandium.com)
Create and maintain automated test cases without a line of code.
Earn $200K by Fuzzing for a Weekend (secret.club)
By applying well-known fuzzing techniques to a popular target, I found several bugs that in total yielded over $200K in bounties. In this article I will demonstrate how powerful fuzzing can be when applied to software which has not yet faced sufficient testing.
Automated feature testing of Verilog parsers using fuzzing (wordpress.com)
I’m delighted that Quentin Corradi, a PhD student I jointly supervise with George Constantinides, will be presenting his work to improve the reliability of hardware design tools next week at the FUZZING’24 workshop, a satellite event of the ISSTA conference.
Avoiding Test-Case Permutation Blowout (stevenhicks.me)
Rust Just Failed an Important Test (mccue.dev)
Show HN: A fake SMTP server for software integration testing (fakemail.stream)
Show HN: How we use LLMs to find testing gaps, vulnerabilities in codebases (github.com/codeintegrity-ai)
Show HN: Mutahunter – Fast, language agnostic, software testing using LLM agents (mutahunter.ai)
Automated Test-Case Reduction (cs.cornell.edu)
The sad state of property-based testing libraries (stevana.github.io)
Evaluating Fuzz Testing (arxiv.org)
Show HN: DuffMan – Fuzzer for Postman Collection (github.com/kmarkela)
Getting 100% code coverage doesn't eliminate bugs (codepipes.com)