Hacker News with Generative AI: Software Testing

Reliably Benchmarking Small Changes – Ankush Menat (ankush.dev)
I often have to benchmark web services to see if some small change has a meaningful impact on performance. Typically, you spawn a few web service workers and use another program (ideally on another machine in the same network) to hammer that service. During this time, the test program will keep track of how many requests were processed and the latency for each of them. If throughput goes up and/or latency goes down, your change was effective.
Show HN: Hercules – Open-Source software testing agent (testzeus.com)
Effortless, Autonomous Test Automation—Zero Coding, Zero Maintenance, Infinite Possibilities.
Using LLMs to enhance our testing practices (assembled.com)
At Assembled, engineering velocity is our competitive edge. We pride ourselves on delivering new features at a fast pace. But how do we maintain quality without slowing down? The answer lies in robust testing.
Launch HN: GPT Driver (YC S21) – End-to-end app testing in natural language (ycombinator.com)
Hey HN, we are Chris and Chris from MobileBoost (https://mobileboost.io/). We’re building GPT Driver, an AI-native approach to create and execute end-to-end (E2E) tests on mobile applications.
SMURF: Beyond the Test Pyramid (googleblog.com)
SMURF: Beyond the Test Pyramid
Webinar – How Generative AI Can Help Scale Your QA Processes (testrigor.com)
Join us for an insightful webinar hosted by testRigor, where we’ll delve into the transformative potential of scaling QA processes using Generative AI.
Accumulated Test Vectors (filippo.io)
I like tests. I especially like reusable test vector libraries. Sometimes test vectors are lovingly handcrafted to target obscure edge-cases. Those vectors belong in Wycheproof or with the upstream specification. Sometimes though vectors are produced by sheer brute force. Enumerate every possible input and check the output. Try a million random inputs and see what happens. Combine all possible input sizes for every parameter. Make one very, very large input.
I Am Tired of AI (ontestautomation.com)
Unless you have been living under a rock for the last few years, you probably have seen the same massive surge I’ve seen in the application of artificial intelligence (AI) to pretty much every problem out there, in software testing, in software development, and in life in general.
Show HN: Scandium: No-Code Test Automation Tool for Software Testing (getscandium.com)
Create and maintain automated test cases without a line of code.
Earn $200K by Fuzzing for a Weekend (secret.club)
By applying well-known fuzzing techniques to a popular target, I found several bugs that in total yielded over $200K in bounties. In this article I will demonstrate how powerful fuzzing can be when applied to software which has not yet faced sufficient testing.
Automated feature testing of Verilog parsers using fuzzing (wordpress.com)
I’m delighted that Quentin Corradi, a PhD student I jointly supervise with George Constantinides, will be presenting his work to improve the reliability of hardware design tools next week at the FUZZING’24 workshop, a satellite event of the ISSTA conference.
Avoiding Test-Case Permutation Blowout (stevenhicks.me)
Rust Just Failed an Important Test (mccue.dev)
Show HN: A fake SMTP server for software integration testing (fakemail.stream)
Show HN: How we use LLMs to find testing gaps, vulnerabilities in codebases (github.com/codeintegrity-ai)
Show HN: Mutahunter – Fast, language agnostic, software testing using LLM agents (mutahunter.ai)
Automated Test-Case Reduction (cs.cornell.edu)
The sad state of property-based testing libraries (stevana.github.io)
Evaluating Fuzz Testing (arxiv.org)
Show HN: DuffMan – Fuzzer for Postman Collection (github.com/kmarkela)
Getting 100% code coverage doesn't eliminate bugs (codepipes.com)
Test Failures Should Be Actionable (googleblog.com)
Temporal Fuzzing I: Memory Models (2021) (reitzen.com)
Test Sizes (2010) (googleblog.com)
AI programming tools should be added to the Joel Test (waleson.com)