Hacker News with Generative AI: Benchmarks

Killed by LLM (r0bk.github.io)
A memorial to the benchmarks that defined—and were defeated by—AI progress
Fair Go vs. Elixir Benchmarks (github.com/antonputra)
The code previously used Jason.encode! but Jason.encode_to_iodata! should be preferred over IO devices. This should increase performance and reduce memory usage. This is what frameworks such as a Phoenix would have used by default
Intel Compute Runtime 24.45 vs. ROCm 6.3 vs. Nvidia R565 Linux GPU Benchmarks (phoronix.com)
Complementing yesterday's fresh Linux gaming benchmarks of mid-range Intel Arc Graphics "Alchemist" vs. NVIDIA GeForce RTX 40 vs. AMD Radeon RX 7000 series cards ahead of the upcoming Battlemage availability, today's article is providing a fresh look at the latest Intel Compute Runtime performance for Level Zero / OpenCL on current-gen Intel discrete graphics compared to mid-range AMD Radeon GPUs on ROCm 6.3 and similar NVIDIA GeForce RTX 40 Ada graphics cards on the R565 driver.
Intel Arc B580 trades blows with the RTX 4060 and RX 7600 in early benchmarks (tomshardware.com)
1B Nested Loop Iterations (benjdd.com)
Timings taken via hyperfine on an M3 Macbook pro with 16 gb RAM. Input value of 40 given to each.
OrioleDB beta7: Benchmarks (orioledb.com)
OrioleDB is a storage extension for PostgreSQL which uses PostgreSQL's pluggable storage system.
Microbenchmarks Are Experiments (mrale.ph)
Benchmarks are not numerology. Their results are not a divine revelation. Benchmarks are experiments. Their results are meaningless without interpretation and validation.
1B nested loop iterations (benjdd.com)
Ran each three times and used the lowest timing for each.
Google Axion ARM CPU (C4A) vs. AWS Graviton4, Performance benchmark (phoronix.com)
Last week Google announced the general availability of their C4A instances powered by their in-house Axion processors.
AMD Ryzen 7 9800X3D Linux Performance: Zen 5 With 3D V-Cache (phoronix.com)
Ahead of tomorrow's availability of the AMD Ryzen 7 9800X3D processor as the first Zen 5 CPU released with 3D V-Cache, today the review embargo lifts. Here is a look at how this 8-core / 16-thread Zen 5 CPU with 64MB of 3D V-Cache is performing under Ubuntu Linux compared to a variety of other Intel Core and AMD Ryzen desktop processors.
Apple's M4 Max is the single-core performance king in Geekbench 6 (tomshardware.com)
Mac Mini with M4 Pro is the fastest Mac ever benchmarked (macrumors.com)
The first Geekbench 6 benchmark results for the M4 Pro chip surfaced today. Impressively, the results that are available so far show that the highest-end M4 Pro chip is faster than the highest-end M2 Ultra chip in terms of peak multi-core CPU performance.
Benchmarks of Google's Axion Arm-Based CPU (phoronix.com)
Earlier this year Google announced Axion as their first Arm-based CPU for the Google Cloud. Today already they are taking Axion to general availability with the new C4A instances. These new C4A instances are advertised as offering up to 50% better performance and up to 60% better energy efficiency than their current generation x86 instance types.
Encore.ts: A New Type of Framework (encore.dev)
We recently published performance benchmarks showing how Encore.ts achieves 9x request throughput compared to Express.js, and 2x compared to Fastify.
AMD Zen 5 Epyc Turin dominates previous Zen 4, Intel by 40% (phoronix.com)
Across more than 140 benchmarks the AMD EPYC 9005 series processors were delivering great performance, power efficiency, and value. Those interested can see all 140 benchmarks via this result file.
AMD EPYC Turin delivers better performance/power efficiency than AmpereOne (phoronix.com)
The AMD EPYC 9965 Turin Dense processor was delivering dominating performance in most of the HPC benchmarks tested compared to the AmpereOne A192-32X flagship ARM server processor.
AMD EPYC 9755 / 9575F / 9965 Benchmarks Show Dominating Performance Review (phoronix.com)
Last month Intel introduced their Xeon 6 "Granite Rapids" processors with up to 128 P cores, MRDIMM support, and other improvements as a big step-up in performance and power efficiency for their server processors.
Windows 11 vs. Ubuntu 24.10 Performance For Intel Core Ultra 7 Lunar Lake (phoronix.com)
We are used to seeing Linux perform much better than Windows for rendering with Blender and that still holds true with Lunar Lake.
AMD Ryzen AI 300 Series Dominates Intel Core Ultra 7 Lunar Lake Performance (phoronix.com)
Earlier this week I delivered initial Intel Xe2 Lunar Lake graphics benchmarks on Linux while today the focus is on Lunar Lake's CPU performance.
Running Spec CPU2017 at Chips and Cheese? (chipsandcheese.com)
SPEC, or Standard Performance Evaluation Corporation, maintains and publishes various benchmark suites that are often taken as an industry standard.
Intel Processor N95 vs. N97 vs. N100 vs. Core I3-N305 Benchmarks Comparison (cnx-software.com)
Intel Alder Lake-N processors have been pretty popular in mini PCs and to a lesser extent in single board computers in the last year or so, thanks to their excellent performance/price and features/price ratios.
OpenAI O1 Results on ARC Prize (twitter.com)
Open-source reflection lama 70B beats Claude 3.5 and GPT-4 on benchmarks (reflectionllama.com)
UntetherAI: Record-Breaking MLPerf Benchmarks (untether.ai)
Cerebras launches inference for Llama 3.1; benchmarked at 1846 tokens/s on 8B (twitter.com)
Intel Clear Linux: 16% more Ryzen 9 9950X performance (phoronix.com)
Windows 11 vs. Ubuntu 24.04 Linux Performance For The AMD Ryzen 9 9590X (phoronix.com)
The AMD Ryzen 9 9950X and Ryzen 9 9900X Review: Flagship Zen 5 Soars and Stalls (anandtech.com)
Benchmark of Bcachefs vs. Btrfs vs. EXT4 vs. F2FS vs. XFS on Linux 6.11 (phoronix.com)
AMD Ryzen 5 9600X and Ryzen 7 9700X Offer Excellent Linux Performance (phoronix.com)