Hacker News with Generative AI: AMD

AMD's game-changing Strix Halo, formerly Ryzen AI Max, poses for new die shots (tomshardware.com)
AVX-512 gotcha: avoid compressing words to memory with AMD Zen 4 processors (lemire.me)
The recent AMD processors (Zen 4) provide extensive support for the powerful AVX-512 instructions.
GCC 15 Compiler Showing Off Nice Performance Improvements on AMD Zen 5 (phoronix.com)
There were a number of other applications with small but consistent performance improvements when built by GCC 15.
AMD Announces Open-Source "Schola" Library for Reinforcement Learning (phoronix.com)
AMD announced today the release of Schola 1.0 as an open-source reinforcement learning library that is being made available under an MIT license and as part of their GPUOpen software collection for helping game developers.
How to make any AMD Zen CPU always generate 4 from RDRAND (theregister.com)
Googlers have not only figured out how to break AMD's security – allowing them to load unofficial microcode into its processors to modify the silicon's behavior as they wish – but also demonstrated this by producing a microcode patch that makes the chips always output 4 when asked for a random number.
AMD: Microcode Signature Verification Vulnerability (github.com/google)
Google Security Team has identified a security vulnerability in some AMD Zen-based CPUs. This vulnerability allows an adversary with local administrator privileges (ring 0 from outside a VM) to load malicious microcode patches. We have demonstrated the ability to craft arbitrary malicious microcode patches on Zen 1 through Zen 4 CPUs. The vulnerability is that the CPU uses an insecure hash function in the signature validation for microcode updates.
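To see why an insecure hash function undermines signature validation, here is a minimal toy sketch. It is not AMD's actual scheme: `weak_hash`, `SECRET`, `sign`, `verify`, and `forge` are hypothetical names invented for illustration, the additive checksum stands in for "an insecure hash", and an HMAC stands in for the vendor's real signature algorithm. The point is only that when the signature covers a digest that is easy to collide, a different payload with the same digest passes verification with the original signature.

```python
import hashlib
import hmac

# Hypothetical signing key; stands in for the vendor's private key.
SECRET = b"vendor-signing-key"


def weak_hash(data: bytes) -> int:
    """Toy 16-bit additive checksum. Deliberately insecure; an assumption for illustration."""
    return sum(data) & 0xFFFF


def sign(patch: bytes) -> bytes:
    """Sign only the weak digest of the patch (HMAC used as a stand-in signature)."""
    digest = weak_hash(patch).to_bytes(2, "big")
    return hmac.new(SECRET, digest, hashlib.sha256).digest()


def verify(patch: bytes, signature: bytes) -> bool:
    """Accept the patch if the signature matches its weak digest."""
    return hmac.compare_digest(sign(patch), signature)


def forge(original: bytes, malicious: bytes) -> bytes:
    """Pad `malicious` so that its weak digest equals that of `original`."""
    diff = (weak_hash(original) - weak_hash(malicious)) & 0xFFFF
    full, rem = divmod(diff, 255)
    padding = bytes([255]) * full + bytes([rem])  # padding bytes sum to `diff`
    return malicious + padding


official = b"official microcode patch"
signature = sign(official)

evil = forge(official, b"make rdrand return 4")
print(evil != official, verify(evil, signature))
```

The forger never touches `SECRET`: the attack works purely because two different inputs share a digest, which is exactly what a collision-resistant hash is supposed to make infeasible.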
AMD adds RF-sampling data converters to Versal adaptive SoCs (2024) (electronicsweekly.com)
AMD is to add integrated direct RF-sampling data converters to its Versal adaptive SoCs.
Interview with AMD Chief Software Officer: UDNA, ROCm for Radeon, AI Everywhere (techpowerup.com)
A few days ago, we reported on AMD's newest expansion plans for Serbia. The company opened two new engineering design centers with offices in Belgrade and Nis.
KVM Enhancements Within the Linux 6.14 Kernel (phoronix.com)
Along with other Intel TDX changes and AMD SEV updates separately sent out for the in-development Linux 6.14 kernel, there is the usual hearty batch of Kernel-based Virtual Machine (KVM) virtualization updates too.
Disabling Zen 5's Op Cache and Exploring Its Clustered Decoder (chipsandcheese.com)
Zen 5 has an interesting frontend setup with a pair of fetch and decode clusters. Each cluster serves one of the core’s two SMT threads. That creates parallels to AMD’s Steamroller architecture from the pre-Zen days. Zen 5 and Steamroller can both decode up to eight instructions per cycle with two threads active, or up to four per cycle for a single thread.
ROCm Device Support Wishlist (github.com/ROCm)
We would like to hear from the community what other cards you would like to see ROCm support for. Currently, the compatibility matrix for Linux is at https://rocm.docs.amd.com/projects/install-on-linux/en/latest/reference/system-requirements.html and the one for Windows is at https://rocm.docs.amd.com/projects/install-on-windows/en/latest/reference/system-requirements.html.
Linux 6.13 Released with AutoFDO and Propeller, AMD Changes and Apple Support (phoronix.com)
As anticipated the Linux 6.13 kernel was promoted to stable today with an on-time release and in turn also marking the start of the Linux 6.14 merge window.
The AMD Radeon Instinct MI300A's Giant Memory Subsystem (chipsandcheese.com)
AMD acquired ATI in 2006, hoping ATI's GPU expertise would combine with AMD's CPU know-how to create integrated solutions worth more than the sum of their parts.
Tiny Corp Nearing "Completely Sovereign" Compute Stack for AMD GPUs with Tinygr (phoronix.com)
George Hotz's Tiny Corp, which develops the Tinygrad neural network framework and sells the Tinybox NVIDIA- and AMD-powered AI workstations, is nearing a "completely sovereign" software stack for GPU compute on AMD.
Boosting Computational Fluid Dynamics Performance with AMD MI300X (blogs.amd.com)
This blog will guide you, step-by-step, through the process of installing and running benchmarks with Ansys Fluent and AMD MI300X. We start with an overview of the Ansys Fluent CFD application and then show you how to set up an AMD MI300X system to run benchmarks. The blog's benchmark results demonstrate the dramatic impact the MI300X has on speeding up simulations, improving design efficiency, and reducing costs in the automotive, aerospace, and environmental engineering industries.
Apple joins UALink consortium with Intel and AMD to take on Nvidia AI dominance (9to5mac.com)
Apple has officially gained a board seat on the Ultra Accelerator Link Consortium, a group of more than 65 members developing next generation AI accelerator architecture.
AMD says Intel's 'horrible product' is causing Ryzen 7 9800X3D shortages (tomshardware.com)
TSMC Arizona allegedly now producing AMD's Ryzen 9000 and Apple's S9 processors (tomshardware.com)
AMD Ryzen 9 9950X3D and 9900X3D claims 20% faster gaming performance vs. Intel (tomshardware.com)
AMD 'Strix Halo' Ryzen AI Max+ Debuts with RDNA 3.5 Graphics and Zen 5 CPU Cores (tomshardware.com)
Agner Fog's Software Optimization Resources (agner.org)
This series of five manuals describes everything you need to know about optimizing code for x86 and x86-64 family microprocessors, including optimization advice for C++ and assembly language, details about the microarchitecture and instruction timings of most Intel, AMD and VIA processors, and details about different compilers and calling conventions.
Intel's Linux Performance Optimizations Continue Paying Off for AMD EPYC (phoronix.com)
As part of my end-of-year benchmarking and various historical comparisons, over the holidays I was curious to take a look at how the mature AMD EPYC 9004 "Genoa" performance has evolved over the past two years under Linux.
I helped fix sleep-wake hangs on Linux with AMD GPUs (gitlab.io)
I dual-boot my desktop between Windows and Linux. Over the past few years, Linux would often crash when I tried to sleep my computer with high RAM usage. Upon waking it would show a black screen with moving cursor, or enter a "vegetative" state with no image on-screen, only responding to magic SysRq or a hard reset. I traced this behavior to an amdgpu driver power/memory management bug, which took over a year to brainstorm and implement solutions for.
AMD Continued Ramping Up Their Linux and Open-Source Investments in 2024 (phoronix.com)
AMD's new products this year have not only been supported well on the server side with their new EPYC 9005 "Turin" processors but also on the consumer side with the Ryzen AI 300 series laptop and Ryzen 9000 series desktop Zen 5 processors.
UALink Consortium Led by AMD and Intel Poised to Compete with Nvidia's NVLink (tomshardware.com)
AMD 3D V-Cache teardown shows majority of the Ryzen 7 9800X3D is dummy silicon (tomshardware.com)
Framework 13 AMD Review on Linux: Almost Perfect (boilingsteam.com)
Following our review of the Framework Intel 12th gen laptop last year, we are giving a spin to the Framework Laptop AMD 13 that features one of the latest AMD processors, and a brand new screen as well, at 2.8K resolution and 120 Hz refresh rate. And it’s a matte display!
DOOM ported to run atop AMD ROCm + LLVM libc (phoronix.com)
An open-source developer at AMD has carried out a DOOM port that runs almost entirely atop AMD GPUs for rendering and the game logic.
Ask HN: How Are AMD GPUs/ROCm for Machine Learning in 2024? (ycombinator.com)