Hacker News with Generative AI: GPU Architecture

Dynamic Register Allocation on AMD's RDNA 4 GPU Architecture (chipsandcheese.com)
Modern GPUs often make a difficult tradeoff between occupancy (active thread count) and register count available to each thread.
Smol-GPU: A tiny RISC-V GPU built to teach modern GPU architecture (github.com/Grubre)
An educational implementation of a parallel processor in system-verilog.
Uncovering Real GPU NoC Characteristics: Implications on Interconnect Arch. (ece.ubc.ca)
AMD announces unified UDNA GPU architecture – bringing RDNA and CDNA together (tomshardware.com)
Two become one.