Hacker News with Generative AI: GPU

$2 H100s: How the GPU Rental Bubble Burst (latent.space)
H100s used to be $8/hr if you could get them. Now there's 7 different places sometimes selling them under $2. What happened?
Scuda – Virtual GPU over IP (github.com/kevmo314)
SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.
Show HN: Squey, an open-source GPU-accelerated data visualization software (squey.org)
Squey 5.0 is out! Check out the new Parquet plugin and the revamped UI.
炊紙(kashikishi) is a text editor that utilizes GPU to edit text in a 3D space (github.com/mitoma)
炊紙 is a text editor that lets you edit text in three-dimensional space. It is pronounced "kashikishi".
Hetzner introduces GPU server for AI training (hetzner.com)
Discover the next level of performance with the new GEX130 dedicated GPU server. Equipped with the NVIDIA RTX™ 6000 Ada generation graphics card, it can put ideas into practice even faster and highly complex tasks can be completed efficiently.
Writing Portable Rendering Code with Nvrhi (nvidia.com)
Modern graphics APIs, such as Direct3D 12 and Vulkan, are designed to provide relatively low-level access to the GPU and eliminate the GPU driver overhead associated with API translation.
Show HN: Oblivus GPU Cloud – On-Demand H100s from $1.98/hr – $25 Free Credit (oblivus.com)
Democratized GPU Cloud starting at only $0.12/hr! No quotas, no restrictions.
GPU Debug Scopes (wunkolo.github.io)
Rendering APIs these days tend to capture their GPU workloads into a serialized form such as a command-buffer or command-list to be dispatched at a later time into a work-queue.
Show HN: Attaching to a virtual GPU over TCP (thundercompute.com)
Open-Source AMD GPU Implementation of CUDA "Zluda" Has Been Taken Down (phoronix.com)
Show HN: Datoviz – Vulkan-based GPU scientific visualization (C/C++/Python) (github.com/datoviz)
GPU Restaking – Beyond digital currencies to physical computing resources (bagel.net)
TensorDict: A GPU-accelerated Python dictionary (github.com/pytorch)
Four billion years in four minutes – Simulating worlds on the GPU (davidar.io)
How to optimize a CUDA matmul kernel for cuBLAS-like performance (2022) (siboehm.com)
How can I do my research as a GPU poor? (ycombinator.com)
Real-Time Procedural Generation with GPU Work Graphs [pdf] (gpuopen.com)
NVIDIA Transitions Fully Towards Open-Source Linux GPU Kernel Modules (nvidia.com)
Linux Patch to Disable the Snapdragon X Elite "X1E80100" GPU by Default (phoronix.com)
Run CUDA, unmodified, on AMD GPUs (scale-lang.com)
GPU-Friendly Stroke Expansion (arxiv.org)
The Simplest Way to Control Nvidia GPU Fan Speed in Linux (github.com/RoversX)
Show HN: wc-GPU: The Unix util `wc` running on a GPU (github.com/fragmede)
Nvidia Warp: A Python framework for high performance GPU simulation and graphics (github.com/NVIDIA)
Nvidia takes 88% of the GPU market share (xda-developers.com)
An End-User Has Made It Easier to Build ROCm AMD GPU Machine Learning Software (phoronix.com)
Nvtop: Htop for GPUs (terminaltrove.com)
How to Make More Money Renting a GPU Than Nvidia Makes Selling It (nextplatform.com)
Show HN: TensorDock – GPU Cloud Marketplace, H100s from $2.49/hr (tensordock.com)
Fine tune LLAMA3 on million scale dataset in consumer GPU using QLora, DeepSpeed (medium.com)