Hacker News with Generative AI: GPU Programming

Running GPT-2 in WebGL: Rediscovering the Lost Art of GPU Shader Programming (nathan.rs)
Preface: A few weeks back, I implemented GPT-2 using WebGL and shaders (Github Repo) which made the front page of Hacker News (discussion). By popular demand, here is a short write-up over the main ideas behind GPU shader programming (for general-purpose computing).

Generative AI, WebGL, GPU Programming, Computer Graphics

157 points by nathan-barry 47 days ago | 41 comments

AMD GPU Programming in Julia (juliagpu.org)

AMD, GPU Programming, Julia, Programming Languages

26 points by pxl-th 65 days ago | 4 comments

Faster sorting with SIMD CUDA intrinsics (2024) (winwang.blog)
Recently, I finished a batch at the Recurse Center… is what I would have said if this post were written when I intended to write it (i.e. 3 months ago). My project there focused on a questionable application of CUDA (mostly irrelevant to this post), but it got me thinking more about other GPU-friendly algorithms.

CUDA, GPU Programming, Sorting Algorithms, Optimization

92 points by winwang 68 days ago | 11 comments

WebMonkeys: parallel GPU programming in JavaScript (2016) (github.com/VictorTaelin)
Allows you to spawn thousands of parallel tasks on the GPU with the simplest, dumbest API possible. It works on the browser (with browserify) and on Node.js. It is ES5-compatible and doesn't require any WebGL extension.

JavaScript, GPU Programming, Web Development, Parallel Computing

115 points by surprisetalk 70 days ago | 28 comments

Next-Gen GPU Programming: Hands-On with Mojo and Max Modular HQ (youtube.com)

GPU Programming, Programming, Graphics, Hardware, Software

44 points by solarmist 78 days ago | 21 comments

PyGraph: Robust Compiler Support for CUDA Graphs in PyTorch (arxiv.org)
CUDA Graphs -- a recent hardware feature introduced for NVIDIA GPUs -- aim to reduce CPU launch overhead by capturing and launching a series of GPU tasks (kernels) as a DAG. However, deploying CUDA Graphs faces several challenges today due to the static structure of a graph. It also incurs performance overhead due to data copy. In fact, we show a counter-intuitive result -- deploying CUDA Graphs hurts performance in many cases.

CUDA, PyTorch, Performance Optimization, GPU Programming, Deep Learning

84 points by mfiguiere 79 days ago | 8 comments

CubeCL: GPU Kernels in Rust for CUDA, ROCm, and WGPU (github.com/tracel-ai)
With CubeCL, you can program your GPU using Rust, taking advantage of zero-cost abstractions to develop maintainable, flexible, and efficient compute kernels.

Rust, GPU Programming, CUDA, ROCm, WGPU

210 points by ashvardanian 80 days ago | 41 comments

How to Write a Fast Matrix Multiplication from Scratch with Tensor Cores (2024) (alexarmbr.github.io)
This post details my recent efforts to write an optimized matrix multiplication kernel in CUDA using tensor cores on a NVIDIA Tesla T4 GPU. The goal is to compute $D = \alpha * A * B + \beta * C$, as fast as possible. In this equation $D,A,B$ and $C$ are large matrices full of half precision floating point numbers, and $\alpha$, $\beta$ are constants. This problem is usually referred to as a Half-precision Generalized Matrix Multiply, or HGEMM for short.

CUDA, GPU Programming, Matrix Multiplication, Optimization, Tensor Cores

147 points by skidrow 85 days ago | 17 comments

Zig and GPUs (alichraghi.github.io)
GPU programming used to mean wrangling C++ compilers, bloated SDKs, and vendor-specific toolchains. That’s changing. You can now write GPU code in modern languages like Rust and Zig with fewer layers. This post walks through the current state of Zig’s GPU backends and how they stack up across Vulkan, OpenCL, and native ISAs.

GPU Programming, Programming Languages, Zig, Vulkan, OpenCL

57 points by Cloudef 86 days ago | 11 comments

Rust CUDA Project (github.com/Rust-GPU)
An ecosystem of libraries and tools for writing and executing extremely fast GPU code fully in Rust.

Rust, GPU Programming, Libraries, Tools

146 points by sksxihve 93 days ago | 47 comments

Show HN: HipScript – Run CUDA in the browser with WebAssembly and WebGPU (lights0123.com)
Online compiler for HIP and NVIDIA® CUDA® code to WebGPU

WebAssembly, WebGPU, CUDA, GPU Programming, Programming Languages

309 points by lights0123 187 days ago | 32 comments

GPU Programming Glossary (modal.com)

GPU Programming, Glossary, Computer Graphics

13 points by Jimmc414 212 days ago | 1 comments

Using Libc for GPUs (llvm.org)
Once you have finished building the GPU C library it can be used to run libc or libm functions directly on the GPU. Currently, not all C standard functions are supported on the GPU. Consult the list of supported functions for a comprehensive list.

GPU Programming, C Programming, Libraries, Performance

193 points by hochmartinez 214 days ago | 87 comments

Rust GPU: The future of GPU programming (rust-gpu.github.io)
Finally, a modern language for GPUs <p>Rust GPU makes it possible to write and run GPU software in Rust, leveraging the language's powerful safety and concurrency features to enhance performance and reliability. With Rust GPU, you can seamlessly develop for both CPU and GPU using a unified codebase, all while benefiting from Rust’s existing ecosystem.</p>

Programming Languages, Rust, GPU Programming, Performance

34 points by eventhelix 278 days ago | 28 comments

GPU Puzzles (github.com/srush)
This notebook is an attempt to teach beginner GPU programming in a completely interactive fashion. Instead of providing text with concepts, it throws you right into coding and building GPU kernels.

GPU Programming, Programming, Learning Resources, Education

354 points by cgadski 298 days ago | 40 comments

Ask HN: Resources for GPU Compilers? (ycombinator.com)

GPU Programming, Compilers, Resources

74 points by zvikinoza 313 days ago | 21 comments

Taichi: Productive, portable, and performant GPU programming in Python (github.com/taichi-dev)

Python, GPU Programming, Open Source, Performance

60 points by lastdong 326 days ago | 16 comments

Gpu.cpp: A lightweight library for portable low-level GPU computation (answer.ai)

GPU Programming, C++, Libraries

242 points by bovem 365 days ago | 51 comments

ILGPU: Write GPU programs with C# and F# (github.com/m4rs-mt)

GPU Programming, C#, F#, Programming Languages

157 points by neonsunset 421 days ago | 35 comments

Show HN: Metashade – a Pythonic GPU shading/compute EDSL (github.com/ppenenko)

Python, GPU Programming, Shading, Computer Graphics, Open Source

47 points by ppenenko 447 days ago | 8 comments