Hacker News with Generative AI: GPUs

LithOS: An Operating System for Efficient Machine Learning on GPUs (arxiv.org)
The surging demand for GPUs in datacenters for machine learning (ML) has made efficient GPU utilization crucial.

Operating Systems, Machine Learning, GPUs

3 points by PaulHoule 159 days ago | 0 comments

RAGDoll: Efficient Offloading-Based Online RAG System on a Single GPU (arxiv.org)
Retrieval-Augmented Generation (RAG) enhances large language model (LLM) generation quality by incorporating relevant external knowledge.

Retrieval-Augmented Generation, Artificial Intelligence, Computer Science, GPUs

4 points by PaulHoule 162 days ago | 0 comments

Analyzing Modern Nvidia GPU Cores (arxiv.org)
GPUs are the most popular platform for accelerating HPC workloads, such as artificial intelligence and science simulations. However, most microarchitectural research in academia relies on GPU core pipeline designs based on architectures that are more than 15 years old.

GPUs, Artificial Intelligence, Science, Research

178 points by mfiguiere 166 days ago | 37 comments

TScale – Distributed training on consumer GPUs (github.com/Foreseerr)
This repo contains transformer train and inference code written in C++ and CUDA.

Machine Learning, GPUs, Distributed Systems, C++, CUDA

130 points by zX41ZdbW 168 days ago | 27 comments

Intel details 14A performance and Turbo Cells for maximum CPU and GPU frequency (tomshardware.com)

Intel, CPUs, GPUs, Performance, Technology

3 points by rbanffy 170 days ago | 0 comments

China's Moore Threads polishes homegrown CUDA alternative (tomshardware.com)

China, Hardware, GPUs, Semiconductors

4 points by rguiscard 186 days ago | 0 comments

Jensen Huang on GPUs [video] (youtube.com)

Jensen Huang, GPUs, Hardware, Technology, Video

7 points by tambourine_man 207 days ago | 1 comments

Nvidia's RTX Pro 6000 has 96GB of VRAM and 600W of power (theverge.com)
Nvidia is announcing its RTX Pro Blackwell series of GPUs today, designed to meet the needs of professional designers, developers, data scientists, and creatives.

Nvidia, GPUs, Hardware, Graphics Cards, Professional Workstations

63 points by mfiguiere 214 days ago | 69 comments

Akira ransomware can be cracked with sixteen RTX 4090 GPUs in around ten hours (tomshardware.com)

Cybersecurity, Ransomware, GPUs, Technology, Hardware

153 points by Ozarkian 216 days ago | 39 comments

Decrypting Encrypted files from Akira Ransomware using a bunch of GPUs (tinyhack.com)
I recently helped a company recover their data from the Akira ransomware without paying the ransom. I’m sharing how I did it, along with the full source code.

Cybersecurity, Ransomware, GPUs, Data Recovery

6 points by notmine1337 219 days ago | 1 comments

Speeding up computational lithography with the power and parallelism of GPUs (semiengineering.com)
A new lithography library brings mask optimization operations to GPUs.

Computational Lithography, GPUs, Parallel Computing, Semiconductor Engineering

57 points by PaulHoule 229 days ago | 1 comments

SVDQuant+NVFP4: 4× Smaller, 3× Faster FLUX with 16-bit Quality on Blackwell GPUs (hanlab.mit.edu)
With Moore's law slowing down, hardware vendors are shifting toward low-precision inference. NVIDIA's latest Blackwell architecture introduces a new 4-bit floating point format (NVFP4), improving upon the previous MXFP4 format. NVFP4 features more precise scaling factors and a smaller microscaling group size (16 v.s. 32), enabling it to maintain 16-bit model accuracy even at 4-bit precision while delivering 4× higher peak performance.

Hardware, GPUs, Machine Learning, Computer Architecture, Low-Precision Inference

52 points by lmxyy 239 days ago | 10 comments

Linux Introducing a Standardized Way of Informing User-Space over Hung GPUs (phoronix.com)
The upcoming Linux 6.15 kernel is set to finally introduce a standardized way of informing user-space of GPUs becoming hung or otherwise unresponsive.

Linux, Kernel Development, GPUs, Software, Operating Systems

11 points by mfiguiere 240 days ago | 0 comments

We were wrong about GPUs (fly.io)
We’re building a public cloud, on hardware we own. We raised money to do that, and to place some bets; one of them: GPU-enabling our customers. A progress report: GPUs aren’t going anywhere, but: GPUs aren’t going anywhere.

Cloud Computing, Hardware, GPUs, Progress Reports

921 points by mxstbr 246 days ago | 576 comments

800x Speed Boost on Nvidia GPUs (scmp.com)
A high-performance algorithm that could solve complicated material design problems on consumer GPUs has been developed by Chinese researchers, achieving a groundbreaking 800-fold increase in speed over traditional methods.

Artificial Intelligence, Computer Science, Hardware, GPUs, Research

10 points by nthypes 259 days ago | 5 comments

How to Run DeepSeek R1 Distilled Reasoning Models on RyzenAI and Radeon GPUs (guru3d.com)

Deep Learning, Computer Hardware, GPUs, AI Models

82 points by waltercool 259 days ago | 22 comments

Exacluster with 144 Nvidia H200 AI GPUs detailed by its designer (tomshardware.com)

Hardware, Artificial Intelligence, Nvidia, GPUs

11 points by doener 260 days ago | 0 comments

Tiny Corp Nearing "Completely Sovereign" Compute Stack for AMD GPUs with Tinygr (phoronix.com)
George Hotz' Tiny Corp that develops the Tinygrad neural network framework and sells the Tinybox NVIDIA and AMD powered AI workstations is nearing a "completely sovereign" software stack for GPU compute on AMD.

AMD, GPUs, AI, Software, Hardware

7 points by todsacerdoti 276 days ago | 0 comments

If GPUs Are So Good, Why Do We Still Use CPUs at All? (codingstuff.substack.com)
There’s this old video from 2009 that’s been going viral on Twitter recently. Its supposed to give viewers an intuition of the difference between CPUs and GPUs.

GPUs, CPUs, Hardware, Performance, Computer Science

90 points by teddywahle 284 days ago | 69 comments

AMD 'Strix Halo' Ryzen AI Max+ Debuts with RDNA 3.5 Graphics and Zen 5 CPU Cores (tomshardware.com)

AMD, CPUs, GPUs, Hardware, Technology

83 points by kcb 285 days ago | 83 comments

Nvidia's Christmas Present: GB300 and B300 – Reasoning Inference, Amazon, Memory (semianalysis.com)
Merry Christmas has come thanks to Santa Huang. Despite Nvidia’s Blackwell GPU’s having multiple delays, discussed here, and numerous times through the Accelerator Model due to silicon, packaging, and backplane issues, that hasn’t stopped Nvidia from continuing their relentless march.

Nvidia, GPUs, Hardware, AI, Amazon

6 points by rbanffy 296 days ago | 0 comments

GPU Glossary (modal.com)
We wrote this glossary to solve a problem we ran into working with GPUs here at Modal : the documentation is fragmented, making it difficult to connect concepts at different levels of the stack, like Streaming Multiprocessor Architecture , Compute Capability , and nvcc compiler flags .

GPUs, Software, Documentation, Glossary

3 points by abhi9u 309 days ago | 0 comments

Ask HN: How Are AMD GPUs/ROCm for Machine Learning in 2024? (ycombinator.com)
Ask HN: How Are AMD GPUs/ROCm for Machine Learning in 2024?

Machine Learning, AMD, GPUs, Hardware, Ask HN

3 points by Decabytes 310 days ago | 0 comments

How AMD Is Taking Standard C/C++ Code to Run Directly on GPUs (phoronix.com)
Back at the 2024 LLVM Developers' Meeting was an interesting presentation by AMD engineer Joseph Huber for how they have been exploring running common, standard C/C++ code directly on GPUs without having to be adapted for any GPU language / programming dialects or other adaptations.

AMD, GPUs, C/C++, Programming, Hardware

26 points by peutetre 312 days ago | 3 comments

Maxsun makes a GPU with two built-in M.2 SSD ports (tomshardware.com)

Hardware, SSD, GPUs, Storage, Technology

9 points by LorenDB 315 days ago | 0 comments

GPU Accelerated Object Storage (acceleratedcloudstorage.com)
GPU-Optimized Cloud Object Storage for Real-Time Data Access

Cloud Storage, GPUs, Real-time Data

17 points by NKdeveloper 338 days ago | 7 comments

AMD Developing Next-Gen Fortran Compiler Based on Flang, Optimized for AMD GPUs (phoronix.com)
AMD today went public with details on the "AMD Next-Gen Fortran Compiler" as a new Fortran compiler they are working on based on LLVM's Flang.

AMD, Fortran, Compilers, GPUs, LLVM

5 points by sandwichsphinx 339 days ago | 0 comments

CUDA Programming Course – High-Performance Computing with GPUs [video] (youtube.com)

CUDA, Programming, High-Performance Computing, GPUs, Video

26 points by yarapavan 340 days ago | 1 comments

Fujitsu, AMD Plan to Pair Monaka CPUs with Instinct GPUs (theregister.com)
Fujitsu and AMD announced plans on Friday to develop a new, more energy-efficient AI and HPC compute platform that will pair the Japanese tech vendor's next-gen CPUs with the House of Zen's Instinct accelerators.

AI, CPUs, GPUs, Hardware, Partnerships

5 points by rbanffy 350 days ago | 0 comments

Security flaws found in Nvidia GeForce GPUs (pcworld.com)
Graphics card manufacturer Nvidia is currently issuing a warning to all owners of GeForce GPUs. According to an Nvidia security bulletin, several security vulnerabilities requiring urgent attention have been discovered in the company’s own display drivers and other software.

Security, Hardware, Nvidia, GPUs, Software

216 points by wumeow 350 days ago | 148 comments