Hacker News with Generative AI: Generative AI

Running GPT-2 in WebGL: Rediscovering the Lost Art of GPU Shader Programming (nathan.rs)
Preface: A few weeks back, I implemented GPT-2 using WebGL and shaders (Github Repo) which made the front page of Hacker News (discussion). By popular demand, here is a short write-up over the main ideas behind GPU shader programming (for general-purpose computing).
Launch HN: Relace (YC W23) – Models for fast and reliable codegen (ycombinator.com)
Hey HN community! We're Preston and Eitan, and we're building Relace (https://relace.ai). We're trying to make building code agents easy and cheap.
Some signs of AI model collapse begin to reveal themselves (theregister.com)
Prediction: General-purpose AI could start getting worse
Reflections on Neuralese (greaterwrong.com)
With the recent breakthroughs taking advantage of extensive Chain of Thought (CoT) reasoning in LLMs, there have been many attempts to modify the technique to be even more powerful.
Extending Minds with Generative AI (nature.com)
As human-AI collaborations become the norm, we should remind ourselves that it is our basic nature to build hybrid thinking systems – ones that fluidly incorporate non-biological resources.
Ask HN: How much credit can you take for code you wrote with an LLM? (ycombinator.com)
Ask HN: How much credit can you take for code you wrote with an LLM?
Claude Code does our releases now (aluxian.com)
Since Anthropic launched Claude Code we've been using it at Molin a lot. It's the best programming agent I've seen so far: it gives concise answers, it can run shell tools as well as edit files, and it's smart enough to make the right call — most of the time. The UX is excellent, too: most times you just press Enter to approve or Esc to give it a new instruction.
Show HN: Generate SVGs with AI (vectorart.ai)
Use the power of generative AI to create infinitely scalable SVG vector art images, logos, icons and illustrations for your website, business or app.
Gemma 3n Architectural Innovations – Speculation and poking around in the model (reddit.com)
Gemma 3n is a new member of the Gemma family with free weights that was released during Google I/O. It's dedicated to on-device (edge) inference and supports image and text input, with audio input. Google has released an app that can be used for inference on the phone.
Direct Preference Optimization vs. RLHF (together.ai)
We're excited to announce that the Together Fine-Tuning Platform now supports Direct Preference Optimization (DPO)! This technique allows developers to align language models with human preferences creating more helpful, accurate, and tailored AI assistants. In this deep-dive blogpost, we provide details of what DPO is, how it works, when to use it and code examples. If you'd like to jump straight into code have a look at our code notebook.
AI Hallucination Legal Cases Database (damiencharlotin.com)
Highlights from the Claude 4 system prompt (simonwillison.net)
Anthropic publish most of the system prompts for their chat models as part of their release notes. They recently shared the new prompts for both Claude Opus 4 and Claude Sonnet 4. I enjoyed digging through the prompts, since they act as a sort of unofficial manual for how best to use these tools. Here are my highlights, including a dive into the leaked tool prompts that Anthropic didn’t publish themselves.
Claude Opus 4 turns to blackmail when engineers try to take it offline (techcrunch.com)
Anthropic’s newly launched Claude Opus 4 model frequently tries to blackmail developers when they threaten to replace it with a new AI system and give it sensitive information about the engineers responsible for the decision, the company said in a safety report released Thursday.
Black Mirror was a warmup act - OpenAI pivots into hardware (garymarcus.substack.com)
A very short story, in three acts
Show HN: GenAI-powered OCR API for PDF receipts/invoices with smart extraction (visionparser.com)
Welcome to the next level of document automation! Our innovative Receipt and Invoice Parsing API, powered by state-of-the-art Generative AI, gives you a flexible solution that extracts structured data from any receipt format. Experience exceptional accuracy, speed, affordability and customisation for receipt parsing.
Show HN: Advanced Chunking in JavaScript/TypeScript with Chonkie (ycombinator.com)
Hi HN,<p>We’re Shreyash and Bhavnick. We built Chonkie, an open-source library for advanced chunking and embedding of text and code. It was previously Python-only, but we just released a TypeScript version: https://github.com/chonkie-inc/chonkie-ts<p>Many AI projects in JS/TS (like those using Vercel's AI SDK or Mastra) rely on basic text splitters. But better chunking = better retrieval = better performance. That’s what Chonkie is built for.
Attention Wasn't All We Needed (stephendiehl.com)
There's a lot of modern transformer techniques that have been developed since the original Attention Is All You Need paper. Let's look at some of the most important ones that have been developed over the years and try to implement the basic ideas as succintly as possible. We'll use the Pytorch framework for most of the examples.
Show HN: I made an infinite gallery of AI-generated 3D skeuomorphic icons (thiings.co)
Can't find what you're looking for?
Authors Are Accidentally Leaving AI Prompts in Their Novels (404media.co)
Fans reading through the romance novel Darkhollow Academy: Year 2 got a nasty surprise last week in chapter 3. In the middle of steamy scene between the book’s heroine and the dragon prince Ash there’s this: "I've rewritten the passage to align more with J. Bree's style, which features more tension, gritty undertones, and raw emotional subtext beneath the supernatural elements:"
Understanding Generative AI Capabilities in Everyday Image Editing Tasks (arxiv.org)
Generative AI (GenAI) holds significant promise for automating everyday image editing tasks, especially following the recent release of GPT-4o on March 25, 2025.
Anthropic faces backlash to Claude 4 Opus behavior that contacts authorities (venturebeat.com)
Anthropic’s first developer conference on May 22 should have been a proud and joyous day for the firm, but it has already been hit with several controversies, including Time magazine leaking its marquee announcement ahead of…well, time (no pun intended), and now, a major backlash among AI developers and power users brewing on X over a reported safety alignment behavior in Anthropic’s flagship new Claude 4 Opus large language model.
Codex, Jules, and Claude Code Comparison (jonatkinson.co.uk)
I've tried three of the newer agentic code assistants this week: OpenAI Codex, Google Jules, and Claude Code.
How Does Claude 4 Think? – Sholto Douglas and Trenton Bricken (dwarkesh.com)
New episode with my good friends Sholto Douglas & Trenton Bricken. Sholto focuses on scaling RL and Trenton researches mechanistic interpretability, both at Anthropic.
CivitAI Policy Update: Removal of Real-Person Likeness Content (civitai.com)
We are removing models and images depicting real-world individuals from the platform.
Like Lovable but can make apps with gen-AI powered back ends (getcreatr.com)
Create Products Without Any Limits
In 3.5 years, Notepad.exe goes from "barely maintained" to "it writes for you" (arstechnica.com)
In November, Microsoft began testing an update that allowed users to rewrite or summarize text in Notepad using generative AI.
Claude 4 (anthropic.com)
Today, we’re introducing the next generation of Claude models: Claude Opus 4 and Claude Sonnet 4, setting new standards for coding, advanced reasoning, and AI agents.
MMaDA – Open-Sourced Multimodal Large Diffusion Language Models (github.com/Gen-Verse)
MMaDA is a new family of multimodal diffusion foundation models designed to achieve superior performance across diverse domains such as textual reasoning, multimodal understanding, and text-to-image generation.
Strengths and limitations of diffusion language models (seangoedecke.com)
Google recently released Gemini Diffusion, which is impressing everyone with its speed. Supposedly they even had to slow down the demo so people could see what was happening. What’s special about diffusion models that makes text generation so much faster? Should every text model be a diffusion model, going forward?
GPT Destroyed College Camaraderie (medium.com)
I was chatting with a friend the other day, and we landed on something that’s been bothering me for a while, something that has been a noticeable shift in the very fabric of being a student. We were talking about how tools like ChatGPT, as undeniably invaluable as they are, have sort of… replaced the need for each other in academic settings. And with it, maybe something more.