Hacker News with Generative AI: Generative AI

CivitAI Policy Update: Removal of Real-Person Likeness Content (civitai.com)
We are removing models and images depicting real-world individuals from the platform.
Like Lovable but can make apps with gen-AI powered back ends (getcreatr.com)
Create Products Without Any Limits
In 3.5 years, Notepad.exe goes from "barely maintained" to "it writes for you" (arstechnica.com)
In November, Microsoft began testing an update that allowed users to rewrite or summarize text in Notepad using generative AI.
Claude 4 (anthropic.com)
Today, we’re introducing the next generation of Claude models: Claude Opus 4 and Claude Sonnet 4, setting new standards for coding, advanced reasoning, and AI agents.
MMaDA – Open-Sourced Multimodal Large Diffusion Language Models (github.com/Gen-Verse)
MMaDA is a new family of multimodal diffusion foundation models designed to achieve superior performance across diverse domains such as textual reasoning, multimodal understanding, and text-to-image generation.
Strengths and limitations of diffusion language models (seangoedecke.com)
Google recently released Gemini Diffusion, which is impressing everyone with its speed. Supposedly they even had to slow down the demo so people could see what was happening. What’s special about diffusion models that makes text generation so much faster? Should every text model be a diffusion model, going forward?
GPT Destroyed College Camaraderie (medium.com)
I was chatting with a friend the other day, and we landed on something that’s been bothering me for a while, something that has been a noticeable shift in the very fabric of being a student. We were talking about how tools like ChatGPT, as undeniably invaluable as they are, have sort of… replaced the need for each other in academic settings. And with it, maybe something more.
Discrete Text Diffusion Explained (aaronlou.com)
Modern large language models (like ChatGPT) learn to generate new samples by modeling the data distribution of natural text. However, the underlying methodology has largely remained stagnant over the last century: although different architectures have been developed, models are all based on autoregressive modeling (i.e. next token prediction).
Aligning LLM Choice to Your Use Case: An Expert's Guide (oblivus.com)
Selecting the right large language model (LLM) is a critical decision that can make or break the success of your AI-driven project.
Gemini Diffusion (simonwillison.net)
Gemini Diffusion. Another of the announcements from Google I/O yesterday was Gemini Diffusion, Google's first LLM to use diffusion (similar to image models like Imagen and Stable Diffusion) in place of transformers.
An upgraded dev experience in Google AI Studio (googleblog.com)
Google AI Studio is the fastest place to start building with the Gemini API, with access to our most capable models, including Gemini 2.5 preview models, and generative media models like Imagen, Lyria RealTime, and Veo. At Google I/O, we announced new features to help you build and deploy complete applications, new model capabilities, and new features in the Google Gen AI SDK.
The Future of Newspapers Is Lies, I Guess (aphyr.com)
I subscribe to the Chicago Sun-Times, a non-profit newspaper. This week they sent me a sixty-four page special insert, branded with the Chicago Sun-Times logo, full of LLM nonsense.
Claude Code: Tutorials (anthropic.com)
Practical examples and patterns for effectively using Claude Code in your development workflow.
Show HN: OpenHands, an open source alternative to Devin, Codex, and Jules (github.com/All-Hands-AI)
Welcome to OpenHands (formerly OpenDevin), a platform for software development agents powered by AI.
Building an agentic image generator that improves itself (trybezel.com)
Our exploration into agentic image generation revealed quite a bit about multimodal evaluators and editors.
Ask HN: Generate LLM hallucination to detect students cheating (ycombinator.com)
I am thinking about adding a question that should induce an LLM to hallucinate a response. This method could detect students cheating. The best question would be one where students could not come up with a solution like the one provided by the LLM on their own. Any hints?
Agentic AI Systems – Tools, System Design, and Resources for AI Agents (github.com/alirezadir)
Welcome to Agentic-AI-Systems — a curated and evolving knowledge base for mastering Agentic and Generative AI system design!
Gemma 3n preview: Mobile-first AI (googleblog.com)
Following the exciting launches of Gemma 3 and Gemma 3 QAT, our family of state-of-the-art open models capable of running on a single cloud or desktop accelerator, we're pushing our vision for accessible AI even further.
Gemini Diffusion (deepmind.google)
Gemini 2.5: Our most intelligent models are getting even better (google)
Gemini 2.5 Pro continues to be loved by developers as the best model for coding, and 2.5 Flash is getting even better with a new update. We’re also bringing new capabilities to our models, including Deep Think, an experimental enhanced reasoning mode for 2.5 Pro.
Gemini 2.5 Flash Preview 05-20 (google.dev)
The Gemini API offers different models that are optimized for specific use cases.
OpenAI Codex hands-on review (zackproser.com)
The interface I want and the performance I'll have to wait for...
Actors' union complains about Epic Games cloning Darth Vader (theregister.com)
The union representing American actors has complained about Llama Productions, which is owned by Epic Games, over the use of generative AI in a new character for Fortnite.
Show HN: Engine – A multi-LLM alternative to Codex (enginelabs.ai)
Engine is a cloud-based AI software engineering agent that works without supervision to turn issues into pull requests.
ChatGPT shown to be more persuasive than people in online debates (phys.org)
Large language models (LLMs), such as GPT-4, are found to be more persuasive than humans 64% of the time in online debates when adapting their arguments on the basis of personalized information about their opponents, according to a study published in Nature Human Behaviour.
Heartland Gen Zers Feel Unprepared to Use AI at Work (gallup.com)
WASHINGTON, D.C. — As artificial intelligence continues to reshape the day-to-day workplace experience, about one-third of Gen Z adult workers living in America’s Heartland feel at least somewhat prepared to integrate artificial intelligence into their current jobs. Meanwhile, four in 10 Gen Z 5th- to 12th-grade students in the Heartland feel prepared to use AI in their future jobs.
Google Launches NotebookLM for Android (9to5google.com)
As previewed earlier this month, Google today released the NotebookLM app for Android ahead of I/O 2025.
Show HN: Mirror World, create an AI clone of anyone (mirr.world)
Oops, I accidentally vibe-coded a ChatGPT client for my Apple Watch (richarddas.com)
I had an idea bouncing around in my head, and as has become a habit, I like capturing ideas in GPT. Writing things down helps me declutter my brain, especially when I’m trying to wind down for the night.
Diffusion models explained simply (seangoedecke.com)
Transformer-based large language models are relatively easy to understand. You break language down into a finite set of “tokens” (words or sub-word components), then train a neural network on millions of token sequences so it can predict the next token based on all the previous ones. Despite some clever tricks (mainly about how the model processes the previous tokens in the sequence), the core mechanism is relatively simple.