Hacker News with Generative AI: Diffusion Models

Why I find diffusion models interesting? (rnikhil.com)
I stumbled across this tweet a week or so back where this company called Inception Labs released a Diffusion LLM (dLLM). Instead of being autoregressive and predicting tokens left to right, here you start all at once and then gradually come up with sensible words simultaneously (start/finish/middle etc. all at once). Something which worked historically for image and video models is now outperforming similar-sized LLMs in code generation.
LLM generates the entire output at once (first diffusion LLM) [video] (youtube.com)
First large diffusion-based LLM (twitter.com)
Something went wrong, but don’t fret — let’s give it another shot.
Large Language Diffusion Models (ml-gsai.github.io)
TL;DR: We introduce LLaDA, a diffusion model with an unprecedented 8B scale, trained entirely from scratch, rivaling LLaMA3 8B in performance.
Diffusion Without Tears (notion.site)
Stable Flow: Vital Layers for Training-Free Image Editing (omriavrahami.com)
Diffusion models have revolutionized the field of content synthesis and editing. Recent models have replaced the traditional UNet architecture with the Diffusion Transformer (DiT), and employed flow-matching for improved training and sampling. However, they exhibit limited generation diversity. In this work, we leverage this limitation to perform consistent image edits via selective injection of attention features.
Diffusion training from scratch on a micro-budget (github.com/SonyResearch)
This repository provides a minimalistic implementation of our approach to training large-scale diffusion models from scratch on an extremely low budget.
Pulsar: Secure Steganography for Diffusion Models (iacr.org)
Widespread efforts to subvert access to strong cryptography has renewed interest in steganography, the practice of embedding sensitive messages in mundane cover messages.
Diffusion models are real-time game engines (gamengen.github.io)
DifuzCam: Replacing Camera Lens with a Mask and a Diffusion Model (arxiv.org)
Diffusion Training from Scratch on a Micro-Budget (arxiv.org)
Diffusion Texture Painting (nvidia.com)
CAT3D: Create Anything in 3D with Multi-View Diffusion Models (cat3d.github.io)