Hacker News with Generative AI: Image Generation

Building an agentic image generator that improves itself (trybezel.com)
Our exploration into agentic image generation revealed quite a bit about multimodal evaluators and editors.
Veo 3 and Imagen 4, and a new tool for filmmaking called Flow (google)
Introducing Veo 3 and Imagen 4, and a new tool for filmmaking called Flow.
Hunyuan Image 2.0: Real-Time AI Image Generator (hunyuan-image.com)
Experience the future of AI image generation with millisecond-level response. Our breakthrough technology delivers 15x faster performance than industry standards, with 95%+ accuracy in GenEval benchmarks. Join thousands of creators who are transforming their creative workflow.
Freepik releases an 'open' AI image generator trained on licensed data (techcrunch.com)
Freepik, the online graphic design platform, unveiled a new “open” AI image model on Tuesday that the company says was trained exclusively on commercially licensed, “safe-for-work” images.
GPT Image prompted to "create the exact replica of this image" 74 times (reddit.com)
ChatGPT Omni prompted to "create the exact replica of this image, don't change a thing" 74 times
Ask HN: Why does OpenAI require an ID to use their image API? (ycombinator.com)
I was hoping to try out the “Open”AI image generation API in their Playground but it asked me to give them a copy of my ID for verification?
Stable Diffusion Comparison in C, Rust and Ruby (leetarxiv.substack.com)
Every blog, and every tutorial on Stable Diffusion (and diffusion models in general) is written in Python, or incoherent math symbols.
Show HN: I built an AI image generator lets you create styled images in seconds (4oimage.site)
4oimage makes AI generation simple and fun. Create high-quality original images with just one click, choose from multiple artistic styles, and craft amazing visual works without any specialized skills.
Visual Reasoning Is Coming Soon (arcturus-labs.com)
I gotta say – I love it living in exponential times. I can just wish that something existed and then within a month it does! This time it happened with OpenAI's 4o image generation release. In this blog post I'll briefly cover the release and why I think it's pretty cool. Then I'll dive into a new opportunity that I think is even more exciting – visual reasoning.
This Bench Does Not Exist (openbenches.org)
A StyleGAN3 generated image based on 25,000 photos from OpenBenches.
No elephants: Breakthroughs in image generation (oneusefulthing.org)
Over the past two weeks, first Google and then OpenAI rolled out their multimodal image generation abilities. This is a big deal.
Mirrors: The Blind Spot of Image and Video Generation Models (medium.com)
Recent advances in image generation models have demonstrated remarkable capabilities in creating photorealistic and imaginative visuals. However, a persistent challenge remains: accurately rendering reflections in mirrors.
OpenAI just made it harder to turn your pics into Studio Ghibli-style images (aol.com)
OpenAI has blocked some users' requests for Studio Ghibli-style images.
Vibe marketing prompts for OpenAI's new model (indiehackers.com)
OpenAI just launched a powerful new image generation model.
Show HN: Fingernotes – handwritten notes which become their own preview image (fingernotes.com)
Experiment with Gemini 2.0 Flash native image generation (googleblog.com)
In December we first introduced native image output in Gemini 2.0 Flash to trusted testers. Today, we're making it available for developer experimentation across all regions currently supported by Google AI Studio.
InstantStyle: Free Lunch Towards Style-Preserving in Text-to-Image Generation (github.com/instantX-research)
InstantStyle is a general framework that employs two straightforward yet potent techniques for achieving an effective disentanglement of style and content from reference images.
Biases in Apple's Image Playground (giete.ma)
Although Image Playground is heavily restricted, and we do not have direct access to the underlying model, can we still use the prompting interface with the above image input to influence the skin tone of the resulting image? Turns out we can, and in precisely the biased way most image models behave 🤦‍♂️.
Show HN: Generate Notion-style line avatars using AI (open source) (lineavatars.com)
Upload or capture a photo of your face, preferably at a slight angle. Then we’ll use AI to generate a line avatar for you. For free!
Visuali.io – A cross platform online infinite canvas AI image sandbox (visuali.io)
Turn your imagination into reality with Visuali's AI-powered generative art tools
How to Train an AI Image Model on Yourself (coryzue.com)
Yesterday I had a couple hours to kill and decided to try out a project I’ve been meaning to explore for a long time: training my own AI image model so I can generate pictures of myself that look like this:
TokenVerse: Multi-Concept Personalization in Token Modulation Space by Google (token-verse.github.io)
We present TokenVerse -- a method for multi-concept personalization, leveraging a pre-trained text-to-image diffusion model.
Show HN: I Made a SOTA Affordable Midjourney Alternative, Subscription Trap Free (ayecreate.ai)
AyeCreate makes it easy to Create, Generate and Craft Piece of Content, Designs or Media using State of the Art AI
Mann-E, breakthrough image generation platform with crypto payments (mann-e.com)
Our AI-powered platform helps you create stunning, unique images in seconds. Whether you're an artist, designer, or just looking to explore your creativity, we've got you covered.
NeuralSVG: An Implicit Representation for Text-to-Vector Generation (sagipolaczek.github.io)
NeuralSVG generates vector graphics from text prompts with ordered and editable shapes.
How to generate OpenGraph images with Astro and Satori (skyfall.dev)
Generating OpenGraph images for your Astro site is an easy way to increase click-through rates and make link previews more appealing. Here's how to set them up!
1.58-Bit Flux (chenglin-yang.github.io)
We present 1.58-bit FLUX, the first successful approach to quantizing the state-of-the-art text-to-image generation model, FLUX.1-dev, using 1.58-bit weights (i.e., values in {-1, 0, +1}) while maintaining comparable performance for generating 1024 x 1024 images.
NYT Quiz: Which Parts of These Images Are A.I.-Generated? (nytimes.com)
Artificial intelligence tools can fabricate entirely new images and videos. But they can now also make much smaller tweaks by inserting A.I. elements into genuine photographs, further blurring the line between what’s real and what’s fake.
ByteDance INFP: The AI That Brings Images to Life (pdftranslate.ai)
Bytedance has introduced INFP, a powerful AI that can turn any single image into a lively character that can talk, sing, and interact with its surroundings.
AI model for near-instant image creation on consumer-grade hardware (surrey.ac.uk)
A groundbreaking AI model that creates images as the user types, using only modest and affordable hardware, has been announced by the Surrey Institute for People-Centred Artificial Intelligence (PAI) at the University of Surrey.