Hacker News with Generative AI: Image Processing

Deformable Image Registration KU Repository (github.com/ThomasAlscher1991)
The Deformable Image Registration KU repository contains software developed at the Department of Computer Science at the University of Copenhagen dealing with flow based image registration.
Combining 15s interval whole-sky-camera photos to form a 4y spanning keogram (astrodon.social)
Racket-embed-images-in-source-text (github.com/shriram)
Racket enables you to embed images in the document source. The moment you do, however, the file becomes a different format (WXME). In particular, this format is effectively binary, which means it doesn't work well with tools like grep, git, etc.
Sub-pixel distance transform (2023) (acko.net)
This page includes diagrams in WebGPU, which has limited browser support. For the full experience, use Chrome on Windows or Mac, or a developer build on other platforms.
Spectral Imaging Made Easy: A Powerful Python Library (github.com/siapy)
Spectral imaging analysis for Python (SiaPy) is a tool for efficient processing of spectral images.
Show HN: Jp3g: a fast private bulk image to JPEG/WebP converter (jp3g.org)
Attribute Extraction from Images Using DSPy (langtrace.ai)
DSPy recently added support for VLMs in beta. A quick thread on attributes extraction from images using DSPy. For this example, we will see how to extract useful attributes from screenshots of websites
Memory-safe PNG decoders now outperform C PNG libraries (reddit.com)
World Labs: Generate 3D worlds from a single image (worldlabs.ai)
Homography Explained with Code (opencv.org)
This tutorial will demonstrate the basic concepts of the homography with some codes.
Watermark Anything (github.com/facebookresearch)
Implementation and pretrained models for the paper Watermark Anything. Our approach allows for embedding (possibly multiple) localized watermarks into images.
FLUX1.1 a Prompt Like "IMG_1018.CR2" (twitter.com)
Physics-informed Shadowgraph Network: End-to-end Density Field Reconstruction (arxiv.org)
This study presents a novel approach for quantificationally reconstructing density fields from shadowgraph images using physics-informed neural networks
BC7 optimal solid-color blocks (wordpress.com)
That’s right, it’s another texture compression blog post! I’ll keep it short. By “solid-color block”, I mean a 4×4 block of pixels that all have the same color. ASTC has a dedicated encoding for these (“void-extent blocks”), BC7 does not. Therefore we have an 8-bit RGBA input color and want to figure out how to best encode that color with the encoding options we have.
Leopard: A Vision Language Model for Text-Rich Multi-Image Tasks (arxiv.org)
Text-rich images, where text serves as the central visual element guiding the overall understanding, are prevalent in real-world applications, such as presentation slides, scanned documents, and webpage snapshots.
Show HN: AI Image Upscaler and Photo Enhancer with up to 10x resolution boost (imageupscaler.io)
Improve image quality, resolution, and clarity easily with our advanced AI image upscaler and photo enhancer.
The magic (image resampling) kernel (johncostella.com)
“The magic kernel” is, today, a colloquial name for Magic Kernel Sharp, a world-beating image resizing algorithm, superior to the popular Lanczos kernels, which has helped power the two largest social media photo sites in the world, Facebook and Instagram.
Vecint: Average Color (wunkolo.github.io)
In a previous post, I used Intel’s AMX instructions intended for AI/ML use-cases to take the average color of an image.
Building a compile-time SIMD optimized smoothing filter (scientificcomputing.rs)
I built a Savitzky-Golay filter (fancy name for a dot product with some known constants on a rolling window) and tried to optimize the crap out of it.
Show HN: Fast and Exact Algorithm for Image Merging (github.com/C-Naoki)
This is a python implementation for stitching images by automatically searching for overlap region.
New AI diffusion model approach solves the aspect ratio problem (news.rice.edu)
Parallel PNG Proposal (2021) (github.com/DavidBuchanan314)
This is a proof-of-concept implementation of a parallel-decodable PNG format, based on ideas from https://github.com/brion/mtpng
Tutorial on diffusion models for imaging and vision (arxiv.org)
The astonishing growth of generative tools in recent years has empowered many exciting applications in text-to-image generation and text-to-video generation.
GraphicsMagick – a Swiss army knife of image processing (graphicsmagick.org)
GraphicsMagick is the swiss army knife of image processing.
sRGB Gamut Clipping (2021) (bottosson.github.io)
Conversion to Grayscale (2011) (entropymine.com)
Show HN: I built image converters site that run in the browser (dynapik.com)
Splatt3R: Zero-Shot Gaussian Splatting from Uncalibrated Image Pairs (active.vision)
Show HN: Remove-bg – open-source remove background using WebGPU (bannerify.co)
Ethically Sourced Lena Picture (mortenhannemose.github.io)