Hacker News with Generative AI: Image Processing

Creating static map images with OpenStreetMap, Web Mercator, and Pillow (alexwlchan.net)
I’ve been working on a project where I need to plot points on a map. I don’t need an interactive or dynamic visualisation – just a static map with coloured dots for each coordinate.
Commodore 64 PETSCII Image (2022) (medium.com)
It is said that life begins at the age of 40 — I hope it is true … After 23 years of working in IT and spending the last 12 years in a company I co-founded, my contract was terminated, I lost my job and I suddenly found myself almost overnight in a situation where I have all day ahead with no schedule, no deadlines, no important phone calls, no one is waiting for my analysis, consultation or technical specification.
MS Paint IDE (ms-paint-i.de)
MS Paint IDE is a program that can read a normal image file saved with MS Paint, and can then translate it to text with the ability to highlight the text in the image, parse the code, compile and execute it.
Dramatically improve microscope resolution with Fourier Ptychography [video] (youtube.com)
Encrypt Images Without a Key Using Visual Cryptography (github.com/coduri)
VisualCrypto is an open-source Python-based toolkit with a web interface designed for Visual Secret Sharing (VSS), a cryptographic technique that splits a secret image into multiple shares.
Show HN: Automated Sorting of group photos by user defined N people in each pic (github.com/Karvy-Singh)
Sort photos based on the criteria of "Me with my favorite people (x, y, z...)" out of a bunch of group photos/random photos.
Bilinear down/upsampling, aligning pixel grids, and that infamous GPU half pixel (2021) (bartwronski.com)
See this ugly pixel shift when upsampling a downsampled image? My post describes where it can come from and how to avoid those!
Detecting edges of images at the speed of light (phys.org)
Physicists from the group of Jorik van de Groep at the UvA-Institute of Physics have devised a new method that can be used to detect edges of images in an extremely energy efficient and ultrafast way.
Subpixel Zoo: A Catalog of Subpixel Geometry (geometrian.com)
An image pixel might be a little square with a flat color[1], but to actually present the image on a physical display (or related technology), we illuminate discrete 'subpixels'. The perceived color in an area is then the addition of the subpixels' emissions in that area.
How hard would it be to display the contents of an image file on the screen? (nereid.pl)
How hard would it be to display the contents of an image file on the screen? You just load the image pixels somehow, perhaps using a readily available library, and then display those pixels on the screen. Easy, right? Well, not quite, as it turns out.
Deformable Image Registration KU Repository (github.com/ThomasAlscher1991)
The Deformable Image Registration KU repository contains software developed at the Department of Computer Science at the University of Copenhagen dealing with flow based image registration.
Combining 15s interval whole-sky-camera photos to form a 4y spanning keogram (astrodon.social)
Racket-embed-images-in-source-text (github.com/shriram)
Racket enables you to embed images in the document source. The moment you do, however, the file becomes a different format (WXME). In particular, this format is effectively binary, which means it doesn't work well with tools like grep, git, etc.
Sub-pixel distance transform (2023) (acko.net)
This page includes diagrams in WebGPU, which has limited browser support. For the full experience, use Chrome on Windows or Mac, or a developer build on other platforms.
Spectral Imaging Made Easy: A Powerful Python Library (github.com/siapy)
Spectral imaging analysis for Python (SiaPy) is a tool for efficient processing of spectral images.
Show HN: Jp3g: a fast private bulk image to JPEG/WebP converter (jp3g.org)
Attribute Extraction from Images Using DSPy (langtrace.ai)
DSPy recently added support for VLMs in beta. A quick thread on attributes extraction from images using DSPy. For this example, we will see how to extract useful attributes from screenshots of websites
Memory-safe PNG decoders now outperform C PNG libraries (reddit.com)
World Labs: Generate 3D worlds from a single image (worldlabs.ai)
Homography Explained with Code (opencv.org)
This tutorial will demonstrate the basic concepts of the homography with some codes.
Watermark Anything (github.com/facebookresearch)
Implementation and pretrained models for the paper Watermark Anything. Our approach allows for embedding (possibly multiple) localized watermarks into images.
FLUX1.1 a Prompt Like "IMG_1018.CR2" (twitter.com)
Physics-informed Shadowgraph Network: End-to-end Density Field Reconstruction (arxiv.org)
This study presents a novel approach for quantificationally reconstructing density fields from shadowgraph images using physics-informed neural networks
BC7 optimal solid-color blocks (wordpress.com)
That’s right, it’s another texture compression blog post! I’ll keep it short. By “solid-color block”, I mean a 4×4 block of pixels that all have the same color. ASTC has a dedicated encoding for these (“void-extent blocks”), BC7 does not. Therefore we have an 8-bit RGBA input color and want to figure out how to best encode that color with the encoding options we have.
Leopard: A Vision Language Model for Text-Rich Multi-Image Tasks (arxiv.org)
Text-rich images, where text serves as the central visual element guiding the overall understanding, are prevalent in real-world applications, such as presentation slides, scanned documents, and webpage snapshots.
Show HN: AI Image Upscaler and Photo Enhancer with up to 10x resolution boost (imageupscaler.io)
Improve image quality, resolution, and clarity easily with our advanced AI image upscaler and photo enhancer.
The magic (image resampling) kernel (johncostella.com)
“The magic kernel” is, today, a colloquial name for Magic Kernel Sharp, a world-beating image resizing algorithm, superior to the popular Lanczos kernels, which has helped power the two largest social media photo sites in the world, Facebook and Instagram.
Vecint: Average Color (wunkolo.github.io)
In a previous post, I used Intel’s AMX instructions intended for AI/ML use-cases to take the average color of an image.
Building a compile-time SIMD optimized smoothing filter (scientificcomputing.rs)
I built a Savitzky-Golay filter (fancy name for a dot product with some known constants on a rolling window) and tried to optimize the crap out of it.
Show HN: Fast and Exact Algorithm for Image Merging (github.com/C-Naoki)
This is a python implementation for stitching images by automatically searching for overlap region.