Hacker News with Generative AI: Image Processing

Image to Text Converter (vheer.com)
Turn images into readable text with ease using Vheer’s AI-powered tool. Extract text from images in seconds and download it instantly. Perfect for business documents, handwritten notes, and more.

Image Processing, AI, OCR, Text Extraction, Business

14 points by vertex_steven 57 days ago | 4 comments

Bitmap Images in 1965 – Mariner 4 (oliverkwebb.github.io)
On July 14th, 1965, the Mariner 4 spacecraft flew by Mars, the first to ever reach the planet successfully.

Space Exploration, History, Image Processing, Mars

5 points by oliverkwebb 57 days ago | 0 comments

Are the Colors in Astronomical Images 'Real'? (scientificamerican.com)
In colorful photographs of galaxies, stars, planets, and more, what you see isn’t necessarily what you get.

Astronomy, Image Processing, Science, Space

19 points by bryanrasmussen 58 days ago | 30 comments

Depth Anything V2 (depth-anything-v2.github.io)
Depth Anything V2 is trained on 595K synthetic labeled images and 62M+ real unlabeled images, providing the most capable monocular depth estimation (MDE) model with the following features: more fine-grained details than Depth Anything V1; more robust than Depth Anything V1 and SD-based models (e.g., Marigold, Geowizard); more efficient (10x faster) and more lightweight than SD-based models; and impressive fine-tuned performance with our pre-trained models. We also release six metric depth models of three scales for indoor and outdoor scenes, respectively.

Computer Vision, Machine Learning, Open Source, Image Processing, Artificial Intelligence

5 points by Brajeshwar 62 days ago | 0 comments

What If Every Picture You've Ever Seen Already Exists? (ycombinator.com)
I was thinking recently about how images work at the data level, and it kind of broke my brain.

Data, Image Processing, Philosophy

20 points by cin4ed 63 days ago | 28 comments

ClawPDF – Open-Source Virtual/Network PDF Printer with OCR and Image Support (github.com/clawsoftware)
ClawPDF may seem like yet another Virtual PDF/OCR/Image Printer, but it actually comes packed with features that are typically found in enterprise solutions.

Open Source, PDF, OCR, Image Processing, Software

192 points by miles 64 days ago | 28 comments

Turning into Turing (2022) (jk-keller.com)
I stumbled into this one while working on a different project where I scripted the rotation of an image in preparation for an animation. When I looked at the last frame, though, I noticed the image looked washed out, with odd patterning.

Animation, Image Processing, Software Development

29 points by TrianguloY 77 days ago | 1 comment

How to reverse engineer AI models: a study on Google Photos (skyld.io)
Google Photos is one of the most widely-used photo management applications globally, pre-installed on almost every Android device running Google Mobile Services (GMS). It is appreciated by users because it offers powerful features like “Magic Eraser” and advanced AI-powered photo editing tools. Of course, Google doesn’t open-source its AI models to keep its competitive edge.

Reverse Engineering, AI Models, Google Photos, Image Processing

10 points by superkitten 84 days ago | 6 comments

Generative Modelling in Latent Space (sander.ai)
Most contemporary generative models of images, sound and video do not operate directly on pixels or waveforms. They consist of two stages: first, a compact, higher-level latent representation is extracted, and then an iterative generative process operates on this representation instead. How does this work, and why is this approach so popular?

Generative AI, Machine Learning, Image Processing, Sound Processing, Video Processing

15 points by xavriley 98 days ago | 0 comments

Watermark segmentation (github.com/Diffusion-Dynamics)
This repository by Diffusion Dynamics showcases the core technology behind the watermark segmentation capabilities of our first product, clear.photo. This work leverages insights from research on diffusion models for image restoration tasks.

Image Processing, Computer Vision, Diffusion Models, Open Source

32 points by abriosi 99 days ago | 21 comments

Doing the Prospero-Challenge in RPython (pypy.org)
Recently I had a lot of fun playing with the Prospero Challenge by Matt Keeter. The challenge is to render a 1024x1024 image of a quote from The Tempest by Shakespeare. The input is a mathematical formula with 7866 operations, which is evaluated once per pixel.

Programming, Challenges, Python, Image Processing, Shakespeare

29 points by tekknolagi 104 days ago | 2 comments

Estimating Camera Motion from a Single Motion-Blurred Image (jerredchen.github.io)
Given a single motion-blurred image, we exploit the motion blur cues to predict the camera velocity at that instant without performing any deblurring.

Computer Vision, Image Processing, Motion Estimation

71 points by smusamashah 116 days ago | 20 comments

Show HN: I built a tool to add noise texture to your images (vercel.app)

Image Processing, Tools, Show HN

53 points by Rayid 117 days ago | 40 comments

High-Performance PNG Decoding (blend2d.com)
It's been some time since I wrote about a High-Performance QOI Codec, which joined the other codecs offered by the Blend2D library in 2024. The development of image codecs has continued, and now I would like to announce a new high-performance PNG codec, which is much faster than other available codecs written in C, C++, and other programming languages.

Image Processing, Performance Optimization, PNG, C++, Programming Languages

84 points by PaulHoule 121 days ago | 8 comments

StarVector: Generating Scalable Vector Graphics Code from Images and Text (starvector.github.io)
StarVector represents a breakthrough in Scalable Vector Graphics (SVG) generation, seamlessly integrating visual and textual inputs into a unified foundation SVG model.

Image Processing, Text Processing, AI, Code Generation

72 points by lnyan 122 days ago | 7 comments

Image Dithering: Eleven Algorithms and Source Code (tannerhelland.com)
Today’s graphics programming topic - dithering - is one I receive a lot of emails about, which some may find surprising.

Graphics Programming, Algorithms, Source Code, Image Processing

9 points by ibobev 127 days ago | 0 comments

Compression of Spectral Images Using Spectral JPEG XL (jcgt.org)

Image Compression, Image Processing, JPEG XL

91 points by ksec 128 days ago | 13 comments

Paint.net 5.1.5 Is Now Available with JPEG XL Support (getpaint.net)
This update adds JPEG XL (*.jxl) support, improves quantization color quality, updates AVIF loading to better handle mapping HDR images to SDR, and fixes some bugs.

Software, Image Processing, New Releases, Bug Fixes

8 points by ksec 128 days ago | 0 comments

Arbitrary-Scale Super-Resolution with Neural Heat Fields (therasr.github.io)
Thera is the first arbitrary-scale super-resolution method with a built-in physical observation model.

Super-Resolution, Image Processing, Computer Vision, Artificial Intelligence, Neural Networks

151 points by 0x12A 129 days ago | 56 comments

Image Processing in C (2000) [pdf] (ed.ac.uk)

Image Processing, C Programming, Computer Science, Technical Documents, 1990s

139 points by nill0 130 days ago | 26 comments

Fast-PNG: PNG image decoder and encoder (github.com/image-js)
PNG image decoder and encoder written entirely in JavaScript.

JavaScript, Image Processing, PNG, Software, Open Source

48 points by javatuts 133 days ago | 20 comments

Dithering in Colour (obrhubr.org)
After reading a post on the HN frontpage from amanvir.com about dithering, I decided to join in on the fun. Here’s my attempt at implementing Atkinson dithering with support for colour palettes and correct linearisation.

Dithering, Image Processing, Programming, Color Theory

168 points by surprisetalk 134 days ago | 65 comments

Creating static map images with OpenStreetMap, Web Mercator, and Pillow (alexwlchan.net)
I’ve been working on a project where I need to plot points on a map. I don’t need an interactive or dynamic visualisation – just a static map with coloured dots for each coordinate.

Mapping, Python, OpenStreetMap, Image Processing, Visualization

5 points by todsacerdoti 137 days ago | 0 comments

Commodore 64 PETSCII Image (2022) (medium.com)
It is said that life begins at the age of 40 — I hope it is true … After 23 years of working in IT and spending the last 12 years in a company I co-founded, my contract was terminated, I lost my job and I suddenly found myself almost overnight in a situation where I have all day ahead with no schedule, no deadlines, no important phone calls, no one is waiting for my analysis, consultation or technical specification.

Commodore 64, Retro Computing, Pixel Art, Image Processing, Personal Reflections

57 points by erickhill 138 days ago | 12 comments

MS Paint IDE (ms-paint-i.de)
MS Paint IDE is a program that can read a normal image file saved with MS Paint, and can then translate it to text with the ability to highlight the text in the image, parse the code, compile and execute it.

Programming, Software, Image Processing, Tools

220 points by smusamashah 139 days ago | 75 comments

Dramatically improve microscope resolution with Fourier Ptychography [video] (youtube.com)

Microscopes, Optics, Image Processing, Science, Videos

5 points by MaximilianEmel 141 days ago | 1 comment

Encrypt Images Without a Key Using Visual Cryptography (github.com/coduri)
VisualCrypto is an open-source Python-based toolkit with a web interface designed for Visual Secret Sharing (VSS), a cryptographic technique that splits a secret image into multiple shares.

Cryptography, Python, Security, Open Source, Image Processing

8 points by italianguy 142 days ago | 2 comments

Show HN: Automated Sorting of group photos by user defined N people in each pic (github.com/Karvy-Singh)
Sort photos based on the criteria of "Me with my favorite people (x, y, z...)" out of a bunch of group photos/random photos.

Computer Vision, Image Processing, Machine Learning, Software

32 points by Karvy 168 days ago | 3 comments

Bilinear down/upsampling, aligning pixel grids, and that infamous GPU half pixel (2021) (bartwronski.com)
See this ugly pixel shift when upsampling a downsampled image? My post describes where it can come from and how to avoid it!

Image Processing, Computer Graphics, GPU, Pixel Manipulation, Downsampling/Upsampling

136 points by fanf2 176 days ago | 23 comments

Detecting edges of images at the speed of light (phys.org)
Physicists from the group of Jorik van de Groep at the UvA-Institute of Physics have devised a new method that can be used to detect edges of images in an extremely energy efficient and ultrafast way.

Image Processing, Physics, Computer Science

101 points by bookofjoe 176 days ago | 27 comments