Hacker News with Generative AI: Data Structures

Implementing complex numbers and FFT with just datatypes (2023) (github.com)
In this article, I'll explain why implementing numbers with just algebraic datatypes is desirable.

Functional Programming, Data Structures, Mathematics

38 points by surprisetalk 58 days ago | 3 comments

Dividing an array into fair sized chunks (lemire.me)
Suppose that you have an array of N elements and you want to divide it into M chunks. It is a common task when trying to spread N units of work over M threads, for example.

Programming, Algorithms, Arrays, Data Structures

10 points by signa11 60 days ago | 0 comments

A Run of CRDT Posts (jhellerstein.github.io)
Over the next few days, I'm going to post a number of observations about CRDTs: Convergent Conflict-free Replicated Data Types. These are data structures that aspire to help us with coordination-free distributed programming, a topic that interests me a lot. How can developers (or languages/compilers) deliver distributed programs that are safe or correct in important ways, without employing expensive mechanisms for coordination that make the global cloud run as slowly as a sequential computer?

Distributed Programming, Data Structures, Software Development, Programming Languages

11 points by pfarago 61 days ago | 1 comments

Kangaroo: A flash cache optimized for tiny objects (2021) (engineering.fb.com)
Kangaroo is a new flash cache that enables more efficient caching of tiny objects (objects that are ~100 bytes or less) and overcomes the challenges presented by existing flash cache designs.

Software, Caching, Facebook, Optimization, Data Structures

22 points by PaulHoule 61 days ago | 1 comments

CRDTs: Pros and Cons (Lattices and Lettuces?) (jhellerstein.github.io)
Over the next few days, I'm going to post a number of observations about CRDTs: Convergent Replicated Data Types. These are data structures that aspire to help us with coordination-free distributed programming, a topic that interests me a lot. How can developers (or languages/compilers) deliver distributed programs that are safe or correct in important ways, without employing expensive mechanisms for coordination that make the global cloud run as slowly as a sequential computer?

Distributed Systems, Programming, Data Structures, Computer Science, Coordination

6 points by KraftyOne 62 days ago | 0 comments

Optimal bounds for open addressed hash tables without reordering (arxiv.org)
In this paper, we revisit one of the simplest problems in data structures: the task of inserting elements into an open-addressed hash table so that elements can later be retrieved with as few probes as possible.

Data Structures, Hash Tables, Algorithms, Computer Science

4 points by fanf2 65 days ago | 0 comments

Wavelet Trees: An Introduction (2011) (alexbowe.com)
Today I will talk about an elegant way of answering rank queries on sequences over larger alphabets – a structure called the Wavelet Tree.

Data Structures, Algorithms, Computer Science

56 points by Tomte 68 days ago | 16 comments

Show HN: Slice-tree – A piece table data structure implemented using RB tree (github.com/eu-ge-ne)
In computing, a piece table is a data structure typically used to represent a text document while it is edited in a text editor.

Data Structures, Programming, Software, Algorithms

3 points by eu-ge-ne 73 days ago | 0 comments

Adaptive Hashing (quotenil.com)
At the 2024 ELS, I gave a talk on adaptive hashing, which focusses on making general purpose hash tables faster and more robust at the same time.

Data Structures, Hashing, Performance

56 points by varjag 75 days ago | 8 comments

Reservoir Sampling (samwho.dev)
Reservoir sampling is a technique for selecting a fair random sample when you don't know the size of the set you're sampling from.

Data Structures, Algorithms, Randomness, Sampling

532 points by chrisdemarco 75 days ago | 105 comments

The Design of Compact Elastic Binary Trees (Cebtree) (blogspot.com)
Those who often hear me discuss about my week-end projects have been accustomed to hearing about deuterium fusion (that's for another post), laser engraving, and the compact version of the ebtrees, aka compact elastic binary trees, without knowing all the details. That's what we'll be discussing here.

Data Structures, Algorithms, Computer Science

36 points by r4um 78 days ago | 0 comments

Adaptive Hashing (quotenil.com)
At the 2024 ELS, I gave a talk on adaptive hashing, which focusses on making general purpose hash tables faster and more robust at the same time.

Hashing, Data Structures, Computer Science, Algorithms

10 points by todsacerdoti 81 days ago | 1 comments

Bloom Filters (thegreenplace.net)
The original motivation for the creation of Bloom filters is efficient set membership, using a probabilistic approach to significantly reduce the time and space required to reject items that are not members in a certain set.

Data Structures, Algorithms, Probabilistic Methods

240 points by mfrw 82 days ago | 77 comments

Swarm Testing Data Structures (tigerbeetle.com)
We discovered a cute little pattern the other day when refactoring TigerBeetle’s intrusive queue — using Zig’s comptime reflection for exhaustively testing data structure’s public API. Isn’t it cool when your property test fails when you add a new API, because “public API is tested” is one of the properties you test?!

Software, Data Structures, Testing, Programming Languages

12 points by todsacerdoti 83 days ago | 1 comments

Gems in Geospatial Indexing (chaos.social)

Geospatial Indexing, Databases, Data Structures, Performance

5 points by altilunium 84 days ago | 0 comments

Programming languages should have a tree traversal primitive (tylerglaiel.com)
There should be a control flow construct in programming languages that can handle tree-like traversal in a nice way, similar to how for/foreach loops can handle linear traversal. It's a bit of a missing gap in the current set of control flow constructs most languages these days have settled on. Its a thing I end up having to do *all the time* and it seems like there should be some shortcuts for it.

Programming Languages, Software Design, Control Flow, Data Structures

288 points by azhenley 84 days ago | 201 comments

Programming languages should have a tree traversal primitive (tylerglaiel.com)
There should be a control flow construct in programming languages that can handle tree-like traversal in a nice way, similar to how for/foreach loops can handle linear traversal.

Programming Languages, Control Flow, Data Structures

12 points by TylerGlaiel 85 days ago | 2 comments

Packed Data Support in Haskell (arthi-chaud.github.io)
This blog post aims to be a short and accessible summary of a paper that will be published at ECOOP 2025, titled Type-safe and portable support for packed data.

Haskell, Programming Languages, Data Structures, Software Engineering

77 points by matt_d 85 days ago | 12 comments

Pahole: Analysing Memory Layout of Complex Data Structures with Ease (pramodkumbhar.com)
Sunday, 5th November 2023: Putting together this blog post feels like a positive stride! As I mentioned in the previous post, (core-to-core latency tool), I'm aiming to integrate more consistent writing into my routine. While it took a month to pen this down, it's progress from the previous year 😇. Hoping that the upward trend will continue...!

Memory Management, Software Development, Programming, Data Structures, Linux

19 points by todsacerdoti 95 days ago | 0 comments

Stop Writing `__init__` Methods (glyph.im)
YEARS OF DATACLASSES yet NO REAL-WORLD USE FOUND for overriding special methods just so you can have some attributes.

Python, Programming, Software Development, Data Structures

19 points by todsacerdoti 96 days ago | 6 comments

Show HN: Bptree – A B+ tree implementation in C (github.com/habedi)
Bptree is a lightweight single-header B+ tree implementation written in C.

Data Structures, C Programming, Open Source, Software, GitHub

3 points by habedi0 99 days ago | 0 comments

Zig's new LinkedList API (it's time to learn fieldParentPtr) (openmymind.net)
In a recent, post-Zig 0.14 commit, Zig's SinglyLinkedList and DoublyLinkedList saw significant changes.

Programming, New Releases, Data Structures, Zig

227 points by todsacerdoti 99 days ago | 178 comments

Fibonacci Hashing: The Optimization That the World Forgot (probablydance.com)
I recently posted a blog post about a new hash table, and whenever I do something like that, I learn at least one new thing from my comments. In my last comment section Rich Geldreich talks about his hash table which uses “Fibonacci Hashing”, which I hadn’t heard of before.

Hash Tables, Algorithms, Data Structures, Optimization

143 points by juancampa 100 days ago | 29 comments

Lisp Programs Don't Have Parentheses (blogspot.com)
Lisp programs don't have parentheses — they are made of nested linked lists. The parentheses only exist in the printed representation — the ASCII serialization — of a Lisp program. They tell the Lisp reader where the nested lists begin and end. Parenthesis are the contour lines in the topographic map of your Lisp program.

Programming, Lisp, Data Structures

9 points by adunk 103 days ago | 3 comments

Stop Treating YAML Like a String (theyamlengineer.com)
Koreo is a data structure orchestration engine. Although it's primarily designed for Kubernetes resource orchestration, Koreo's core functionality can orchestrate and manage virtually any structured data. What Koreo provides today, however, is a new approach to Kubernetes configuration management empowering developers and platform teams through programmable workflows. This approach draws upon the strengths of existing tools like Helm, Kustomize, and Crossplane while addressing some of their limitations.

Kubernetes, Configuration Management, Data Structures, Programming Languages, DevOps

23 points by tylertreat 105 days ago | 0 comments

Vector Sets are part of Redis (antirez.com)
Yesterday we finally merged vector sets into Redis, here you can find the README that explains in detail what you get:

Redis, Databases, Data Structures

7 points by zX41ZdbW 107 days ago | 0 comments

Two Attacks on Naive Tree Hashes (jacko.io)

Cryptography, Security, Data Structures

6 points by oconnor663 114 days ago | 0 comments

A diagram of C23 basic types (wordpress.com)
This week on the C committee mailing list we had a discussion about how C’s types are organized into different categories. At the end I came up with a diagram with that organization. It basically translates the section “6.2.5 Types” of the C23 standard into a graph of inclusions.

C Programming, Programming Languages, Data Structures, Diagrams, Standards

6 points by ingve 115 days ago | 0 comments

Writing a tiny undo/redo stack in JavaScript (julik.nl)
I’ve needed this before - a couple of times. Third time I figured I needed something small, nimble - yet complete. And - at the same time - wondering about how to do it in a very simple manner. I think it worked out great, so let’s dig in.

JavaScript, Programming, Software Development, Data Structures

170 points by julik 121 days ago | 69 comments

Shift-to-Middle Array: A Faster Alternative to Std:Deque? (github.com/attilatorda)
The Shift-To-Middle Array is a dynamic array designed to optimize insertions and deletions at both ends, offering a high-performance alternative to std::deque, std::vector, and linked lists.

Data Structures, Algorithms, Performance, C++

128 points by AttilaT 121 days ago | 118 comments