Hacker News with Generative AI: Scaling

The Practical Guide to Scaling Django (slimsaas.com)
Most Django scaling guides focus on theoretical maximums. But real scaling isn’t about handling hypothetical millions of users - it’s about systematically eliminating bottlenecks as you grow. Here’s how to do it right, based on patterns that work in production.
Data movement bottlenecks to large-scale model training: Scaling past 1e28 FLOP (epochai.org)
Data movement bottlenecks limit LLM scaling beyond 2e28 FLOP, with a "latency wall" at 2e31 FLOP. We may hit these in ~3 years. Aggressive batch size scaling could potentially overcome these limits.
Possible futures for the Ethereum protocol, part 2: The Surge (eth.limo)
At the beginning, Ethereum had two scaling strategies in its roadmap. One (eg. see this early paper from 2015) was "sharding": instead of verifying and storing all of the transactions in the chain, each node would only need to verify and store a small fraction of the transactions. This is how any other peer-to-peer network (eg. BitTorrent) works too, so surely we could make blockchains work the same way.
Upgrading Uber's MySQL Fleet (uber.com)
What can we do to make games scale? (twitter.com)
Does It Scale (Down)? (bugsink.com)
It’s 2024, and software is in a ridiculous state.
How we built ngrok's data platform (ngrok.com)
At ngrok, we manage an extensive data lake with an engineering team of one (me!).
How Discord stores trillions of messages (2023) (discord.com)
In 2017, we wrote a blog post on how we store billions of messages.
Sharding and Scaling PostgreSQL, No Citus (pg-sharding.tech)
Scaling Rails and Postgres to millions of users at Microsoft (stepchange.work)
We run migrations across 2,800 microservices (monzo.com)
How OpenAI Scaled Kubernetes to 7,500 Nodes by Removing One Plugin (betterstack.com)
Scaling One Million Checkboxes to 650M checks (eieio.games)
SPQR 1.5.0: a production-ready system for horizontal scaling of PostgreSQL (github.com/pg-sharding)
Our Wandering Path to Supporting 1000s of Domain Names (fusionauth.io)
Building and scaling Notion's data lake (notion.so)
SPQR: Scaling PostgreSQL via Sharding (mintlify.app)
Will We Run Out of Data? Limits of LLM Scaling Based on Human-Generated Data (epochai.org)
Nexus zkVM: Efficient, massively-parallel, zero-knowledge proving (nexus.xyz)
Pinterest Scaled to 11M Users with Only 6 Engineers (medium.com)
Meritocracy at Scale (scale.com)
Scaling Clubhouse From 10K to 10M Users In 6 Months With Postgres (stepchange.work)
Autoscale Kubernetes workloads on any cloud using any event (kedify.io)
Scaling to Count Billions (canva.dev)
Using ClickHouse to scale an events engine (github.com/getlago)
Scaling will never get us to AGI (garymarcus.substack.com)