Hacker News with Generative AI: Cloud Computing

The Anatomy of a Durable Execution Stack from First Principles (restate.dev)
We dive into the architecture details of Restate, a Durable Execution engine we built from the ground up. Restate requires no database/log or other system, but implements a full stack that competes with the best logs in terms of durability and operations.
Ask HN: Thoughts on using American-based cloud vendors in the Trump era? (ycombinator.com)
I think American cloud providers offer state‐of‐the‐art infrastructure but it's fair to question whether the current U.S. political environment—and the unpredictability it sometimes brings—could affect data sovereignty and regulatory stability.
AWS blocking troubshooting docs behind paid premium support plan (reddit.com)
When did AWS decide that troubeshooting docs/articles require you to have a paid premium support plan....like seriously who thought this was a good idea
Why We're Building for On-Prem (oxla.com)
Why we’re building for on-prem
Debugging Hetzner: Uncovering failures with powerstat, sensors, and dmidecode (ubicloud.com)
At Ubicloud, we build software that turns bare metal providers into cloud platforms. One of the providers we like is Hetzner because of their affordable and reliable servers.
Amazon Web Services in Plain English (2025) (expeditedsecurity.com)
But with 50 plus opaquely named services, we decided that enough was enough and that some plain english descriptions were needed.
AWS paywalling select knowledge base articles, requiring Premium Support plan (repost.aws)
I use kubectl commands to connect to the Amazon Elastic Kubernetes Service (Amazon EKS) API server. I received the message "error: You must be logged in to the server (Unauthorized)".
Building an Open, Multi-Engine Data Lakehouse with S3 and Python (tower.dev)
The idea of open, multi-engine data lakehouses is gaining momentum in the data industry.
What Comes After Kubernetes? (2023) (mattrickard.com)
Few projects are ever “finished”, but the Kubernetes APIs have reached a steady state. Core APIs are well into v1 and the extensibility model (custom resource definitions) is stable.
Vercel Fluid Compute (vercel.com)
The power of servers, in serverless form
We were wrong about GPUs (fly.io)
We’re building a public cloud, on hardware we own. We raised money to do that, and to place some bets; one of them: GPU-enabling our customers. A progress report: GPUs aren’t going anywhere, but: GPUs aren’t going anywhere.
John Resig on Using Cloudflare Workers (twitter.com)
Comments on Shared Unix Hosting vs. the Cloud (oils.pub)
This is a follow-up to Comments on Scripting, CGI, and FastCGI, from June (lobste.rs comments).
Tolerating full cloud outages with Monzo Stand-in (monzo.com)
Our customers reasonably expect to be able to spend on their card, make bank transfers and pay their bills 24 hours a day, 365 days a year. Their lives don’t have downtime for maintenance so nor should we. We dedicate a lot of our engineering effort to minimise the risk of downtime during technical migrations and other day-to-day operations, but unforeseen incidents that cause unexpected outages are impossible to eliminate entirely.
Will AI Agents Revolutionize How We Query and Use Data? (ycombinator.com)
Snowflake just announced AI Data Agents in Cortex, a new way to automate and streamline data workflows with AI.
SAP Databricks (databricks.com)
How to run Firecracker without KVM on cloud VMs (alexellis.io)
In this post I want to introduce a novel way to run virtual machines, namely microVMs on cloud VMs where KVM is not available.
Google Cloud will start charging $1.50 / alert (cloud.google.com)
Starting no sooner than May 1, 2026, Cloud Monitoring will begin charging for the use of alerting policies.
WASM-Native Orchestration (wasmcloud.com)
wasmCloud is an open source project from the Cloud Native Computing Foundation (CNCF) that enables teams to build polyglot applications composed of reusable Wasm components and run them—resiliently and efficiently—across any cloud, Kubernetes, datacenter, or edge.
WASM will replace containers (creston.blog)
In the year 2030, no one will remember Kubernetes.
Ask HN: What do you run instead of Datadog? (ycombinator.com)
Datadog has turned into an ever loving piece of shit. I am sick of their sales team grabbing us by the ankles and "Accidentally" charging for services we don't use. Now, this morning they changed something with their AWS integration that is causing 10X the API calls against our accounts (and thus, 10X guardduty costs on our end analyzing those API requests).
Canonical announces 12 year Kubernetes LTS (canonical.com)
Canonical’s Kubernetes LTS (Long Term Support) will support FedRAMP compliance and receive at least 12 years of committed security maintenance and enterprise support on bare metal, public clouds, OpenStack, Canonical MicroCloud and VMware.
Meta’s Hyperscale Infrastructure: Overview and Insights (cacm.acm.org)
Hyperscalers, such as Alibaba, Amazon, ByteDance, Google, Meta, Microsoft, and Tencent, have developed planetary-scale infrastructure to deliver cloud, Web, or mobile services to their global users.
SQL pipe syntax available in public preview in BigQuery (cloud.google.com)
Pipe query syntax is an extension to GoogleSQL that supports a linear query structure designed to make your queries easier to read, write, and maintain. You can use pipe syntax anywhere you write GoogleSQL.
Disaggregated OLTP Systems (transactional.blog)
These notes were first prepared for an informal presentation on the various cloud-native disaggregated OLTP RDBMS designs that have been getting published and it cherry-picked one paper per notable design decision. For the papers covered then, I’ve included a summary of the discussion we had after each paper. This page is being actively extended to cover all disaggregated OLTP papers, even for papers that are similar between two different vendors. ("Actively" meaning as of 2025-02-09.)
The missing tier for query compilers (scattered-thoughts.net)
Database query engines used to be able to assume that disk latency was so high that the overhead of interpreting the query plan didn't matter. Unfortunately these days a cheap nvme ssd can supply data much faster than a query interpreter can process it.
Intel ruined an Israeli startup it bought for $2B–and lost the AI race (calcalistech.com)
In December 2020, Amazon announced with great fanfare that it would use Gaudi chips from the small Israeli startup Habana Labs to train its large language models (LLMs) in the cloud.
Meta's Hyperscale Infrastructure: Overview and Insights (cacm.acm.org)
Hyperscalers, such as Alibaba, Amazon, ByteDance, Google, Meta, Microsoft, and Tencent, have developed planetary-scale infrastructure to deliver cloud, Web, or mobile services to their global users.
Servers can last a long time (world.hey.com)
We bought sixty-one servers for the launch of Basecamp 3 back in 2015. Dell R430s and R630s, packing thousands of cores and terabytes of RAM. Enough to fill all the app, job, cache, and database duties we needed. The entire outlay for this fleet was about half a million dollars, and it's only now, almost a decade later, that we're finally retiring the bulk of them for a full hardware refresh. What a bargain!
Amazon Will Spend Nearly a Year of AWS Revenue on AI Investments (nextplatform.com)
There is a bit of AI spending one-upmanship going on among the hyperscalers and cloud builders – and now the foundation model builders who are partnering with their new sugar daddies to be able to afford to build vast AI accelerator estates to push the state of the art in model capabilities and intelligence.