Hacker News with Generative AI: Rate Limiting

Use the Gemini API with OpenAI Fallback in TypeScript (sometechblog.com)
If you want to use Gemini’s public API, but at the same time have a safe fallback in case you have exhausted the rate limits, you can use the OpenAI TS/JS library and a few helper functions. In my particular case I needed a type-safe solution for a chartmaker app with a fallback, since Gemini’s gemini-2.5-pro-exp-03-25 model is restricted to 20 requests/min.
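The fallback idea in this excerpt can be sketched without tying it to any particular SDK. The snippet below is a minimal illustration, not the linked article's code: `withFallback` and `isRateLimitError` are hypothetical helper names, and it assumes a rate-limited call rejects with an error object carrying a numeric `status` of 429 (the HTTP "Too Many Requests" code).

```typescript
// Assumption: a rate-limited call rejects with an object whose `status` is 429.
// Real client libraries may signal this differently; adjust the check to match.
function isRateLimitError(err: unknown): boolean {
  return (
    typeof err === "object" &&
    err !== null &&
    (err as { status?: unknown }).status === 429
  );
}

// Try the primary provider first; switch to the fallback only when the
// primary fails with a rate-limit error. Any other error is rethrown.
async function withFallback<T>(
  primary: () => Promise<T>,
  fallback: () => Promise<T>,
): Promise<T> {
  try {
    return await primary();
  } catch (err) {
    if (isRateLimitError(err)) {
      return fallback();
    }
    throw err;
  }
}
```

In use, `primary` would wrap the Gemini request and `fallback` the equivalent OpenAI request, both returning the same typed result so callers do not need to know which provider answered.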
We've hit a rate limit with Anthropic (cursor.com)
Docker limits unauthenticated pulls to 10/HR/IP from Docker Hub, from March 1 (docker.com)
When using Docker Hub, unauthenticated and Docker Personal users are subject to strict limits. In contrast, Docker Pro, Team, and Business users benefit from a consumption-based model with a base amount of included usage. This included usage is not a hard limit; users can scale or upgrade their subscriptions to receive additional usage or use on-demand usage.
Backblaze Rate Limiting Policy for Consistent Performance (backblaze.com)
Highways have lanes for a reason. The lanes help ensure that large volumes of traffic can reach their destinations quickly and safely. And they support order and predictability in systems where some folks want (or need) to go NASCAR fast and others like myself a little less so.
Exponential Rate Limiting (dotat.at)
Exponential rate limiting uses an exponentially-weighted moving average to measure the client’s rate. It is motivated by a shift of perspective: …
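As a rough illustration of measuring a client's rate with an exponentially-weighted moving average, here is a minimal sketch. It is not the linked article's algorithm: the class name `EwmaRateLimiter`, the `limit` and `window` parameters, the exact update rule, and the choice to leave state unchanged for denied requests are all assumptions made for this example.

```typescript
// Illustrative sketch only; names and update rule are assumptions.
interface ClientState {
  rate: number;     // smoothed request rate, in requests per second
  lastSeen: number; // time of the last accepted request, in seconds
}

class EwmaRateLimiter {
  private clients = new Map<string, ClientState>();
  private limit: number;  // maximum smoothed rate, requests per second
  private window: number; // averaging window, in seconds

  constructor(limit: number, window: number) {
    this.limit = limit;
    this.window = window;
  }

  // Returns true if the request at time `now` (seconds) is allowed.
  allow(clientId: string, now: number): boolean {
    const prev = this.clients.get(clientId);
    if (prev === undefined) {
      // First request from this client: nothing to average yet.
      this.clients.set(clientId, { rate: 0, lastSeen: now });
      return true;
    }
    const interval = Math.max(now - prev.lastSeen, 0);
    // Weight given to the old average; it shrinks as the gap grows.
    const decay = Math.exp(-interval / this.window);
    // Weighted contribution of this request, (1 - decay) * (1 / interval).
    // As interval approaches 0 this tends to 1 / window, used directly here.
    const contribution =
      interval > 0 ? (1 - decay) / interval : 1 / this.window;
    const rate = contribution + decay * prev.rate;
    if (rate > this.limit) {
      return false; // denied requests do not update the stored state
    }
    this.clients.set(clientId, { rate, lastSeen: now });
    return true;
  }
}
```

With these assumptions, a burst at a single instant raises the smoothed rate by `1 / window` per request, so `limit * window` roughly bounds the burst size, while a client that stays idle for much longer than `window` sees its measured rate decay back toward zero.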
Visualizing algorithms for rate limiting (smudge.ai)