Hacker News with Generative AI: Statistics

Show HN: Kartoffels – Cellular Automata, Statistics, 32-bit RISC-V (pwy.io)
Today I've released v0.7, which spans 122 commits and brings:
Basis of the Kalman Filter [pdf] (github.com/tpn)
Understanding the Basis of the Kalman Filter Via a Simple and Intuitive Derivation (2012).pdf
% Obesity by Country Ranking (worldobesity.org)
Prevalence of obesity (BMI ≥ 30kg/m²)
Backblaze Drive Stats for 2024 (backblaze.com)
As of December 31, 2024, we had 305,180 drives under management. Of that number, there were 4,060 boot drives and 301,120 data drives. This report will focus on those data drives as we review the Q4 2024 annualized failure rates (AFR), the 2024 failure rates, and the lifetime failure rates for the drive models in service as of the end of 2024.
Computing Tricky Probabilities Using Model Counting (msoos.org)
Probabilities of certain events are really hard to estimate sometimes. Often, it’s because we lack information of the underlying causal chains, but sometimes, it’s because the causes are so intertwined that even if we know the underlying probabilities of certain events happening along with the causal chains, it’s still impossible for us to untangle the web.
Supersizing vehicles offers minimal safety benefits – but substantial dangers (iihs.org)
The safety benefits of larger vehicles top out quickly once curb weights exceed the fleet average, a new IIHS study shows.
Servo in 2024: stats, features and donations (servo.org)
2025-01-31 Summary of Servo’s progress in 2024: some numbers, main highlights and plans for the future.
NYPD records longest no-shooting streak in 30 Years (abc7ny.com)
New York City went five days this week without a person shot -- the longest the NYPD has not recorded a shooting victim in at least 30 years -- but the streak is over.
Violent Crime Conviction Rate in Denmark by Nation of Origin 2010-2021 (twitter.com)
Two Bites of Data Science in K (zdsmith.com)
“no bowler with as many wickets has a better average”.
Major data revisions are coming – should make you trust official statistics more (slowboring.com)
Next month, the jobs report will incorporate its annual benchmark revision, which will meaningfully change some key economic statistics and will solve some nagging puzzles and misconceptions about the labor market.
More than 40% of postdocs leave academia, study reveals (nature.com)
More than 40% of postdoctoral researchers leave academia, according to a study of some 45,500 researchers’ careers1.
Statistical Literacy (entropicthoughts.com)
I am convinced there exists something we can call statistical literacy.
This is How Many Startup Businesses Fail in the First Year (+Survival Tips) (54collective.vc)
The startup journey is not for the faint-hearted. Pondering how many startup businesses fail in the first year is a prudent thing to do at pre-launch. This startup survival guide outlines the most critical startup failure statistics to keep in mind when planning your launch.
The average American spent 2.5 months on their phone in 2024 (pcmag.com)
There's a good chance that you're currently reading this article on your phone. If you’re like one of the Americans surveyed by Reviews.org, this is one of 205 times today that you’ll be checking the device in your hand.
Lots of driving is proof we have a healthy economy: Misuse of VMT and GDP charts (urbanismspeakeasy.com)
Motordom's defenders will frequently overlay vehicle miles traveled (VMT) and gross domestic product (GDP) as if the story must be "more driving equals more prosperity."
Lies. Damned Lies. P-value thresholds (newyorker.com)
Harold Eddleston, a seventy-seven-year-old from Greater Manchester, was still reeling from a cancer diagnosis he had been given that week when, on a Saturday morning in February, 1998, he received the worst possible news. He would have to face the future alone: his beloved wife had died unexpectedly, from a heart attack.
Optimality of Frequency Moment Estimation (weizmann.ac.il)
Kelly Can't Fail (win-vector.com)
You may have heard of the Kelly bet allocation strategy. It is a system for correctly exploiting information or bias in a gambling situation. It is also known as a maximally aggressive or high variance strategy, in that betting more than the Kelly selection can be quite ruinous.
Only 15% of all Steam users' time was spent playing games released in 2024 (pcgamer.com)
Lies, damn lies, and shoplifting statistics (popular.info)
For 32 years, the National Retail Federation (NRF) — the lobbying group representing major retailers in the United States — has produced the "National Retail Security Survey."
The distribution of eigenvalues of GUE and its minors at fixed index (wordpress.com)
Leadership Power Tools: SQL and Statistics (blwt.io)
A common pattern I’ve seen over the years have been folks in engineering leadership positions that are not super comfortable with extracting and interpreting data from stores, be it databases, CSV files in an object store, or even just a spreadsheet.
Why probability probably doesn't exist (but it is useful to act like it does) (nature.com)
All of statistics and much of science depends on probability — an astonishing achievement, considering no one’s really sure what it is.
Maximum likelihood estimation and loss functions (rish-01.github.io)
When I started learning about loss functions, I could always understand the intuition behind them. For example, the mean squared error (MSE) for regression seemed logical—penalizing large deviations from the ground-truth makes sense. But one thing always bothered me: I could never come up with those loss functions on my own. Where did they come from? Why do we use these specific formulas and not something else?
Datasaurus dozen – Different datasets with the same descriptive statistics (wikipedia.org)
The Datasaurus dozen comprises thirteen data sets that have nearly identical simple descriptive statistics to two decimal places, yet have very different distributions and appear very different when graphed.[1] It was inspired by the smaller Anscombe's quartet that was created in
Assisted dying now accounts for one in 20 Canada deaths (bbc.co.uk)
Medically-assisted dying – also known as voluntary euthanasia – accounted for 4.7% of deaths in Canada in 2023, new government data shows.
San Francisco is on track to have lowest homicide rate in 60 years – KTVU FOX 2 (ktvu.com)
San Francisco is on track to have the lowest homicide rate in 60 years, according to the police department and mayor's office.
Too many people are killed by supersized cars. This new rule could help (vox.com)
The deadly consequences of “autobesity,” in 3 charts.
US airlines transported passengers over two light-years since the last crash (ourworldindata.org)
When an airplane crashes, we all hear about it. Large crashes are major news events, with shocking pictures repeated endlessly across our television screens.