Hacker News with Generative AI: Data Storage

ClickHouse gets lazier and faster: Introducing lazy materialization (clickhouse.com)
Imagine if you could skip packing your bags for a trip because you find out at the airport you’re not going. That’s what ClickHouse is doing with data now.
(All) Databases Are Just Files. Postgres Too (tselai.com)
Dear reader: If you’re feeling an urge to comment solely based on the title, just be warned that too many have done so already.
Unpowered SSD endurance investigation finds data loss, performance issues (tomshardware.com)
How Long Can SSD Store Data Unpowered? Year 2 Update (2024) [video] (youtube.com)
Colossus for Rapid Storage (cloud.google.com)
As an object storage service, Google Cloud Storage is popular for its simplicity and scale, a big part of which is due to the stateless REST protocols that you can use to read and write data. But with the rise of AI and as more customers look to run data-intensive workloads, two major obstacles to using object storage are its higher latency and lack of file-oriented semantics.
SpacetimeDB (spacetimedb.com)
U.S. Gov't eliminates tape data storage at the GSA to save $1M per year (tomshardware.com)
Reducing Cloud Spend: Migrating Logs from CloudWatch to Iceberg with Postgres (crunchydata.com)
As a database service provider, we store a number of logs internally to audit and oversee what is happening within our systems.
Scoping a Local-First Image Archive (scottishstoater.com)
For years, I’ve been thinking about how we store and access our digital files, especially photos.
Preview: Amazon S3 Tables and Lakehouse in DuckDB (duckdb.org)
TL;DR: We are happy to announce a new preview feature that adds support for Apache Iceberg REST Catalogs, enabling DuckDB users to connect to Amazon S3 Tables and Amazon SageMaker Lakehouse with ease.
The real failure rate of EBS (planetscale.com)
PlanetScale has deployed millions of Amazon Elastic Block Store (EBS) volumes across the world. We create and destroy tens of thousands of them every day as we stand up databases for customers, take backups, and test our systems end-to-end. Through this experience, we have a unique viewpoint into the failure rates and mechanisms of EBS, and have spent a lot of time working on how to mitigate them.
Archival Storage (dshr.org)
I'm honored to appear in what I believe is the final series of these seminars. Most of my previous appearances have focused on debunking some conventional wisdom, and this one is no exception. My parting gift to you is to stop you wasting time and resources on yet another seductive but impractical idea — that the solution to storing archival data is quasi-immortal media. As usual, you don't have to take notes.
Theory crafting a system for 1000 simultaneous micro SD card ingests (level1techs.com)
Ask HN: What do you think of BDXL (100GB disks)? (ycombinator.com)
I still have a need to archive data and I'm thinking about getting a BDXL writer and some disks. Is this a dumb thing to do in 2025?
Put a data center on the moon? (ieee.org)
Lonestar Data Holdings is sending a test mission, aiming to safeguard valuable data
Hard Drive Graveyard (benjdd.com)
What 5 Megabytes of Data Looked Like in 1966 (62,500 punched cards) (vintag.es)
In 1966, computing was in its infancy, and the concept of data storage and processing looked drastically different from today’s instant access to vast amounts of information.
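The 62,500-card figure follows from simple arithmetic, assuming the standard 80-column punched card storing one character (one byte) per column:

```python
CARD_BYTES = 80  # a standard 80-column card stores one character per column

def cards_for(total_bytes: int) -> int:
    """Number of punched cards needed to hold total_bytes of data."""
    return total_bytes // CARD_BYTES

# 5 megabytes (5,000,000 bytes) at 80 bytes per card:
print(cards_for(5_000_000))  # 62500
```

At roughly 2 grams per card, that stack of 5 MB weighed well over a hundred kilograms.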
Are SSDs more reliable than hard drives? (2021) (backblaze.com)
Solid-state drives (SSDs) continue to become more and more a part of the data storage landscape. And while our SSD 101 series has covered topics like upgrading, troubleshooting, and recycling your SSDs, we’d like to test one of the more popular declarations from SSD proponents: that SSDs fail much less often than our old friend, the hard disk drive (HDD).
12 years of Backblaze data center storage drives, visualized (benjdd.com)
1 small node -> 100 drives
Backblaze Drive Stats for 2024 (backblaze.com)
As of December 31, 2024, we had 305,180 drives under management. Of that number, there were 4,060 boot drives and 301,120 data drives. This report will focus on those data drives as we review the Q4 2024 annualized failure rates (AFR), the 2024 failure rates, and the lifetime failure rates for the drive models in service as of the end of 2024.
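The annualized failure rate (AFR) in reports like this is conventionally computed from drive-days of exposure rather than a raw drive count, so drives deployed for only part of the year are weighted correctly. A minimal sketch (the function name is mine, not Backblaze's):

```python
def annualized_failure_rate(failures: int, drive_days: int) -> float:
    """AFR (%) = failures / (drive_days / 365) * 100.

    Using drive-days means a drive that ran for half the year
    contributes half as much exposure as one that ran all year.
    """
    return failures / (drive_days / 365) * 100

# Example: 1,000 drives running a full year with 10 failures -> 1.0% AFR
print(annualized_failure_rate(10, 1_000 * 365))
```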
Seagate's HDD scandal deepens as clues point at Chinese Chia mining farms (tomshardware.com)
Cloudflare R2 Incident on February 6, 2025 (cloudflare.com)
Multiple Cloudflare services, including our R2 object storage, were unavailable for 59 minutes on Thursday, February 6th. This caused all operations against R2 to fail for the duration of the incident, and caused a number of other Cloudflare services that depend on R2 — including Stream, Images, Cache Reserve, Vectorize and Log Delivery — to suffer significant failures.
For privacy: Change of our refund policy from 30 to 14 days (mullvad.net)
As part of our ongoing commitment to storing less user data and protecting your privacy, we're updating our refund policy.
Husky: Efficient Compaction at Datadog Scale (datadoghq.com)
In a previous blog post, we introduced our Husky event store system. Husky is a distributed storage system that is layered over object storage (e.g., Amazon S3, Google Cloud Storage, Azure Blob Storage, etc.), with the query system acting as a cache over this storage. We also did a deep dive into Husky’s ingestion pipelines that we built to handle the scale of our customer data. In this post, we’ll cover how we designed Husky’s underlying data storage layer.
Apache Accumulo 4.0 Feature Preview (apache.org)
Apache Accumulo® is a sorted, distributed key/value store that provides robust, scalable data storage and retrieval.
Storage is cheap, but not thinking about logging is expensive (counting-stuff.com)
The bad habits of data over-collection run deep.
Seagate smashes largest HDD world record with 36TB hard drive (techradar.com)
Parquet and ORC's many shortfalls for machine learning, and what to do about it? (starburst.io)
At the turn of the century (around a quarter of a century ago), over 99% of the data management industry used row-oriented storage for all workloads involving structured data — including transactional and analytical workloads.
37signals Dev – Monitoring 10 Petabytes of Data in Pure Storage (37signals.com)
How we use Prometheus to collect metrics and trigger alerts for Pure Storage.
I Track My Health Data in Markdown: Lessons in Digital Longevity (ycombinator.com)
I’ve spent years tracking my sleep, diet, and exercise with apps and wearables. But here’s the problem: when an app gets discontinued or stops syncing, the data—and all the insights—disappear.