Hacker News with Generative AI: Data Preservation

Digital Archivists: Protecting Public Data from Erasure (ieee.org)
Through clever usage of APIs, the Library Innovation Lab at Harvard Law School has created an archive of Data.gov, home to 311,000 public datasets
Archive Team (archiveteam.org)
And we've been trashing our history
The critical window of shadow libraries (annas-archive.se)
At Anna’s Archive, we are often asked how we can claim to preserve our collections in perpetuity, when the total size is already approaching 1 Petabyte (1000 TB), and is still growing. In this article we’ll look at our philosophy, and see why the next decade is critical for our mission of preserving humanity’s knowledge and culture.
Tell HN: We should snapshot a mostly AI output free version of the web (ycombinator.com)