Announcing the data.gov archive (law.harvard.edu) Today we released our archive of data.gov on Source Cooperative. The 16TB collection includes over 311,000 datasets harvested during 2024 and 2025, a complete archive of federal public datasets linked by data.gov. It will be updated daily as new datasets are added to data.gov.
CDC data are disappearing (theatlantic.com) Last night, scientists began to hear cryptic and foreboding warnings from colleagues: Go to the CDC website, and download your data now. They were all telling one another the same thing: Data on the website were about to disappear, or be altered, to comply with the Trump administration’s ongoing attempt to scrub federal agencies of any mention of gender, DEI, and accessibility.
Show HN: Spice.ai OSS 1.0 – data query and AI-inference engine built in Rust (spiceai.org) 🎉 Today marks the 1.0-stable release of Spice.ai Open Source, purpose-built to help enterprises ground AI in data. By unifying federated data query, retrieval, and AI inference into a single engine, Spice mitigates AI hallucinations, accelerates data access for mission-critical workloads, and makes it simple and easy for developers to build fast and accurate data-intensive applications across cloud, edge, or on-prem.
Zuckerberg appeared to know Llama trained on Libgen (rollingstone.com) The AI rush has brought with it thorny questions of copyright and ownership of data as tech companies train bots like ChatGPT on existing texts, but it seems Meta largely brushed these aside as they worked to integrate such tools into Facebook and Instagram.
Brief Introduction to FIX and FIX JSON (fixparser.dev) The FIX Protocol (Financial Information Exchange) is a standardized messaging system for real-time electronic communication of trade-related information in financial markets.