Hacker News with Generative AI: AI

Show HN: I built a more productive way to manage AI chats (contextch.at)
Easily set up multiple projects with web, file, and GitHub context. Start a new chat, and leverage your saved context to get instant answers and insights
Avoiding becoming the lone dependency peg with load-bearing anime (xeiaso.net)
While working on Anubis (a Web AI Firewall Utility designed to stop rampant scraping from taking out web services), one question in particular keeps coming up:
Honey Badger (simonwillison.net)
I'm helping make some changes to a large, complex and very unfamiliar to me WordPress site. It's a perfect opportunity to try out Claude Code running against the new Claude 4 models.
Microsoft dumps AI into Notepad as 'Copilot all the things' mania takes hold (theregister.com)
Microsoft has continued to shovel AI into its built-in Windows inbox apps, and now it's rolling out a Notepad update that will use Copilot to write text for you.
In 3.5 years, Notepad has gone from "barely maintained" to "it writes for you" (arstechnica.com)
AI features in Windows are gradually becoming more widespread and inescapable.
KumoRFM: A Foundation Model for In-Context Learning on Relational Data (kumo.ai)
Foundation Models (FMs) have completely taken over unstructured data domains like natural language and images, delivering significant advances in performance across tasks with little to no task-specific training. Yet structured and semi-structured relational data, which represent some of the most valuable information assets, largely miss out on this AI wave.
The Prompt Theory (twitter.com)
Something went wrong, but don’t fret — let’s give it another shot.
In 3.5 years, Notepad.exe goes from "barely maintained" to "it writes for you" (arstechnica.com)
In November, Microsoft began testing an update that allowed users to rewrite or summarize text in Notepad using generative AI.
Claude 4 Opus can blow the whistle on you (twitter.com)
Something went wrong, but don’t fret — let’s give it another shot.
Show HN: Pi Co-pilot – Evaluation of AI apps made easy (withpi.ai)
How I used o3 to find a remote 0-day vulnerability in the Linux kernel (ksmbd) (heelan.io)
In this post I’ll show you how I found a zeroday vulnerability in the Linux kernel using OpenAI’s o3 model. I found the vulnerability with nothing more complicated than the o3 API – no scaffolding, no agentic frameworks, no tool use.
New #1 open-source AI Agent on SWE-bench Verified (refact.ai)
Refact.ai Agent achieved 69.8% on SWE-bench Verified — autonomously solving 349 out of 500 tasks. This makes Refact.ai a leading open-source AI programming Agent on SWE-bench and places it among the top ranks on the leaderboard.
Koog, a Kotlin-based framework to build and run Al agents in idiomatic Kotlin (github.com/JetBrains)
Koog is a Kotlin-based framework designed to build and run AI agents entirely in idiomatic Kotlin.
Terminator: Hasta La Vista, Shell (github.com/steipete)
Keep Cursor, Windsurf, and other AI coding assistants working without disruption.
Aligning LLM Choice to Your Use Case: An Expert's Guide (oblivus.com)
Selecting the right large language model (LLM) is a critical decision that can make or break the success of your AI-driven project.
I don't like ChatGPT's new memory dossier (simonwillison.net)
Last month ChatGPT got a major upgrade. As far as I can tell the closest to an official announcement was this tweet from @OpenAI:
Google rejected giving publishers more choice to opt out of AI Search (theverge.com)
Google didn’t want to give publishers the choice to keep their content out of AI Search results because it’s “evolving into a space for monetisation.”
Alan Turing papers saved from shredder could fetch £150k (theguardian.com)
Widely considered the father of theoretical computer science, Alan Turing’s influence on modern life continues to be felt in the age of artificial intelligence. But despite this legacy, a cache of his most important papers was nearly shredded – only to be saved at the last minute when their significance was recognised at a family event.
Python Tooling at Scale: LlamaIndex’s Monorepo Overhaul (llamaindex.ai)
When we talk about LlamaIndex, we’re actually referring to an ecosystem consisting of more than 650 Python packages, mostly Integrations and Packs. All these packages share a single GitHub repository, what engineers fondly call a “monorepo”. In this article, we’re going to introduce LlamaDev, our new tool for managing monorepos at scale, and explain the challenges we ran into with existing tooling to get us to this point.
Show HN: Representing Agents as MCP Servers (github.com/lastmile-ai)
Launch HN: SIM Studio (YC X25) – Figma-Like Canvas for Agent Workflows (ycombinator.com)
Hey HN! We're Emir and Waleed from Sim Studio (https://simstudio.ai). We did a Show HN a few weeks ago about our open-source project (https://news.ycombinator.com/item?id=43823096). Today, we’re launching our hosted platform—a collaborative interface to build and deploy agent workflows. We just removed the waitlist (with 5,000+ people) and you can sign up and access it at: https://simstudio.ai
PlainsightAI Releases OpenFilter: Framework For Universal Vision Workloads (github.com/PlainsightAI)
OpenFilter is an universal abstraction for building and running vision workloads in modular image/video processing pipelines.
Show HN: KVoiceWalk – Voice cloning for Kokoro TTS using random walk algorithms (github.com/RobViren)
KVoiceWalk tries to create new Kokoro voice style tensors that clones target voices by using a random walk algorithm and a hybrid scoring method that combines Resemblyzer similarity, feature extraction, and self similarity.
Show HN: Super (YC W18) - Turn company data into answers & agents for your team (super.work)
Super is how AI curious companies get to action. Connect all your tools, get the best enterprise search experience, prebuilt assistants, and bring it in your own workflows.
Show HN: OpenHands, an open source alternative to Devin, Codex, and Jules (github.com/All-Hands-AI)
Welcome to OpenHands (formerly OpenDevin), a platform for software development agents powered by AI.
The rise, fall and spectacular comeback of Sam Altman (theguardian.com)
The short-lived firing of Sam Altman, the CEO of possibly the world’s most important AI company, was sensational.
Why Good Programmers Use Bad AI (nmn.gl)
AI code generation is error-prone. Why, then, are programmers still using it?
Show HN: Bricks – One Click Dashboards from Your Data Using AI (thebricks.com)
With AI Mode, Google Search Is About to Get Even Chattier (wired.com)
Google is rolling out its AI Mode search experience to everyone in the US starting today.
Gemini 2.5: Our most intelligent models are getting even better (google)
Gemini 2.5 Pro continues to be loved by developers as the best model for coding, and 2.5 Flash is getting even better with a new update. We’re also bringing new capabilities to our models, including Deep Think, an experimental enhanced reasoning mode for 2.5 Pro.