Hacker News with Generative AI: Search

Google's AI Mode is 'the definition of theft,' publishers say (9to5google.com)
The AI takeover of Search is in full swing, especially as Google’s new AI Mode is going live for all US users. But for publishers, this continues the existential crisis around how Google Search is changing, with a new statement calling AI Mode “the definition of theft” while legal documents reveal that Google did consider opt out controls that ultimately weren’t implemented.

Artificial Intelligence, Google, Publishing, Law, Search

56 points by ironyman 63 days ago | 63 comments

Tell HN: Mozilla is preparing to remove bookmark keywords (ycombinator.com)
On Bugzilla there's an active ticket [0] tracking progress to remove bookmark keywords from Firefox, in order to consolidate them into Search.

Mozilla, Firefox, Web Browsing, Search, Bookmarks

16 points by RheingoldRiver 64 days ago | 2 comments

Show HN: A free, privacy preserving, archive of public Discord servers (searchcord.io)

Discord, Privacy, Open Source, Social Media, Search

70 points by searchcord 67 days ago | 81 comments

Chrome's New Embedding Model: Smaller, Faster, Same Quality (dejan.ai)
Chrome’s latest update incorporates a new text embedding model that is 57% smaller (35.14MB vs 81.91MB) than its predecessor while maintaining virtually identical performance in semantic search tasks.

Chrome, Machine Learning, Search, Software Updates, Text Embeddings

40 points by kaycebasques 73 days ago | 11 comments

Show HN: Airweave – Let agents search any app (github.com/airweave-ai)
Airweave is a tool that lets agents semantically search any app. It's MCP compatible and seamlessly connects any app, database, or API, to transform their contents into agent-ready knowledge.

AI, Search, Agents, OpenAI

176 points by lennertjansen 74 days ago | 40 comments

Show HN: Extension for full-text browser history search (vercel.app)

Web Development, Browser Extensions, Search

29 points by ApbNfMR 78 days ago | 11 comments

OpenSearch 3.0 Released (opensearch.org)
The OpenSearch Software Foundation, the vendor-neutral home for the OpenSearch Project, today announced the general availability of OpenSearch 3.0.

Open Source, Search, New Releases

110 points by kmaliszewski 79 days ago | 30 comments

Ever wondered why Gmail search fails to find text you're sure is present? (emaildiscussions.com)
Ever wondered why Gmail search fails to find text you're sure is present?

Email, Search, Gmail

26 points by chrisjj 81 days ago | 12 comments

Bitbucket search that doesn't suck – Sourcebot, OSS alternative to Sourcegraph (sourcebot.dev)
We’ve added support for indexing repos from Bitbucket Cloud and Bitbucket Data Center. Check out our docs for more info!

Software, Open Source, Git, Version Control, Search

8 points by bshzzle 91 days ago | 0 comments

Discord Indexes Trillions of Messages (discord.com)
Back in 2017, we shared how we built our message search system to index billions of messages.

Search, Social Media, Discord, Messaging, Technology

38 points by todsacerdoti 92 days ago | 10 comments

Lucene University (github.com/msfroh)
This repository contains some examples of Apache Lucene features with verbose explanations as code comments written in Markdown.

Apache Lucene, Search, Software, Programming, Education

66 points by softwaredoug 93 days ago | 1 comments

AI assisted search-based research works now (simonwillison.net)
In this first half of 2025 I think these systems have finally crossed the line into being genuinely useful.

Artificial Intelligence, Search, Research

283 points by simonw 95 days ago | 147 comments

Google Search switching to Google․com around the world (9to5google.com)
Google Search is now getting rid of these country code top-level domain names (ccTLD) in favor of using google.com globally.

Google, Search, Domain Names, Web

7 points by alentred 100 days ago | 1 comments

Show HN: H-1B salary search without fuss (h1bsalaries.fyi)
This website indexes the Labor Condition Application (LCA) disclosure data from the United States Department of Labor (DOL).

Data, Employment, Immigration, Search, US Government

8 points by dtgeadamo 103 days ago | 0 comments

Show HN: RAG, No Vectors (github.com/VectifyAI)
PageIndex is a document indexing system that builds search tree structures from long documents, making them ready for reasoning-based RAG.

Search, Information Retrieval

11 points by vectify_AI 105 days ago | 1 comments

An LLM Query Understanding Service (softwaredoug.com)
We need to be cheating at search with LLMs. Indeed I’m teaching a whole course on this in July.

Search, AI, Programming

38 points by softwaredoug 107 days ago | 3 comments

Show HN: HNSW index for vector embeddings in approx 500 LOC (github.com/dicroce)

Show HN, GitHub, Vector Embeddings, Software, Search

73 points by dicroce 108 days ago | 12 comments

Show HN: Hot Notes – Fuzzy Search for Apple Notes (macOS) (hotmatcha.dev)
Hot Notes is a macOS app that opens your Apple Notes using fast fuzzy search

macOS, Apple Notes, Productivity, Search, Software

9 points by emadda 114 days ago | 7 comments

Knowledge Library MCP (devpost.com)
Knowledge Library MCP (KL MCP) is a multi-modal application leveraging Azure AI Agent Service to locate documents—text and images—and deliver conversational insights via bots. It enhances search with live data integration and Responsible AI principles, designed for scalable, professional-grade querying.

Azure, Artificial Intelligence, Search, Knowledge Management, Conversational AI

5 points by thstart 119 days ago | 1 comments

Show HN: Search and chat with millions of court cases using AI. (courtsearch.ai)

AI, Legal Tech, Search, Court Cases

31 points by ashr_ 124 days ago | 7 comments

Improving recommendation systems and search in the age of LLMs (eugeneyan.com)
Recommendation systems and search have historically drawn inspiration from language modeling. For example, the adoption of Word2vec to learn item embeddings (for embedding-based retrieval), and using GRUs, Transformer, and BERT to predict the next best item (for ranking). The current paradigm of large language models is no different.

Recommendation Systems, Search, Artificial Intelligence

408 points by 7d7n 125 days ago | 93 comments

Show HN: I converted my notebook into a searchable database of IT keywords (techbook.digital)

Show HN, Databases, IT, Search

78 points by jurajstefanic 129 days ago | 27 comments

Google gets 373x more searches every day than ChatGPT (sparktoro.com)
For years, two questions have dominated both marketers’ interest and the media’s coverage of Google Search:

Google, Search, ChatGPT, Artificial Intelligence, Marketing

5 points by nsmog767 136 days ago | 1 comments

Long Read: Lessons from Building Semantic Search for GitHub and Why I Failed (notion.site)

Long Read, Search, Software Development, GitHub

146 points by zxt_tzx 139 days ago | 51 comments

Optimizing for Multiple Objectives in Search and Recommendations (shaped.ai)
Building effective recommendation and search systems means going beyond simply predicting relevance. Modern users expect personalized experiences that cater to a wide range of needs and preferences, and businesses need systems that align with their overarching goals. This requires optimizing for multiple objectives simultaneously – a complex challenge that demands a nuanced approach. This post explores the concept of value modeling and multi-objective optimization (MOO), explaining how these techniques enable the development of more sophisticated and valuable recommendation and search experiences.

Search, Recommendations, Optimization, Machine Learning, User Experience

10 points by tullie 142 days ago | 1 comments

BM25 in PostgreSQL (vectorchord.ai)
We’re excited to share something special with you: VectorChord-BM25, a new extension designed to make PostgreSQL’s full-text search even better. Whether you’re building a small app or managing a large-scale system, this tool brings advanced BM25 scoring and ranking right into PostgreSQL, making your searches smarter and faster.

PostgreSQL, Databases, Search, Full-Text Search, Open Source

139 points by gaocegege 145 days ago | 31 comments

Chrome has built-in AI history search (support.google.com)
You can use everyday language to find and receive generated answers about what you’re searching for in your Chrome browsing history. This works even if you don't know an exact keyword or website address.

Chrome, AI, Search, Browsing History

22 points by exp1orer 145 days ago | 9 comments

Show HN: PG-Capture – a better way to sync Postgres with Algolia (or Elastic) (onrender.com)
Schema-based Change-Data-Capture for Postgres

Databases, Postgres, Search, Open Source, SaaS

56 points by nick-keller 147 days ago | 19 comments

Google is making it even easier to remove your personal information on Search (engadget.com)
Google has been offering the Results About You tool since 2022 and updated it once in 2023.

Privacy, Google, Search

8 points by 01-_- 149 days ago | 1 comments

VectorChord-BM25: PostgreSQL Search with BM25 – 3x Faster Than Elasticsearch (vectorchord.ai)
We’re excited to share something special with you: VectorChord-BM25, a new extension designed to make PostgreSQL’s full-text search even better. Whether you’re building a small app or managing a large-scale system, this tool brings advanced BM25 scoring and ranking right into PostgreSQL, making your searches smarter and faster.

PostgreSQL, Search, Performance, Open Source, Databases

5 points by gaocegege 151 days ago | 0 comments