Hacker News with Generative AI: Search Engines

Google Doesn't Want You to Search (honest-broker.com)
Almost everything in the digital world is turning into its opposite.
Perplexity claims to have purged Chinese censorship from its new DeepSeek clone (sherwood.news)
Perplexity claims to have purged Chinese censorship and propaganda from its new DeepSeek clone
Google Quality Issues: Harmful to Consumers and Possibly Intentional (wallethub.com)
Google is the portal to the internet for millions of people, serving not just as the bridge between searchers and information but also increasingly positioning itself as the final destination, pushing owned and operated properties and features on users.
Searchcode.com’s SQLite database is probably 6 terabytes bigger than yours (boyter.org)
searchcode.com’s SQLite database is probably one of the largest in the world, at least for a public-facing website. Its actual size is 6.4 TB, which is probably 6 terabytes bigger than yours.
Ask HN: Do you still use Google? (ycombinator.com)
Do you still use Google? Gmail? YouTube? Gemini? Chrome? Android?
Will Google Search Be the Next to Join Killed by Google? (stan-kondrat.github.io)
Google Search Is Dying for Me – LLMs Took Over.
Privacy Pass Authentication for Kagi Search (kagi.com)
Today we are announcing a new privacy feature coming to Kagi Search. Privacy Pass is an authentication protocol first introduced by Davidson et al. in [1], and recently standardized by the IETF as RFCs [2–4]. At the same time, we are announcing the immediate availability of Kagi’s Tor onion service.
Microsoft Hit by French Antitrust Probe over Rivals' Bing Access (bloomberg.com)
Microsoft Corp. is under investigation from the French antitrust authority amid concerns the US tech giant is degrading the quality of results when smaller rivals pay to use Bing technology in their own search-engine products.
Show HN: Fuckingsearch.com, search Google without AI Overviews (fuckingsearch.com)
Ask HN: Is it me or we almost stopped using Google because of LLMs? (ycombinator.com)
Hi, simple question: is it just me, or have we almost stopped using Google as heavily as before because of LLMs?
Show HN: Searchable library of free audiobooks (booksearch.party)
This is an open beta, currently listing 4,610 books:
Open-source DeepResearch – Freeing our search agents (huggingface.co)
Yesterday, OpenAI released Deep Research, a system that browses the web to summarize content and answer questions based on the summary.
ChatGPT and the Enshittification of Google (tompccs.github.io)
In any analysis of Gen Z’s use of ChatGPT (and TikTok, for that matter) to find information in just the same way you and I used to use Google, the question we should be asking is: why has no one been able to make a substantial business out of giving people the information they actually need?
Search logs faster than Sonic – Log search engine internals (vegasecurity.com)
Have you ever wondered how Elasticsearch works? How is it so fast? What makes it different from other databases like PostgreSQL? What cool data structures are at play?
Show HN: I indexed 10M Shopify products to build an API (searchagora.com)
"My wife asked me for a pair of red shoes for Christmas. I quickly typed it into Google and found a combination of ads from large retailers and links to a 1948 movie called 'Red Shoes'. I decided to build Agora to solve my own problem (and stay happily married)."
Add "fucking" to your Google searches to neutralize AI summaries (gizmodo.com)
If you are tired of Google’s AI-powered search results leading you astray with poor information from bad sources, there is some good news. It turns out that if you include any expletives in your search query, Google will not return an AI Overview, as they are called, at the top of the results page.
DIY Projects Search Engine (FindingDIY.com)
Use our search engine to find free DIY projects on topics like arduino, home decor, embroidery, woodworking, and more. Start typing below to explore thousands of DIY guides from around the web.
DeepSeek FAQ (stratechery.com)
It’s Monday, January 27. Why haven’t you written about DeepSeek yet?
Marginalia – A search engine that prioritizes non-commercial content (marginalia-search.com)
This is the new design and home of Marginalia Search.
Show HN: SimpleSearch – Just a list of search bars (simplesearch.info)
Kagi Launches Kite (kagi.com)
Driven by relentless ad monetization, news has become mental junk food - easy to consume but toxic to our minds, triggering stress responses and interrupting deep thought. We can do better, by going back to the essence of journalism - to inform and educate citizens.
Building a full-text search engine in 150 lines of Python code (2021) (degoe.de)
Full-text search is everywhere. From finding a book on Scribd, a movie on Netflix, toilet paper on Amazon, or anything else on the web through Google (like how to do your job as a software engineer), you’ve searched vast amounts of unstructured data multiple times today. What’s even more amazing is that even though you searched millions (or billions) of records, you got a response in milliseconds.
Microsoft Bing Now Hides Google Search Results (seroundtable.com)
Recently, we saw Microsoft tricking searchers into thinking they were searching in Google and not on Bing.
Whoogle – open-source, self-hosted, ad-free, privacy-aware metasearch engine (github.com/benbusby)
Get Google search results, but without any ads, JavaScript, AMP links, cookies, or IP address tracking. Easily deployable in one click as a Docker app, and customizable with a single config file. Quick and simple to implement as a primary search engine replacement on both desktop and mobile.
Google.com search now refusing to search for FF esr 128 without JavaScript (ycombinator.com)
Show HN: A blocklist to remove spam and bad websites from search results (github.com/popcar2)
This is a blocklist that intends to remove garbage websites from search results, such as AI-generated articles, low-effort spam sites, and thinly-veiled advertisements acting as information.
Vanished from Google/Bing/LinkedIn: a rebuttal of an anti-net neutrality paper (blogspot.com)
What if something can't be found through Google Search? Does it still exist? That is the question I had to ask myself when I found that a blog post I wrote two years ago can no longer be found through either Google or Bing, though it is still listed on Brave. A LinkedIn post in which I refer to the blog post was also made inaccessible, even though Bing can still find that it exists and it can be partially read when browsing anonymously.
IRC Driven – modern IRC indexing site and search engine (ircdriven.com)
IRC Driven is a modern IRC indexing site and search engine. Originally started in 2006, inspired by the now defunct SearchIRC, and after several rewrites we are now evolving into a social media platform for IRC.
Show HN: New search engine and free-FOIA-by-fax-via-web for US veteran records (birls.org)
A three-month review of Kagi search and the Orion web browser (2024) (flatfootfox.com)
There’s a new web search in town. No, it’s not a re-skin of Bing results. No, it’s not an AI powered tool chasing this particular moment of Large Language Model (LLM) hype. Kagi is an honest to goodness general purpose search engine with a simple proposal: