Hacker News with Generative AI: Retrieval-Augmented Generation

RAGDoll: Efficient Offloading-Based Online RAG System on a Single GPU (arxiv.org)
Retrieval-Augmented Generation (RAG) enhances large language model (LLM) generation quality by incorporating relevant external knowledge.

Retrieval-Augmented Generation, Artificial Intelligence, Computer Science, GPUs

4 points by PaulHoule 57 days ago | 0 comments

How to build an agentic, chat or RAG knowledge system using Pinecone Assistant (pinecone.io)
Pinecone Assistant is a powerful service that leverages retrieval-augmented generation (RAG) to enable users to upload documents, ask questions, receive context-aware responses and power agentic workflows.

Knowledge Systems, Artificial Intelligence, RAG, Pinecone Assistant, Retrieval-Augmented Generation

1 points by four_fifths 152 days ago | 0 comments

Show HN: RAGLite – A Python package for the unhobbling of RAG (github.com/superlinear-ai)
🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLite.

Python, Retrieval-Augmented Generation, Open Source

19 points by lsorber 199 days ago | 0 comments

Long Context vs. RAG (jonathanadly.com)
One of the projects I have built is a long-standing retrieval-augmented generation (RAG) application. Documents are saved in a database, chunked into a reasonable amount of text that a large language model (LLM) can handle, and turned into numerical representation (vectors).

Retrieval-Augmented Generation, Information Retrieval

4 points by jonathan-adly 304 days ago | 0 comments