Hacker News with Generative AI: Code Retrieval

SOTA Code Retrieval with Efficient Code Embedding Models (qodo.ai)
Today, we’re excited to announce Qodo-Embed-1, a new code embedding model family that achieves state-of-the-art performance while maintaining a significantly smaller footprint than existing models.

Code Retrieval, Generative AI, Machine Learning, Software, AI

11 points by jimminyx 142 days ago | 2 comments

How do we evaluate vector-based code retrieval? (voyageai.com)
Despite the widespread use of vector-based code retrieval, evaluating the retrieval quality of embedding models for code retrieval is a common pain point.

Code Retrieval, Evaluation, Machine Learning

55 points by fzliu 171 days ago | 0 comments

Voyage-code-3 (voyageai.com)
TL;DR – Introducing voyage-code-3, our next-generation embedding model optimized for code retrieval. It outperforms OpenAI-v3-large and CodeSage-large by an average of 13.80% and 16.81% on a suite of 32 code retrieval datasets, respectively. By supporting smaller dimensions with Matryoshka learning and quantized formats like int8 and binary, voyage-code-3 can also dramatically reduce storage and search costs with minimal impact on retrieval quality.

Generative AI, Code Retrieval, Machine Learning

111 points by fzliu 191 days ago | 30 comments