BitNet b1.58 2B4T Technical Report
(arxiv.org)
We introduce BitNet b1.58 2B4T, the first open-source, native 1-bit Large Language Model (LLM) at the 2-billion parameter scale.
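To make the "native 1-bit" claim concrete: BitNet b1.58 constrains weights to the ternary set {-1, 0, +1} (about 1.58 bits per weight). The snippet below is a minimal sketch of an absmean-style ternary quantizer under that assumption; the function name, argument names, and the per-tensor scaling choice are illustrative, not taken from the paper's code.

```python
import numpy as np

def absmean_ternary_quantize(w: np.ndarray, eps: float = 1e-5):
    """Quantize a weight matrix to ternary values {-1, 0, +1}.

    Minimal sketch of an absmean scheme: scale by the mean absolute
    weight, round, and clip. Names here are hypothetical, not from
    the BitNet b1.58 release.
    """
    scale = np.mean(np.abs(w)) + eps           # per-tensor absmean scale
    w_q = np.clip(np.round(w / scale), -1, 1)  # ternary codes in {-1, 0, +1}
    return w_q, scale                          # dequantize as w_q * scale

# Toy usage: quantize a random weight matrix and check its value set.
w = np.random.randn(4, 8).astype(np.float32)
w_q, scale = absmean_ternary_quantize(w)
assert set(np.unique(w_q)).issubset({-1.0, 0.0, 1.0})
```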
DeepSeek-V3 Technical Report
(arxiv.org)
We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters, of which 37B are activated for each token.
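The 671B-total / 37B-activated split comes from sparse expert routing: each token is sent to only a few experts, so only their parameters run in the forward pass. The sketch below shows generic top-k routing for one token; the names, shapes, and expert count are illustrative, and DeepSeek-V3's actual router (shared experts, auxiliary-loss-free load balancing) is more involved.

```python
import numpy as np

def topk_route(hidden: np.ndarray, gate_w: np.ndarray, k: int = 8):
    """Pick the top-k experts for one token from router logits.

    Minimal sketch of top-k MoE routing: only the chosen experts'
    parameters are activated, which is why activated parameters are
    a small fraction of the total. Names and shapes are hypothetical.
    """
    logits = hidden @ gate_w           # [num_experts] router scores
    topk = np.argsort(logits)[-k:]     # indices of the chosen experts
    weights = np.exp(logits[topk])
    weights /= weights.sum()           # normalized gating weights
    return topk, weights

# Toy usage: route one token among 64 hypothetical experts, activating 8.
rng = np.random.default_rng(0)
hidden = rng.standard_normal(1024)
gate_w = rng.standard_normal((1024, 64))
experts, gates = topk_route(hidden, gate_w, k=8)
```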