DeepSeek-V3/R1 Inference System Overview (github.com/deepseek-ai)
The optimization objectives of serving DeepSeek-V3/R1 inference are: higher throughput and lower latency.