Hacker News with Generative AI: ONNX

Low-Latency Bayesian Inference: Deploying Models with PyTorch and ONNX (world.hey.com)
Deploying Bayesian models in production often requires balancing predictive accuracy with low-latency inference.

Machine Learning, Bayesian Inference, PyTorch, ONNX, Deployment

7 points by apetrov 310 days ago | 0 comments

Transposing Tensor Files (mmapped.blog)
The safetensors library from Huggingface is popular for representing tensors on disk, and its data layout is fully compatible with the onnx raw tensor data format.

Tensor Files, Data Storage, Huggingface, ONNX, Machine Learning

20 points by surprisetalk 472 days ago | 8 comments