Hacker News
- Quantization-Aware Training for Large Language Models with PyTorch (2024) https://pytorch.org/blog/quantization-aware-training/ 0 comments
Linked pages
- PyTorch 2.0 | PyTorch https://pytorch.org/get-started/pytorch-2.0/ 153 comments
- Tensor Cores: Versatility for HPC & AI | NVIDIA https://www.nvidia.com/en-us/data-center/tensor-cores/ 31 comments
- GitHub - EleutherAI/lm-evaluation-harness: A framework for few-shot evaluation of autoregressive language models. https://github.com/EleutherAI/lm-evaluation-harness 0 comments
- GitHub - google/XNNPACK: High-efficiency floating-point neural network inference operators for mobile, server, and Web https://github.com/google/XNNPACK 0 comments
- GitHub - pytorch/torchtune: PyTorch native finetuning library https://github.com/pytorch/torchtune 0 comments
Related searches:
Search whole site: site:pytorch.org
Search title: Quantization-Aware Training for Large Language Models with PyTorch | PyTorch
See how to search.