Hacker News
- We Ran Over Half a Million Evaluations on Quantized LLMs https://neuralmagic.com/blog/we-ran-over-half-a-million-evaluations-on-quantized-llms-heres-what-we-found/ 2 comments
Linked pages
- GitHub - lmarena/arena-hard-auto: Arena-Hard-Auto: An automatic LLM benchmark. https://github.com/lmarena/arena-hard-auto 3 comments
- [2210.09261] Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them https://arxiv.org/abs/2210.09261 1 comment
- Open LLM Leaderboard 2 - a Hugging Face Space by open-llm-leaderboard https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard 1 comment
- [2103.03874] Measuring Mathematical Problem Solving With the MATH Dataset https://arxiv.org/abs/2103.03874 0 comments
- [2210.17323] GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers https://arxiv.org/abs/2210.17323 0 comments
- GitHub - vllm-project/vllm: A high-throughput and memory-efficient inference and serving engine for LLMs https://github.com/vllm-project/vllm 0 comments
- [2311.12022] GPQA: A Graduate-Level Google-Proof Q&A Benchmark https://arxiv.org/abs/2311.12022 0 comments
- [2311.07911] Instruction-Following Evaluation for Large Language Models https://arxiv.org/abs/2311.07911 0 comments
- https://lmarena.ai/ 0 comments
Related searches:
Search whole site: site:neuralmagic.com
Search title: 500K+ Evaluations by Neural Magic Show Quantized LLMs Retain Accuracy
See how to search.