500K+ Evaluations by Neural Magic Show Quantized LLMs Retain Accuracy - discu.eu

Hacker News

We Ran Over Half a Million Evaluations on Quantized LLMs https://neuralmagic.com/blog/we-ran-over-half-a-million-evaluations-on-quantized-llms-heres-what-we-found/ 2 comments 18/10/2024

Linked pages

https://lmarena.ai/ 18 comments
GitHub - lmarena/arena-hard-auto: Arena-Hard-Auto: An automatic LLM benchmark. https://github.com/lmarena/arena-hard-auto 3 comments
[2210.09261] Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them https://arxiv.org/abs/2210.09261 1 comment
Open LLM Leaderboard - a Hugging Face Space by open-llm-leaderboard https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard 1 comment
[2103.03874] Measuring Mathematical Problem Solving With the MATH Dataset https://arxiv.org/abs/2103.03874 0 comments
[2210.17323] GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers https://arxiv.org/abs/2210.17323 0 comments
GitHub - vllm-project/vllm: A high-throughput and memory-efficient inference and serving engine for LLMs https://github.com/vllm-project/vllm 0 comments
[2311.12022] GPQA: A Graduate-Level Google-Proof Q&A Benchmark https://arxiv.org/abs/2311.12022 0 comments
[2311.07911] Instruction-Following Evaluation for Large Language Models https://arxiv.org/abs/2311.07911 0 comments

Related searches:

Search whole site: site:neuralmagic.com

Search title: 500K+ Evaluations by Neural Magic Show Quantized LLMs Retain Accuracy

See how to search.

Submit link to: