Linking pages
- GitHub - lm-sys/FastChat: An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. https://github.com/lm-sys/FastChat 4 comments
- HQQ quantization https://mobiusml.github.io/hqq_blog/ 2 comments
- GitHub - oscinis-com/Awesome-LLM-Productization: Awesome-LLM-Productization: a curated list of tools/tricks/news/regulations about AI and Large Language Model (LLM) productization https://github.com/oscinis-com/Awesome-LLM-Productization 1 comment
- GitHub - RUCAIBox/LLMSurvey: The official GitHub page for the survey paper "A Survey of Large Language Models". https://github.com/RUCAIBox/LLMSurvey 0 comments
- GitHub - horseee/Awesome-Efficient-LLM: A curated list for Efficient Large Language Models https://github.com/horseee/Awesome-Efficient-LLM 0 comments
- GitHub - AIoT-MLSys-Lab/Efficient-LLMs-Survey: Efficient Large Language Models: A Survey https://github.com/AIoT-MLSys-Lab/Efficient-LLMs-Survey 0 comments
- Serving LLMs on a budget - NOS Docs https://docs.nos.run/docs/blog/serving-llms-on-a-budget.html 0 comments
- The Many Ways to Deploy a Model | Outerbounds https://outerbounds.com/blog/the-many-ways-to-deploy-a-model/ 0 comments
- GitHub - NexaAI/Awesome-LLMs-on-device: Awesome LLMs on Device: A Comprehensive Survey https://github.com/NexaAI/Awesome-LLMs-on-device 0 comments
Linked pages
- GitHub - lm-sys/FastChat: An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. https://github.com/lm-sys/FastChat 4 comments
- [2306.00978] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration https://arxiv.org/abs/2306.00978 2 comments
- [2210.17323] GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers https://arxiv.org/abs/2210.17323 0 comments
- GitHub - haotian-liu/LLaVA: Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities. https://github.com/haotian-liu/LLaVA 0 comments
- GitHub - mit-han-lab/smoothquant: [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models https://github.com/mit-han-lab/smoothquant 0 comments
Related searches:
Search whole site: site:github.com
Search title: GitHub - mit-han-lab/llm-awq: AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
See how to search.