Hacker News
- Exllamav2: Inference library for running LLMs locally on consumer-class GPUs https://github.com/turboderp/exllamav2 125 comments
Linking pages
- GitHub - oobabooga/text-generation-webui: A Gradio web UI for Large Language Models. https://github.com/oobabooga/text-generation-webui 41 comments
- A Visual Guide to Quantization - by Maarten Grootendorst https://newsletter.maartengrootendorst.com/p/a-visual-guide-to-quantization 29 comments
- A guide to LLM inference and performance https://www.baseten.co/blog/llm-transformer-inference-guide/ 14 comments
- GitHub - mlabonne/llm-course: Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks. https://github.com/mlabonne/llm-course 10 comments
- GitHub - janhq/awesome-local-ai: An awesome repository of local AI tools https://github.com/janhq/awesome-local-ai 3 comments
- GitHub - deepseek-ai/DeepSeek-Coder: DeepSeek Coder: Let the Code Write Itself https://github.com/deepseek-ai/DeepSeek-Coder 1 comment
- GitHub - vince-lam/awesome-local-llms: Identify popular and active GitHub repos for hosting local LLMs https://github.com/vince-lam/awesome-local-llms 1 comment
- The Four Wars of the AI Stack (Dec 2023 Recap) https://www.latent.space/p/dec-2023 0 comments
- The Four Wars of the AI Stack (Dec 2023 Recap) https://www.latent.space/i/140396949/mixtral-sparks-a-gpuinference-war 0 comments
- Introduction | AIKit https://sozercan.github.io/aikit/ 0 comments
- GitHub - ComfyUI-Workflow/awesome-comfyui: A collection of awesome custom nodes for ComfyUI https://github.com/ComfyUI-Workflow/awesome-comfyui 0 comments
Related searches:
Search whole site: site:github.com
Search title: GitHub - turboderp/exllamav2: A fast inference library for running LLMs locally on modern consumer-class GPUs
See how to search.