Linking pages
- Mistral 7B | Mistral AI | Open source models https://mistral.ai/news/announcing-mistral-7b/ 618 comments
- Building a fully local AI smart home assistant | John's Website https://johnthenerd.com/blog/local-llm-assistant/ 186 comments
- Accelerating Generative AI with PyTorch II: GPT, Fast | PyTorch https://pytorch.org/blog/accelerating-generative-ai-2/ 69 comments
- GitHub - jzhang38/TinyLlama https://github.com/jzhang38/TinyLlama 60 comments
- vLLM: Easy, Fast, and Cheap LLM Serving with PagedAttention https://vllm.ai/ 42 comments
- What I learned from looking at 900 most popular open source AI tools https://huyenchip.com/2024/03/14/ai-oss.html 40 comments
- GitHub - microsoft/aici: AICI: Prompts as (Wasm) Programs https://github.com/microsoft/aici 36 comments
- GitHub - punica-ai/punica: Serving multiple LoRA finetuned LLM as one https://github.com/punica-ai/punica 26 comments
- GitHub - imoneoi/openchat: OpenChat: Advancing Open-source Language Models with Imperfect Data https://github.com/imoneoi/openchat 25 comments
- GitHub - S-LoRA/S-LoRA: S-LoRA: Serving Thousands of Concurrent LoRA Adapters https://github.com/S-LoRA/S-LoRA 20 comments
- AI on Linux: A Collection of AI Models, LLMs and Chatbots for Linux https://linuxblog.io/ai-on-linux-a-collection-of-ai-models-llms-and-chatbots-for-linux/ 10 comments
- Snowflake Arctic - LLM for Enterprise AI https://www.snowflake.com/blog/arctic-open-efficient-foundation-language-models-snowflake/ 6 comments
- GitHub - tensorchord/Awesome-LLMOps: An awesome & curated list of best LLMOps tools for developers https://github.com/tensorchord/Awesome-LLMOps 5 comments
- Mistral AI 7B vLLM inference guide https://docs.mystic.ai/docs/mistral-ai-7b-vllm-fast-inference-guide 4 comments
- GitHub - janhq/awesome-local-ai: An awesome repository of local AI tools https://github.com/janhq/awesome-local-ai 3 comments
- GitHub - taprosoft/llm_finetuning: Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GPTQ, bitsandbytes) https://github.com/taprosoft/llm_finetuning 2 comments
- Serving LLM 24x Faster On the Cloud with vLLM and SkyPilot | SkyPilot Blog https://blog.skypilot.co/serving-llm-24x-faster-on-the-cloud-with-vllm-and-skypilot/ 1 comment
- Llama 2 chat with vLLM (7B, 13B & multi-gpu 70B) https://docs.mystic.ai/docs/llama-2-with-vllm-7b-13b-multi-gpu-70b 1 comment
- GitHub - oscinis-com/Awesome-LLM-Productization: Awesome-LLM-Productization: a curated list of tools/tricks/news/regulations about AI and Large Language Model (LLM) productization https://github.com/oscinis-com/Awesome-LLM-Productization 1 comment
- GitHub - deepseek-ai/DeepSeek-LLM: DeepSeek LLM: Let there be answers https://github.com/deepseek-ai/DeepSeek-LLM 1 comment
Related searches:
Search whole site: site:github.com
Search title: GitHub - vllm-project/vllm: A high-throughput and memory-efficient inference and serving engine for LLMs