GitHub - intel/intel-extension-for-transformers: ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Linking pages

Accelerate Llama 2 with Intel AI Hardware and Software Optimizations https://www.intel.com/content/www/us/en/developer/articles/news/llama2.html 0 comments
Build an Interactive Chat-Generation Model with DialoGPT & PyTorch https://www.intel.com/content/www/us/en/developer/articles/technical/build-chat-generation-model-with-dialogpt-pytorch.html#gs.55x94j 0 comments
GitHub - oneapi-community/awesome-oneapi: An Awesome list of oneAPI projects https://github.com/oneapi-community/awesome-oneapi 0 comments
Hands-on guide to quantizing LLMs https://www.intel.com/content/www/us/en/developer/articles/technical/hands-on-guide-to-quantizing-llms.html 0 comments

Linked pages

GitHub - ggerganov/llama.cpp: Port of Facebook's LLaMA model in C/C++ https://github.com/ggerganov/llama.cpp 286 comments
GitHub - mit-han-lab/streaming-llm: Efficient Streaming Language Models with Attention Sinks https://github.com/mit-han-lab/streaming-llm 65 comments
GitHub - huggingface/transformers: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. https://github.com/huggingface/transformers 26 comments
https://arxiv.org/abs/2309.17453 12 comments
GitHub - lm-sys/FastChat: An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. https://github.com/lm-sys/FastChat 4 comments
https://medium.com/intel-analytics-software/the-practice-of-supervised-finetuning-and-direct-preference-optimization-on-habana-gaudi2-a1197d8a3cd3 1 comment
Assisted Generation: a new direction toward low-latency text generation https://huggingface.co/blog/assisted-generation 0 comments
GitHub - TimDettmers/bitsandbytes: 8-bit CUDA functions for PyTorch https://github.com/TimDettmers/bitsandbytes 0 comments
Intel/neural-chat-7b-v3-1 · Hugging Face https://huggingface.co/Intel/neural-chat-7b-v3-1 0 comments
Intel Neural-Chat 7b: Fine-Tuning on Gaudi2 for Top LLM Performance https://huggingface.co/blog/Andyrasika/neural-chat-intel 0 comments
GitHub - EleutherAI/lm-evaluation-harness: A framework for few-shot evaluation of autoregressive language models. https://github.com/EleutherAI/lm-evaluation-harness 0 comments