What is QLoRA?: A Visual Guide to Efficient Finetuning of Quantized LLMs

Linked pages

Introducing the next generation of Claude \ Anthropic https://www.anthropic.com/news/claude-3-family 704 comments
[1706.03762] Attention Is All You Need https://arxiv.org/abs/1706.03762 145 comments
[2305.14314] QLoRA: Efficient Finetuning of Quantized LLMs https://arxiv.org/abs/2305.14314 129 comments
WWDC24 - Apple Developer https://developer.apple.com/wwdc24/ 123 comments
PyTorch http://pytorch.org/ 100 comments
[1hr Talk] Intro to Large Language Models - YouTube https://www.youtube.com/watch?v=zjkBMFhNj_g 36 comments
Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA https://huggingface.co/blog/4bit-transformers-bitsandbytes 15 comments
[2106.09685] LoRA: Low-Rank Adaptation of Large Language Models https://arxiv.org/abs/2106.09685 8 comments
How Tesla Continuously and Automatically Improves Autopilot and Full Self-Driving Capability On 5M+ Cars https://codecompass00.substack.com/p/tesla-data-engine-trigger-classifiers 2 comments
The Challenges of Building Effective LLM Benchmarks https://codecompass00.substack.com/p/llm-evaluation-leaderboards 2 comments
[2303.08774] GPT-4 Technical Report https://arxiv.org/abs/2303.08774 1 comment
Mistral AI | Open source models https://mistral.ai/ 1 comment
[2312.11805] Gemini: A Family of Highly Capable Multimodal Models https://arxiv.org/abs/2312.11805 1 comment
[2403.05530] Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context https://arxiv.org/abs/2403.05530 1 comment
What is LoRA?: A Visual Guide to Low-Rank Approximation for Fine-Tuning LLMs Efficiently https://codecompass00.substack.com/p/what-is-lora-a-visual-guide-llm-fine-tuning 1 comment
Build Your Own Open-source RAG Using LangChain, LLAMA 3 and Chroma https://codecompass00.substack.com/p/build-open-source-rag-langchain-llm-llama-chroma 1 comment
[2010.11929] An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale https://arxiv.org/abs/2010.11929 0 comments
A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes https://huggingface.co/blog/hf-bitsandbytes-integration 0 comments
[2302.13971] LLaMA: Open and Efficient Foundation Language Models https://arxiv.org/abs/2302.13971 0 comments