Linking pages
- How Google DeepMind's AlphaGeometry Reached Math Olympiad Level Reasoning By Combining Creative LLMs With Deductive Symbolic Engines https://codecompass00.substack.com/p/google-deepmind-alpha-geometry-neuro-symbolic-llm-system 21 comments
- How OpenAI Uses LLMs to Explain Neurons Inside LLMs At Scale https://codecompass00.substack.com/p/how-openai-uses-llms-to-explain-llm-neurons-at-scale 1 comment
Linked pages
- Introducing the next generation of Claude \ Anthropic https://www.anthropic.com/news/claude-3-family 704 comments
- [1706.03762] Attention Is All You Need https://arxiv.org/abs/1706.03762 145 comments
- [2305.14314] QLoRA: Efficient Finetuning of Quantized LLMs https://arxiv.org/abs/2305.14314 129 comments
- WWDC24 - Apple Developer https://developer.apple.com/wwdc24/ 123 comments
- PyTorch http://pytorch.org/ 100 comments
- [1hr Talk] Intro to Large Language Models - YouTube https://www.youtube.com/watch?v=zjkBMFhNj_g 36 comments
- Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA https://huggingface.co/blog/4bit-transformers-bitsandbytes 15 comments
- [2106.09685] LoRA: Low-Rank Adaptation of Large Language Models https://arxiv.org/abs/2106.09685 8 comments
- How Tesla Continuously and Automatically Improves Autopilot and Full Self-Driving Capability On 5M+ Cars https://codecompass00.substack.com/p/tesla-data-engine-trigger-classifiers 2 comments
- The Challenges of Building Effective LLM Benchmarks https://codecompass00.substack.com/p/llm-evaluation-leaderboards 2 comments
- [2303.08774] GPT-4 Technical Report https://arxiv.org/abs/2303.08774 1 comment
- Mistral AI | Open source models https://mistral.ai/ 1 comment
- [2312.11805] Gemini: A Family of Highly Capable Multimodal Models https://arxiv.org/abs/2312.11805 1 comment
- [2403.05530] Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context https://arxiv.org/abs/2403.05530 1 comment
- What is LoRA?: A Visual Guide to Low-Rank Approximation for Fine-Tuning LLMs Efficiently https://codecompass00.substack.com/p/what-is-lora-a-visual-guide-llm-fine-tuning 1 comment
- Build Your Own Open-source RAG Using LangChain, LLAMA 3 and Chroma https://codecompass00.substack.com/p/build-open-source-rag-langchain-llm-llama-chroma 1 comment
- [2010.11929] An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale https://arxiv.org/abs/2010.11929 0 comments
- A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes https://huggingface.co/blog/hf-bitsandbytes-integration 0 comments
- [2302.13971] LLaMA: Open and Efficient Foundation Language Models https://arxiv.org/abs/2302.13971 0 comments
Related searches:
Search whole site: site:codecompass00.substack.com
Search title: What is QLoRA?: A Visual Guide to Efficient Finetuning of Quantized LLMs
See how to search.