Hacker News
- [R] LoRA: Low-Rank Adaptation of Large Language Models https://arxiv.org/abs/2106.09685 2 comments machinelearning
- [R][D] LoRA: Low-Rank Adaptation of Large Language Models https://arxiv.org/abs/2106.09685 3 comments machinelearning
Linking pages
- Google "We Have No Moat, And Neither Does OpenAI" https://www.semianalysis.com/p/google-we-have-no-moat-and-neither 1571 comments
- AI Canon | Andreessen Horowitz https://a16z.com/2023/05/25/ai-canon/ 219 comments
- OpenAI's plans according to Sam Altman https://humanloop.com/blog/openai-plans 210 comments
- Why Open Source AI Will Win - by Varun - Public Experiments https://varunshenoy.substack.com/p/why-open-source-ai-will-win 174 comments
- What We Know About LLMs (Primer) https://willthompson.name/what-we-know-about-llms-primer 164 comments
- Towards 1-bit Machine Learning Models https://mobiusml.github.io/1bit_blog/ 157 comments
- GitHub - microsoft/LoRA: Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models" https://github.com/microsoft/LoRA 156 comments
- GitHub - Lightning-AI/lit-llama: Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed. https://github.com/Lightning-AI/lit-llama 69 comments
- OpenAI's plans according to Sam Altman https://website-nm4keew22-humanloopml.vercel.app/blog/openai-plans 64 comments
- GitHub - okuvshynov/slowllama: Finetune llama2-70b and codellama on MacBook Air without quantization https://github.com/okuvshynov/slowllama 54 comments
- Understanding Large Language Models - by Sebastian Raschka https://magazine.sebastianraschka.com/p/understanding-large-language-models 53 comments
- GitHub - QwenLM/Qwen: The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. https://github.com/QwenLM/Qwen 51 comments
- Decentralized Artificial Intelligence https://www.chaos-engineering.dev/p/decentralized-artificial-intelligence 43 comments
- Finetuning LLMs with LoRA and QLoRA: Insights from Hundreds of Experiments - Lightning AI https://lightning.ai/pages/community/lora-insights/ 39 comments
- Practical Tips for Finetuning LLMs Using LoRA (Low-Rank Adaptation) https://magazine.sebastianraschka.com/p/practical-tips-for-finetuning-llms 37 comments
- Inside the Matrix: Visualizing Matrix Multiplication, Attention and Beyond | PyTorch https://pytorch.org/blog/inside-the-matrix/ 34 comments
- GitHub - chris-alexiuk/alpaca-lora: Instruct-tune LLaMA on consumer hardware https://github.com/chris-alexiuk/alpaca-lora 30 comments
- How to run your own LLM (GPT) https://blog.rfox.eu/en/Programming/How_to_run_your_own_LLM_GPT.html 26 comments
- GitHub - punica-ai/punica: Serving multiple LoRA finetuned LLM as one https://github.com/punica-ai/punica 26 comments
- OpenAI's Sam Altman Shares Insight into Future Plans and Challenges https://www.news.upveda.in/2023/06/03/openais-sam-altman-shares-insight-into-future-plans-and-challenges/ 17 comments
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:arxiv.org
Search title: [2106.09685] LoRA: Low-Rank Adaptation of Large Language Models
See how to search.