[2106.09685] LoRA: Low-Rank Adaptation of Large Language Models - discu.eu

Hacker News

LORA: Low-Rank Adaptation of Large Language Models https://arxiv.org/abs/2106.09685 3 comments 5/5/2023

Reddit

[R] LoRA: Low-Rank Adaptation of Large Language Models https://arxiv.org/abs/2106.09685 2 comments 19/6/2021 machinelearning
[R][D] LoRA: Low-Rank Adaptation of Large Language Models https://arxiv.org/abs/2106.09685 3 comments 18/6/2021 machinelearning

Linking pages

Google "We Have No Moat, And Neither Does OpenAI" https://www.semianalysis.com/p/google-we-have-no-moat-and-neither 1571 comments
AI Canon | Andreessen Horowitz https://a16z.com/2023/05/25/ai-canon/ 219 comments
OpenAI's plans according to Sam Altman https://humanloop.com/blog/openai-plans 210 comments
Why Open Source AI Will Win - by Varun - Public Experiments https://varunshenoy.substack.com/p/why-open-source-ai-will-win 174 comments
What We Know About LLMs (Primer) https://willthompson.name/what-we-know-about-llms-primer 164 comments
Towards 1-bit Machine Learning Models https://mobiusml.github.io/1bit_blog/ 157 comments
GitHub - microsoft/LoRA: Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models" https://github.com/microsoft/LoRA 156 comments
GitHub - Lightning-AI/lit-llama: Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed. https://github.com/Lightning-AI/lit-llama 69 comments
OpenAI's plans according to Sam Altman https://website-nm4keew22-humanloopml.vercel.app/blog/openai-plans 64 comments
GitHub - okuvshynov/slowllama: Finetune llama2-70b and codellama on MacBook Air without quantization https://github.com/okuvshynov/slowllama 54 comments
Understanding Large Language Models - by Sebastian Raschka https://magazine.sebastianraschka.com/p/understanding-large-language-models 53 comments
GitHub - QwenLM/Qwen: The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. https://github.com/QwenLM/Qwen 51 comments
Decentralized Artificial Intelligence https://www.chaos-engineering.dev/p/decentralized-artificial-intelligence 43 comments
Finetuning LLMs with LoRA and QLoRA: Insights from Hundreds of Experiments - Lightning AI https://lightning.ai/pages/community/lora-insights/ 39 comments
Practical Tips for Finetuning LLMs Using LoRA (Low-Rank Adaptation) https://magazine.sebastianraschka.com/p/practical-tips-for-finetuning-llms 37 comments
Inside the Matrix: Visualizing Matrix Multiplication, Attention and Beyond | PyTorch https://pytorch.org/blog/inside-the-matrix/ 34 comments
GitHub - chris-alexiuk/alpaca-lora: Instruct-tune LLaMA on consumer hardware https://github.com/chris-alexiuk/alpaca-lora 30 comments
How to run your own LLM (GPT) https://blog.rfox.eu/en/Programming/How_to_run_your_own_LLM_GPT.html 26 comments
GitHub - punica-ai/punica: Serving multiple LoRA finetuned LLM as one https://github.com/punica-ai/punica 26 comments
OpenAI's Sam Altman Shares Insight into Future Plans and Challenges https://www.news.upveda.in/2023/06/03/openais-sam-altman-shares-insight-into-future-plans-and-challenges/ 17 comments
