[2302.13971] LLaMA: Open and Efficient Foundation Language Models

Linking pages

Laws of Tech: Commoditize Your Complement · Gwern.net https://www.gwern.net/Complement 323 comments
GitHub - ggerganov/llama.cpp: Port of Facebook's LLaMA model in C/C++ https://github.com/ggerganov/llama.cpp 286 comments
AI Canon | Andreessen Horowitz https://a16z.com/2023/05/25/ai-canon/ 219 comments
Using ChatGPT Plugins with LLaMA. Use OpenAI’s chatgpt-retrieval-plugin… | by Sarmad Qadri | Mar, 2023 | lastmile ai https://blog.lastmileai.dev/using-openais-retrieval-plugin-with-llama-d2e0b6732f14 165 comments
GitHub - ray-project/llm-numbers: Numbers every LLM developer should know https://github.com/ray-project/llm-numbers 113 comments
AI and Open Source in 2023 - by Sebastian Raschka, PhD https://magazine.sebastianraschka.com/p/ai-and-open-source-in-2023 67 comments
GitHub - xenova/transformers.js: State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server! https://github.com/xenova/transformers.js 55 comments
Understanding Large Language Models - by Sebastian Raschka https://magazine.sebastianraschka.com/p/understanding-large-language-models 53 comments
Ahead of AI #11: New Foundation Models https://magazine.sebastianraschka.com/p/ahead-of-ai-11-new-foundation-models 34 comments
Mini-post: first look at LLaMA. Background | by Enryu | Mar, 2023 | Medium https://medium.com/@enryu9000/mini-post-first-look-at-llama-4403517d41a1 27 comments
GitHub - huggingface/transformers: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. https://github.com/huggingface/transformers 26 comments
How to run your own LLM (GPT) https://blog.rfox.eu/en/Programming/How_to_run_your_own_LLM_GPT.html 26 comments
Optimizing LLMs From a Dataset Perspective https://sebastianraschka.com/blog/2023/optimizing-LLMs-dataset-perspective.html 24 comments
Pile-T5 | EleutherAI Blog https://blog.eleuther.ai/pile-t5/ 14 comments
GitHub - aimerou/top-ai-papers: A curated list of the most impressive AI papers https://github.com/aimerou/top-ai-papers 9 comments
SlimPajama: A 627B token cleaned and deduplicated version of RedPajama - Cerebras https://www.cerebras.net/blog/slimpajama-a-627b-token-cleaned-and-deduplicated-version-of-redpajama 7 comments
GPT-4 architecture: what we can deduce from research literature | Kirill Gadjello's personal blog and website https://kir-gadjello.github.io/posts/gpt4-some-technical-hypotheses/ 6 comments
"Attention, Please!": A Visual Guide To The Attention Mechanism [Transformer Series] https://open.substack.com/pub/codecompass00/p/visual-guide-attention-mechanism-transformers?r=rcorn 5 comments
Understanding Parameter-Efficient Finetuning of Large Language Models: From Prefix Tuning to LLaMA-Adapters https://sebastianraschka.com/blog/2023/llm-finetuning-llama-adapter.html 4 comments
GitHub - vcskaushik/LLMzip https://github.com/vcskaushik/LLMzip 4 comments