Linking pages
- Laws of Tech: Commoditize Your Complement · Gwern.net https://www.gwern.net/Complement 323 comments
- GitHub - ggerganov/llama.cpp: Port of Facebook's LLaMA model in C/C++ https://github.com/ggerganov/llama.cpp 286 comments
- AI Canon | Andreessen Horowitz https://a16z.com/2023/05/25/ai-canon/ 219 comments
- Using ChatGPT Plugins with LLaMA. Use OpenAI’s chatgpt-retrieval-plugin… | by Sarmad Qadri | Mar, 2023 | lastmile ai https://blog.lastmileai.dev/using-openais-retrieval-plugin-with-llama-d2e0b6732f14 165 comments
- GitHub - ray-project/llm-numbers: Numbers every LLM developer should know https://github.com/ray-project/llm-numbers 113 comments
- AI and Open Source in 2023 - by Sebastian Raschka, PhD https://magazine.sebastianraschka.com/p/ai-and-open-source-in-2023 67 comments
- GitHub - xenova/transformers.js: State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server! https://github.com/xenova/transformers.js 55 comments
- Understanding Large Language Models - by Sebastian Raschka https://magazine.sebastianraschka.com/p/understanding-large-language-models 53 comments
- Ahead of AI #11: New Foundation Models https://magazine.sebastianraschka.com/p/ahead-of-ai-11-new-foundation-models 34 comments
- Mini-post: first look at LLaMA. Background | by Enryu | Mar, 2023 | Medium https://medium.com/@enryu9000/mini-post-first-look-at-llama-4403517d41a1 27 comments
- GitHub - huggingface/transformers: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. https://github.com/huggingface/transformers 26 comments
- How to run your own LLM (GPT) https://blog.rfox.eu/en/Programming/How_to_run_your_own_LLM_GPT.html 26 comments
- Optimizing LLMs From a Dataset Perspective https://sebastianraschka.com/blog/2023/optimizing-LLMs-dataset-perspective.html 24 comments
- Pile-T5 | EleutherAI Blog https://blog.eleuther.ai/pile-t5/ 14 comments
- GitHub - aimerou/top-ai-papers: A curated list of the most impressive AI papers https://github.com/aimerou/top-ai-papers 9 comments
- SlimPajama: A 627B token cleaned and deduplicated version of RedPajama - Cerebras https://www.cerebras.net/blog/slimpajama-a-627b-token-cleaned-and-deduplicated-version-of-redpajama 7 comments
- GPT-4 architecture: what we can deduce from research literature | Kirill Gadjello's personal blog and website https://kir-gadjello.github.io/posts/gpt4-some-technical-hypotheses/ 6 comments
- "Attention, Please!": A Visual Guide To The Attention Mechanism [Transformer Series] https://open.substack.com/pub/codecompass00/p/visual-guide-attention-mechanism-transformers?r=rcorn 5 comments
- Understanding Parameter-Efficient Finetuning of Large Language Models: From Prefix Tuning to LLaMA-Adapters https://sebastianraschka.com/blog/2023/llm-finetuning-llama-adapter.html 4 comments
- GitHub - vcskaushik/LLMzip https://github.com/vcskaushik/LLMzip 4 comments
Related searches:
Search whole site: site:arxiv.org
Search title: [2302.13971] LLaMA: Open and Efficient Foundation Language Models
See how to search.