Linking pages
- RWKV: Reinventing RNNs for the Transformer Era — with Eugene Cheah of UIlicious https://www.latent.space/p/rwkv#%C2%A7the-eleuther-mafia 66 comments
- Apple releases eight small AI language models aimed at on-device use | Ars Technica https://arstechnica.com/information-technology/2024/04/apple-releases-eight-small-ai-language-models-aimed-at-on-device-use/ 61 comments
- GitHub - ashvardanian/StringZilla: Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging SWAR and SIMD on Arm Neon and x86 AVX2 & AVX-512-capable chips to accelerate search, sort, edit distances, alignment scores, etc 🦖 https://github.com/ashvardanian/Stringzilla 57 comments
- SlimPajama: A 627B token cleaned and deduplicated version of RedPajama - Cerebras https://www.cerebras.net/blog/slimpajama-a-627b-token-cleaned-and-deduplicated-version-of-redpajama 7 comments
- GitHub - eugeneyan/open-llms: 🤖 A list of open LLMs available for commercial use. https://github.com/eugeneyan/open-llms 2 comments
- GitHub - Oxen-AI/BitNet-1.58-Instruct: Implementation of BitNet-1.58 instruct tuning https://github.com/Oxen-AI/BitNet-1.58-Instruct 2 comments
- RLHF: Reinforcement Learning from Human Feedback https://huyenchip.com/2023/05/02/rlhf.html 1 comment
- Snowflake releases a flagship generative AI model of its own | TechCrunch https://techcrunch.com/2024/04/24/snowflake-releases-a-flagship-generative-ai-model-of-its-own/ 1 comment
- How Good Are the Latest Open LLMs? And Is DPO Better Than PPO? https://magazine.sebastianraschka.com/p/how-good-are-the-latest-open-llms 1 comment
- GitHub - Mooler0410/LLMsPracticalGuide: A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers) https://github.com/Mooler0410/LLMsPracticalGuide 0 comments
- Timeline of AI and language models – Dr Alan D. Thompson – Life Architect https://lifearchitect.ai/timeline/ 0 comments
- GitHub - Hannibal046/Awesome-LLM: Awesome-LLM: a curated list of Large Language Model https://github.com/Hannibal046/Awesome-LLM 0 comments
- BTLM-3B-8K: 7B Performance in a 3 Billion Parameter Model - Cerebras https://www.cerebras.net/machine-learning/btlm-3b-8k-7b-performance-in-a-3-billion-parameter-model/ 0 comments
- Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning https://xiamengzhou.github.io/sheared-llama/ 0 comments
- GitHub - lmmlzn/Awesome-LLMs-Datasets: Summarize existing representative LLMs text datasets. https://github.com/lmmlzn/Awesome-LLMs-Datasets 0 comments
- How much LLM training data is there, in the limit? – Educating Silicon https://www.educatingsilicon.com/2024/05/09/how-much-llm-training-data-is-there-in-the-limit/ 0 comments
- Apple Open-Sources One Billion Parameter Language Model OpenELM - InfoQ https://www.infoq.com/news/2024/05/apple-llm-openelm/ 0 comments
Linked pages
- RedPajama, a project to create leading open-source models, starts by reproducing LLaMA training dataset of over 1.2 trillion tokens — TOGETHER https://www.together.xyz/blog/redpajama 216 comments
- Stack Exchange Data Dump : Stack Exchange, Inc. : Free Download, Borrow, and Streaming : Internet Archive https://archive.org/details/stackexchange 56 comments
- LAION https://laion.ai/ 9 comments
- EleutherAI https://www.eleuther.ai/ 0 comments