GitHub - togethercomputer/RedPajama-Data: The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Linking pages

RWKV: Reinventing RNNs for the Transformer Era — with Eugene Cheah of UIlicious https://www.latent.space/p/rwkv#%C2%A7the-eleuther-mafia 66 comments
Apple releases eight small AI language models aimed at on-device use | Ars Technica https://arstechnica.com/information-technology/2024/04/apple-releases-eight-small-ai-language-models-aimed-at-on-device-use/ 61 comments
GitHub - ashvardanian/StringZilla: Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging SWAR and SIMD on Arm Neon and x86 AVX2 & AVX-512-capable chips to accelerate search, sort, edit distances, alignment scores, etc 🦖 https://github.com/ashvardanian/Stringzilla 57 comments
Noteworthy AI Research Papers of 2024 (Part One) https://magazine.sebastianraschka.com/p/ai-research-papers-2024-part-1 21 comments
SlimPajama: A 627B token cleaned and deduplicated version of RedPajama - Cerebras https://www.cerebras.net/blog/slimpajama-a-627b-token-cleaned-and-deduplicated-version-of-redpajama 7 comments
GitHub - eugeneyan/open-llms: 🤖 A list of open LLMs available for commercial use. https://github.com/eugeneyan/open-llms 2 comments
GitHub - Oxen-AI/BitNet-1.58-Instruct: Implementation of BitNet-1.58 instruct tuning https://github.com/Oxen-AI/BitNet-1.58-Instruct 2 comments
RLHF: Reinforcement Learning from Human Feedback https://huyenchip.com/2023/05/02/rlhf.html 1 comment
Snowflake releases a flagship generative AI model of its own | TechCrunch https://techcrunch.com/2024/04/24/snowflake-releases-a-flagship-generative-ai-model-of-its-own/ 1 comment
How Good Are the Latest Open LLMs? And Is DPO Better Than PPO? https://magazine.sebastianraschka.com/p/how-good-are-the-latest-open-llms 1 comment
GitHub - DataEval/dingo: Dingo: A Comprehensive Data Quality Evaluation Tool https://github.com/DataEval/dingo 1 comment
GitHub - Mooler0410/LLMsPracticalGuide: A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers) https://github.com/Mooler0410/LLMsPracticalGuide 0 comments
Timeline of AI and language models – Dr Alan D. Thompson – Life Architect https://lifearchitect.ai/timeline/ 0 comments
GitHub - Hannibal046/Awesome-LLM: Awesome-LLM: a curated list of Large Language Model https://github.com/Hannibal046/Awesome-LLM 0 comments
BTLM-3B-8K: 7B Performance in a 3 Billion Parameter Model - Cerebras https://www.cerebras.net/machine-learning/btlm-3b-8k-7b-performance-in-a-3-billion-parameter-model/ 0 comments
Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning https://xiamengzhou.github.io/sheared-llama/ 0 comments
GitHub - lmmlzn/Awesome-LLMs-Datasets: Summarize existing representative LLMs text datasets. https://github.com/lmmlzn/Awesome-LLMs-Datasets 0 comments
How much LLM training data is there, in the limit? – Educating Silicon https://www.educatingsilicon.com/2024/05/09/how-much-llm-training-data-is-there-in-the-limit/ 0 comments
Apple Open-Sources One Billion Parameter Language Model OpenELM - InfoQ https://www.infoq.com/news/2024/05/apple-llm-openelm/ 0 comments

Linked pages