Hacker News
- The Pile is a 825 GiB diverse, open-source language modelling data set (2020) https://pile.eleuther.ai/ 234 comments
- The Pile: An 800GB Dataset of Diverse Text for Language Modeling http://pile.eleuther.ai/ 60 comments
Linking pages
- GitHub - yandex/YaLM-100B: Pretrained language model with 100B parameters https://github.com/yandex/YaLM-100B 902 comments
- GPT-3 is No Longer the Only Game in Town https://lastweekin.ai/p/gpt-3-is-no-longer-the-only-game 215 comments
- GitHub - openlm-research/open_llama: OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset https://github.com/openlm-research/open_llama 183 comments
- GitHub - kingoflolz/mesh-transformer-jax: Model parallel transformers in JAX and Haiku https://github.com/kingoflolz/mesh-transformer-jax 146 comments
- Microsoft unveils AI model that understands image content, solves visual puzzles | Ars Technica https://arstechnica.com/?p=1920920 102 comments
- This AI Can Generate Convincing Text—and Anyone Can Use It | WIRED https://www.wired.com/story/ai-generate-convincing-text-anyone-use-it/ 97 comments
- Stability AI launches StableLM, an open source ChatGPT alternative | Ars Technica https://arstechnica.com/?p=1933856 91 comments
- GPT-J-6B: 6B JAX-Based Transformer – Aran Komatsuzaki https://arankomatsuzaki.wordpress.com/2021/06/04/gpt-j/ 79 comments
- Cerebras-GPT vs LLaMA AI Model Comparison | LunaTrace https://www.lunasec.io/docs/blog/cerebras-gpt-vs-llama-ai-model-comparison/ 70 comments
- GitHub - EleutherAI/gpt-neox: An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library. https://github.com/EleutherAI/gpt-neox 67 comments
- RWKV: Reinventing RNNs for the Transformer Era — with Eugene Cheah of UIlicious https://www.latent.space/p/rwkv#%C2%A7the-eleuther-mafia 66 comments
- Apple releases eight small AI language models aimed at on-device use | Ars Technica https://arstechnica.com/information-technology/2024/04/apple-releases-eight-small-ai-language-models-aimed-at-on-device-use/ 61 comments
- Microsoft unveils AI model that understands image content, solves visual puzzles | Ars Technica https://arstechnica.com/information-technology/2023/03/microsoft-unveils-kosmos-1-an-ai-language-model-with-visual-perception-abilities/ 54 comments
- TechScape: The AI tools that will write our emails, attend our meetings – and change our lives | Technology | The Guardian https://www.theguardian.com/technology/2023/mar/21/the-ai-tools-that-will-write-our-emails-attend-our-meetings-and-change-our-lives 35 comments
- Stability AI launches StableLM, an open source ChatGPT alternative | Ars Technica https://arstechnica.com/information-technology/2023/04/stable-diffusion-for-language-stability-launches-open-source-ai-chatbot/ 14 comments
- Connor Leahy on EleutherAI, Replicating GPT-2/GPT-3, AI Risk and Alignment https://thegradientpub.substack.com/p/connor-leahy-on-eleutherai-replicating 9 comments
- How the RWKV language model works | The Good Minima https://johanwind.github.io/2023/03/23/rwkv_details.html 6 comments
- What is Llama 2? Meta’s large language model explained | InfoWorld https://www.infoworld.com/article/3706470/what-is-llama-2-metas-large-language-model-explained.html 6 comments
- The best open source software of 2021 | InfoWorld https://www.infoworld.com/article/3637038/the-best-open-source-software-of-2021.html 5 comments
- GitHub - Stability-AI/StableLM: StableLM: Stability AI Language Models https://github.com/Stability-AI/StableLM 4 comments
Related searches:
Search whole site: site:pile.eleuther.ai
Search title: The Pile
See how to search.