Hacker News
- GPT Neo: open-source GPT model, with pretrained 1.3B & 2.7B weight models https://github.com/EleutherAI/gpt-neo/ 127 comments
Linking pages
- GitHub - xenova/transformers.js: State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server! https://github.com/xenova/transformers.js 55 comments
- GitHub - minimaxir/aitextgen: A robust Python tool for text-based AI training and generation using GPT-2. https://github.com/minimaxir/aitextgen 41 comments
- GitHub - huggingface/transformers: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. https://github.com/huggingface/transformers 26 comments
- GitHub - CodedotAl/gpt-code-clippy: Full description can be found here: https://discuss.huggingface.co/t/pretrain-gpt-neo-for-open-source-github-copilot-model/7678?u=ncoop57 https://github.com/CodedotAl/gpt-code-clippy 13 comments
- Connor Leahy on EleutherAI, Replicating GPT-2/GPT-3, AI Risk and Alignment https://thegradientpub.substack.com/p/connor-leahy-on-eleutherai-replicating 9 comments
- What is Llama 2? Meta’s large language model explained | InfoWorld https://www.infoworld.com/article/3706470/what-is-llama-2-metas-large-language-model-explained.html 6 comments
- Rotary Embeddings: A Relative Revolution | EleutherAI Blog https://blog.eleuther.ai/rotary-embeddings/ 1 comment
- GitHub - slashml/awesome-coding-copilot: A collection of freely-available alternatives to github copilot https://github.com/slashml/awesome-coding-copilot 1 comment
- GitHub - amrzv/awesome-colab-notebooks: Collection of google colaboratory notebooks for fast and easy experiments https://github.com/amrzv/awesome-colab-notebooks 0 comments
- GPT-3's free alternative GPT-Neo is something to be excited about | VentureBeat https://venturebeat.com/2021/05/15/gpt-3s-free-alternative-gpt-neo-is-something-to-be-excited-about/ 0 comments
- DRM for Computer Vision Datasets - Unite.AI https://www.unite.ai/drm-for-computer-vision-datasets/ 0 comments
- GitHub - huggingface/transformers: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. https://github.com/huggingface/pytorch-transformers 0 comments
- GitHub - huggingface/transformers: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. https://github.com/huggingface/pytorch-pretrained-BERT 0 comments
- ChatGPT - A Free Trial of The Future https://aitoolsreviewed.substack.com/p/chatgpt-a-free-trial-of-the-future?sd=pf 0 comments
- GitHub - promptslab/Awesome-Prompt-Engineering: This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc https://github.com/promptslab/Awesome-Prompt-Engineering 0 comments
- GitHub - jiep/offensive-ai-compilation: A curated list of useful resources that cover Offensive AI. https://github.com/jiep/offensive-ai-compilation 0 comments
- LLaMA: LLMs for Everyone! - by Cameron R. Wolfe https://cameronrwolfe.substack.com/p/llama-llms-for-everyone 0 comments
- A List of 1 Billion+ Parameter LLMs - Matt Rickard https://blog.matt-rickard.com/p/a-list-of-1-billion-parameter-llms 0 comments
- GitHub - thebigbone/opensourceAI: A curated list of open source projects related to AI. https://github.com/thebigbone/opensourceAI 0 comments
- GitHub - Hannibal046/Awesome-LLM: Awesome-LLM: a curated list of Large Language Model https://github.com/Hannibal046/Awesome-LLM 0 comments
Linked pages
- [2005.14165] Language Models are Few-Shot Learners https://arxiv.org/abs/2005.14165 201 comments
- [1701.06538] Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer https://arxiv.org/abs/1701.06538 125 comments
- [2101.00027] The Pile: An 800GB Dataset of Diverse Text for Language Modeling https://arxiv.org/abs/2101.00027 81 comments
- GitHub - EleutherAI/gpt-neox: An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library. https://github.com/EleutherAI/gpt-neox 67 comments
- Cloud Computing Services | Google Cloud https://cloud.google.com 48 comments
- Cloud Storage | Google Cloud https://cloud.google.com/storage#section-10 31 comments
- GitHub - IDSIA/sacred: Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA. https://github.com/IDSIA/sacred 6 comments
- GitHub - EleutherAI/lm-evaluation-harness: A framework for few-shot evaluation of autoregressive language models. https://github.com/EleutherAI/lm-evaluation-harness 0 comments