Hacker News
- [R] Meta is releasing a 175B parameter language model https://arxiv.org/abs/2205.01068 94 comments machinelearning
Linking pages
- AI Canon | Andreessen Horowitz https://a16z.com/2023/05/25/ai-canon/ 219 comments
- Stack Overflow Will Charge AI Giants for Training Data | WIRED https://www.wired.com/story/stack-overflow-will-charge-ai-giants-for-training-data/ 71 comments
- GitHub - xenova/transformers.js: State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server! https://github.com/xenova/transformers.js 55 comments
- Understanding Large Language Models - by Sebastian Raschka https://magazine.sebastianraschka.com/p/understanding-large-language-models 53 comments
- GitHub - huggingface/transformers: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. https://github.com/huggingface/transformers 26 comments
- Understanding Large Language Models -- A Transformative Reading List https://sebastianraschka.com/blog/2023/llm-reading-list.html 26 comments
- Distributed Inference and Fine-tuning of Large Language Models Over The Internet https://browse.arxiv.org/html/2312.08361v1 22 comments
- Large Transformer Model Inference Optimization | Lil'Log https://lilianweng.github.io/posts/2023-01-10-inference-optimization/ 20 comments
- Efficient LLM inference - by Finbarr Timbers https://www.artfintel.com/p/efficient-llm-inference 11 comments
- GitHub - louisfb01/best_AI_papers_2022: A curated list of the latest breakthroughs in AI (in 2022) by release date with a clear video explanation, link to a more in-depth article, and code. https://github.com/louisfb01/best_AI_papers_2022 8 comments
- LLM in a flash: Efficient Large Language Model Inference with Limited Memory https://browse.arxiv.org/html/2312.11514v1 1 comment
- GitHub - amrzv/awesome-colab-notebooks: Collection of google colaboratory notebooks for fast and easy experiments https://github.com/amrzv/awesome-colab-notebooks 0 comments
- Using State-Of-The-Art Artificial Intelligence (AI) Models for Free: Try OPT-175B on Your Cellphone and Laptop - MarkTechPost https://www.marktechpost.com/2022/09/08/using-state-of-the-art-artificial-intelligence-ai-models-for-free-try-opt-175b-on-your-cellphone-and-laptop/ 0 comments
- Meta releases largest open source AI language model to date https://mixed-news.com/en/meta-releases-largest-open-source-ai-language-model-to-date/ 0 comments
- GitHub - huggingface/transformers: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. https://github.com/huggingface/pytorch-transformers 0 comments
- Can large language models be democratized? - TechTalks https://bdtechtalks.com/2022/05/16/opt-175b-large-language-models/ 0 comments
- GitHub - huggingface/transformers: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. https://github.com/huggingface/pytorch-pretrained-BERT 0 comments
- Meta AI Introduces Open Pre-trained Transformers (OPT): A Suite Of Decoder-Only Pre-Trained Transformers Ranging From 125M To 175B Parameters - MarkTechPost https://www.marktechpost.com/2022/05/06/meta-ai-introduces-open-pre-trained-transformers-opt-a-suite-of-decoder-only-pre-trained-transformers-ranging-from-125m-to-175b-parameters/ 0 comments
- GitHub - tomohideshibata/BERT-related-papers: BERT-related papers https://github.com/tomohideshibata/BERT-related-papers 0 comments
- Generating Text With Contrastive Search vs GPT-3/ChatGPT | Forecastegy https://forecastegy.com/posts/generating-text-with-contrastive-search-vs-gpt-3-chatgpt/ 0 comments
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:arxiv.org
Search title: [2205.01068] OPT: Open Pre-trained Transformer Language Models
See how to search.