[2210.11416] Scaling Instruction-Finetuned Language Models - discu.eu

Reddit

[D] Packing multiple shorter training examples in to single sequence in LM pretraining https://arxiv.org/abs/2210.11416 5 comments 15/1/2023 machinelearning

Linking pages

The Near Future of AI is Action-Driven - by John McDonnell https://jmcdonnell.substack.com/p/the-near-future-of-ai-is-action-driven 81 comments
AI and Open Source in 2023 - by Sebastian Raschka, PhD https://magazine.sebastianraschka.com/p/ai-and-open-source-in-2023 67 comments
GitHub - FranxYao/chain-of-thought-hub: Benchmarking large language models' complex reasoning ability with chain-of-thought prompting https://github.com/FranxYao/chain-of-thought-hub 26 comments
Pile-T5 | EleutherAI Blog https://blog.eleuther.ai/pile-t5/ 15 comments
Large language models to identify social determinants of health in electronic health records | npj Digital Medicine https://www.nature.com/articles/s41746-023-00970-0 2 comments
Better Language Models Without Massive Compute – Google AI Blog https://ai.googleblog.com/2022/11/better-language-models-without-massive.html 1 comment
OpenAI comes clean about GPT 3.5 - by John McDonnell https://jmcdonnell.substack.com/p/openai-comes-clean-about-gpt-35 1 comment
🎇 Your guide to AI: January 2023 https://nathanbenaich.substack.com/p/your-guide-to-ai-january-2023 1 comment
How Smaller Language Models Outperform LLMs - Deepgram Blog ⚡️ https://blog.deepgram.com/the-underdog-revolution-how-smaller-language-models-outperform-llms/ 1 comment
Generating Text With Contrastive Search vs GPT-3/ChatGPT | Forecastegy https://forecastegy.com/posts/generating-text-with-contrastive-search-vs-gpt-3-chatgpt/ 0 comments
Google is Leading the AGI race. But can it win? https://sergey.substack.com/p/google-is-leading-the-agi-race-but 0 comments
The Flan Collection: Advancing open source methods for instruction tuning – Google AI Blog https://ai.googleblog.com/2023/02/the-flan-collection-advancing-open.html 0 comments
Is Google Flan-T5 better than OpenAI GPT-3? | by Daniel Avila | Feb, 2023 | Medium https://medium.com/@dan.avila7/is-google-flan-t5-better-than-openai-gpt-3-187fdaccf3a6 0 comments
Google Research, 2022 & beyond: Responsible AI – Google AI Blog https://ai.googleblog.com/2023/01/google-research-2022-beyond-responsible.html 0 comments
Hypothetical Embeddings Explained https://summarity.com/hyde 0 comments
Lessons Learned Building Products Powered by Generative AI | by Clément Huyghebaert | Mar, 2023 | BuzzFeed Tech https://tech.buzzfeed.com/lessons-learned-building-products-powered-by-generative-ai-7f6c23bff376?gi=b50bf453296d 0 comments
New Library Updates in PyTorch 2.0 | PyTorch https://pytorch.org/blog/new-library-updates-in-pytorch-2.0/ 0 comments
GitHub - PiotrNawrot/nanoT5: Fast & Simple repository for pre-training and fine-tuning T5-style models https://github.com/PiotrNawrot/nanoT5 0 comments
Running LLMs in the Browser with Rust + WebGPU https://fleetwood.dev/posts/running-llms-in-the-browser 0 comments
GitHub - Mooler0410/LLMsPracticalGuide: A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers) https://github.com/Mooler0410/LLMsPracticalGuide 0 comments

Would you like to stay up to date with Computer science? Checkout Computer science Weekly.

Related searches:

Search whole site: site:arxiv.org

Search title: [2210.11416] Scaling Instruction-Finetuned Language Models

See how to search.

Submit link to: