Hacker News
- IsoFLOP curves of large language models are flat https://severelytheoretical.wordpress.com/2024/07/31/isoflop-curves-of-large-language-models-are-extremely-flat/ 7 comments
Linked pages
- [2404.10102] Chinchilla Scaling: A replication attempt https://arxiv.org/abs/2404.10102 69 comments
- [2305.16264] Scaling Data-Constrained Language Models https://arxiv.org/abs/2305.16264 5 comments
- Go smol or go home | Harm de Vries https://www.harmdevries.com/post/model-size-vs-compute-overhead/ 0 comments
- [2401.00448] Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws https://arxiv.org/abs/2401.00448 0 comments
- The Llama 3 Herd of Models | Research - AI at Meta https://ai.meta.com/research/publications/the-llama-3-herd-of-models/ 0 comments