Linking pages
- IsoFLOP curves of large language models are extremely flat – Severely Theoretical https://severelytheoretical.wordpress.com/2024/07/31/isoflop-curves-of-large-language-models-are-extremely-flat/ 7 comments
- Revised Chinchilla scaling laws – LLM compute and token requirements – Educating Silicon https://www.educatingsilicon.com/2024/04/29/revised-chinchilla-scaling-laws-impact-on-llm-compute-and-token-requirements/ 0 comments
Linked pages
- [2203.15556] Training Compute-Optimal Large Language Models https://arxiv.org/abs/2203.15556 0 comments
- BigCode - Open and responsible development of LLMs for code https://www.bigcode-project.org/ 0 comments
- [2001.08361] Scaling Laws for Neural Language Models https://arxiv.org/abs/2001.08361 0 comments
- [2302.13971] LLaMA: Open and Efficient Foundation Language Models https://arxiv.org/abs/2302.13971 0 comments