Hacker News
- State of the Art: Training >70B LLMs on 10k H100 clusters https://www.latent.space/p/llm-training-2024 0 comments
Linked pages
- Introducing DBRX: A New State-of-the-Art Open LLM | Databricks https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm 343 comments
- From bare metal to a 70B model: infrastructure set-up and scripts - imbue https://imbue.com/research/70b-infrastructure/ 36 comments
- How to train a Million Context LLM — with Mark Huang of Gradient.ai https://www.latent.space/p/gradient 1 comment
- Latent Space | swyx | Substack https://www.latent.space/ 0 comments
- MPT-7B and The Beginning of Context=Infinity — with Jonathan Frankle and Abhinav Venigalla of MosaicML https://www.latent.space/p/mosaic-mpt-7b 0 comments
- Why AI Agents Don't Work (yet) - with Kanjun Qiu of Imbue https://www.latent.space/p/imbue 0 comments
- NeurIPS 2023 Recap — Best Papers - by swyx - Latent Space https://www.latent.space/p/neurips-2023-papers 0 comments
- Emulating Humans with NSFW Chatbots - with Jesse Silver https://www.latent.space/p/nsfw-chatbots 0 comments
- ICLR 2024 — Best Papers & Talks (ImageGen, Vision, Transformers, State Space Models) ft. Christian Szegedy, Ilya Sutskever https://www.latent.space/p/iclr-2024-recap 0 comments
- ICLR 2024 — Best Papers & Talks (Benchmarks, Reasoning & Agents) — ft. Graham Neubig, Aman Sanger, Moritz Hardt https://www.latent.space/p/iclr-2024-benchmarks-agents 0 comments
- How To Hire AI Engineers - by Adam Wiggins and James Brady https://www.latent.space/p/hiring 0 comments
- Training a 70B model from scratch: open source tools, evaluation datasets, and learnings - imbue https://imbue.com/research/70b-intro/ 0 comments