Linking pages
- The 10,000x Yolo Researcher Metagame — with Yi Tay of Reka https://www.latent.space/p/yitay 4 comments
- Segment Anything 2: Demo-first Model Development https://www.latent.space/p/sam2 2 comments
- Llama 2, 3 & 4: Synthetic Data, RLHF, Agents on the path to Open Source AGI https://www.latent.space/p/llama-3 1 comment
- AI Magic: Shipping 1000s of successful products with no managers and a team of 12 — Jeremy Howard of Answer.ai https://www.latent.space/p/answerai 0 comments
- Is finetuning GPT4o worth it? - Latent Space https://www.latent.space/p/cosine 0 comments
- The new Claude 3.5 Sonnet, Computer Use, and Building SOTA Agents — with Erik Schluntz, Anthropic https://www.latent.space/p/claude-sonnet 0 comments
- Bolt.new, Flow Engineering for Code Agents, and >$8m ARR in 2 months as a Claude Wrapper https://www.latent.space/p/bolt 0 comments
Linked pages
- Introducing DBRX: A New State-of-the-Art Open LLM | Databricks https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm 343 comments
- From bare metal to a 70B model: infrastructure set-up and scripts - imbue https://imbue.com/research/70b-infrastructure/ 44 comments
- Emulating Humans with NSFW Chatbots - with Jesse Silver https://www.latent.space/p/nsfw-chatbots 1 comment
- How to train a Million Context LLM — with Mark Huang of Gradient.ai https://www.latent.space/p/gradient 1 comment
- Open-sourcing CARBS: a cost-effective hyperparameter optimizer that helps scale small experiments to large language models - imbue https://imbue.com/research/70b-carbs/ 1 comment
- Ensuring accurate model evaluations: open-sourced, cleaned datasets for models that reason and code - imbue https://imbue.com/research/70b-evals/ 1 comment
- Latent Space | swyx | Substack https://www.latent.space/ 0 comments
- MPT-7B and The Beginning of Context=Infinity — with Jonathan Frankle and Abhinav Venigalla of MosaicML https://www.latent.space/p/mosaic-mpt-7b 0 comments
- Why AI Agents Don't Work (yet) - with Kanjun Qiu of Imbue https://www.latent.space/p/imbue 0 comments
- NeurIPS 2023 Recap — Best Papers - by swyx - Latent Space https://www.latent.space/p/neurips-2023-papers 0 comments
- ICLR 2024 — Best Papers & Talks (ImageGen, Vision, Transformers, State Space Models) ft. Christian Szegedy, Ilya Sutskever https://www.latent.space/p/iclr-2024-recap 0 comments
- ICLR 2024 — Best Papers & Talks (Benchmarks, Reasoning & Agents) — ft. Graham Neubig, Aman Sanger, Moritz Hardt https://www.latent.space/p/iclr-2024-benchmarks-agents 0 comments
- How To Hire AI Engineers - by Adam Wiggins and James Brady https://www.latent.space/p/hiring 0 comments
- Training a 70B model from scratch: open source tools, evaluation datasets, and learnings - imbue https://imbue.com/research/70b-intro/ 0 comments