Linking pages
- $2 H100s: How the GPU Bubble Burst - by Eugene Cheah https://www.latent.space/p/gpu-bubble 289 comments
- Segment Anything 2: Demo-first Model Development https://www.latent.space/p/sam2 2 comments
- Emulating Humans with NSFW Chatbots - with Jesse Silver https://www.latent.space/p/nsfw-chatbots 1 comment
- GPT-4o-mini changed ChatBotArena - by Nathan Lambert https://www.interconnects.ai/p/gpt-4o-mini-changed-chatbotarena 0 comments
- AI Magic: Shipping 1000s of successful products with no managers and a team of 12 — Jeremy Howard of Answer.ai https://www.latent.space/p/answerai 0 comments
- Is finetuning GPT4o worth it? - Latent Space https://www.latent.space/p/cosine 0 comments
- Why you should write your own LLM benchmarks — with Nicholas Carlini, Google DeepMind https://www.latent.space/p/carlini 0 comments
- Efficiency is Coming: 3000x Faster, Cheaper, Better AI Inference from Hardware Improvements, Quantization, and Synthetic Data Distillation https://www.latent.space/p/nyla 0 comments
- From API to AGI: Structured Outputs, OpenAI API platform and O1 Q&A — with Michelle Pokrass & OpenAI Devrel + Strawberry team https://www.latent.space/p/openai-api-and-o1 0 comments
- Language Agents: From Reasoning to Acting - Latent Space https://www.latent.space/p/shunyu 0 comments
Linked pages
- [2402.17764] The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits https://arxiv.org/abs/2402.17764 575 comments
- Llama https://llama.meta.com/ 270 comments
- https://youtu.be/WXuK6gekU1Y 190 comments
- [2302.04761] Toolformer: Language Models Can Teach Themselves to Use Tools https://arxiv.org/abs/2302.04761 153 comments
- GitHub - microsoft/autogen: Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ https://github.com/microsoft/autogen 55 comments
- Overleaf, Online LaTeX Editor https://www.overleaf.com/ 27 comments
- Introducing Meta Llama 3: The most capable openly available LLM to date https://ai.meta.com/blog/meta-llama-3/ 19 comments
- [2311.12983] GAIA: a benchmark for General AI Assistants https://arxiv.org/abs/2311.12983 8 comments
- Introducing Llama 3.1: Our most capable models to date https://ai.meta.com/blog/meta-llama-3-1/ 7 comments
- Optimizing AI Inference at Character.AI https://research.character.ai/optimizing-inference/ 6 comments
- The 10,000x Yolo Researcher Metagame — with Yi Tay of Reka https://www.latent.space/p/yitay 4 comments
- SmolLM - blazingly fast and remarkably powerful https://huggingface.co/blog/smollm 2 comments
- Lindy.ai â Meet Your AI Employee https://www.lindy.ai/ 1 comment
- [2211.09085] Galactica: A Large Language Model for Science https://arxiv.org/abs/2211.09085 0 comments
- [2302.07842] Augmented Language Models: a Survey https://arxiv.org/abs/2302.07842 0 comments
- Latent Space | swyx | Substack https://www.latent.space/ 0 comments
- [2307.09288] Llama 2: Open Foundation and Fine-Tuned Chat Models https://arxiv.org/abs/2307.09288 0 comments
- RLHF 201 - with Nathan Lambert of AI2 and Interconnects https://www.latent.space/p/rlhf-201 0 comments
- Open Source AI is AI we can Trust — with Soumith Chintala of Meta AI https://www.latent.space/p/soumith 0 comments
- GitHub - meta-llama/llama3: The official Meta Llama 3 GitHub site https://github.com/meta-llama/llama3 0 comments
Related searches:
Search whole site: site:www.latent.space
Search title: Llama 2, 3 & 4: Synthetic Data, RLHF, Agents on the path to Open Source AGI
See how to search.