Linking pages
- The 10,000x Yolo Researcher Metagame — with Yi Tay of Reka https://www.latent.space/p/yitay 4 comments
- The Winds of AI Winter - Latent Space https://www.latent.space/p/mar-jun-2024 2 comments
- Llama 2, 3 & 4: Synthetic Data, RLHF, Agents on the path to Open Source AGI https://www.latent.space/p/llama-3 1 comment
- [AINews] Talaria: Apple's new MLOps Superweapon • Buttondown https://buttondown.email/ainews/archive/ainews-talaria-apples-new-mlops-superweapon-4066/ 0 comments
- How To Hire AI Engineers - by Adam Wiggins and James Brady https://www.latent.space/p/hiring 0 comments
- State of the Art: Training >70B LLMs on 10,000 H100 clusters https://www.latent.space/p/llm-training-2024 0 comments
- Is finetuning GPT4o worth it? - Latent Space https://www.latent.space/p/cosine 0 comments
- Language Agents: From Reasoning to Acting - Latent Space https://www.latent.space/p/shunyu 0 comments
- How to Run a Paper Club (also: LIVE at NeurIPS 2024!) https://www.latent.space/p/paperclub 0 comments
- The new Claude 3.5 Sonnet, Computer Use, and Building SOTA Agents — with Erik Schluntz, Anthropic https://www.latent.space/p/claude-sonnet 0 comments
- Bolt.new, Flow Engineering for Code Agents, and >$8m ARR in 2 months as a Claude Wrapper https://www.latent.space/p/bolt 0 comments
Linked pages
- GitHub - princeton-nlp/SWE-agent https://github.com/princeton-nlp/swe-agent 164 comments
- [2305.01625] Unlimiformer: Long-Range Transformers with Unlimited Length Input https://arxiv.org/abs/2305.01625 109 comments
- [2308.00352] MetaGPT: Meta Programming for Multi-Agent Collaborative Framework https://arxiv.org/abs/2308.00352 82 comments
- GitHub - mit-han-lab/streaming-llm: Efficient Streaming Language Models with Attention Sinks https://github.com/mit-han-lab/streaming-llm 65 comments
- [2210.07128] Language Models of Code are Few-Shot Commonsense Learners https://arxiv.org/abs/2210.07128 54 comments
- GitHub - stanfordnlp/dspy: DSPy: The framework for programming—not prompting—language models https://github.com/stanfordnlp/dspy 52 comments
- [2309.03883] DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models https://arxiv.org/abs/2309.03883 43 comments
- GitHub - BerriAI/litellm: lightweight package to simplify LLM API calls - Azure, OpenAI, Cohere, Anthropic. Manages input/output translation https://github.com/BerriAI/litellm 17 comments
- [2311.12983] GAIA: a benchmark for General AI Assistants https://arxiv.org/abs/2311.12983 8 comments
- WebSim, WorldSim, and The Summer of Simulative AI — with Joscha Bach of Liquid AI, Karan Malhotra of Nous Research, Rob Haisfield of WebSim.ai https://www.latent.space/p/sim-ai 7 comments
- [2305.20050] Let's Verify Step by Step https://arxiv.org/abs/2305.20050 3 comments
- Model Spec (2024/05/08) https://cdn.openai.com/spec/model-spec-2024-05-08.html 2 comments
- Cursor.so: The AI-first Code Editor — with Aman Sanger of Anysphere https://www.latent.space/p/cursor 1 comment
- Emulating Humans with NSFW Chatbots - with Jesse Silver https://www.latent.space/p/nsfw-chatbots 1 comment
- How to train a Million Context LLM — with Mark Huang of Gradient.ai https://www.latent.space/p/gradient 1 comment
- [2206.14858] Solving Quantitative Reasoning Problems with Language Models https://arxiv.org/abs/2206.14858 0 comments
- [2302.07867] Learning Performance-Improving Code Edits https://arxiv.org/abs/2302.07867 0 comments
- Latent Space | swyx | Substack https://www.latent.space/ 0 comments
- GitHub - lukasberglund/reversal_curse https://github.com/lukasberglund/reversal_curse 0 comments
- [2310.06770] SWE-bench: Can Language Models Resolve Real-World GitHub Issues? https://arxiv.org/abs/2310.06770 0 comments
Related searches:
Search whole site: site:latent.space
Search title: ICLR 2024 — Best Papers & Talks (Benchmarks, Reasoning & Agents) — ft. Graham Neubig, Aman Sanger, Moritz Hardt)
See how to search.