Hacker News
Linking pages
Linked pages
- https://openai.com/index/hello-gpt-4o/ 2481 comments
- Google introduces Gemini 2.0: A new AI model for the agentic era https://blog.google/technology/google-deepmind/google-gemini-ai-update-december-2024/ 565 comments
- Introducing Appleâs On-Device and Server Foundation Models - Apple Machine Learning Research https://machinelearning.apple.com/research/introducing-apple-foundation-models 541 comments
- QwQ: Reflect Deeply on the Boundaries of the Unknown | Qwen https://qwenlm.github.io/blog/qwq-32b-preview/ 421 comments
- [2303.17580] HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace https://arxiv.org/abs/2303.17580 295 comments
- Answer.AI - You can now train a 70b language model at home http://www.answer.ai/posts/2024-03-06-fsdp-qlora.html 216 comments
- [2005.14165] Language Models are Few-Shot Learners https://arxiv.org/abs/2005.14165 201 comments
- LLM Powered Autonomous Agents | Lil'Log https://lilianweng.github.io/posts/2023-06-23-agent/ 177 comments
- [2302.04761] Toolformer: Language Models Can Teach Themselves to Use Tools https://arxiv.org/abs/2302.04761 153 comments
- [2401.04088] Mixtral of Experts https://arxiv.org/abs/2401.04088 150 comments
- Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet https://transformer-circuits.pub/2024/scaling-monosemanticity/ 135 comments
- [2304.15004] Are Emergent Abilities of Large Language Models a Mirage? https://arxiv.org/abs/2304.15004 130 comments
- [2305.14314] QLoRA: Efficient Finetuning of Quantized LLMs https://arxiv.org/abs/2305.14314 129 comments
- [2310.06825] Mistral 7B https://arxiv.org/abs/2310.06825 124 comments
- Building effective agents \ Anthropic https://www.anthropic.com/research/building-effective-agents 123 comments
- GitHub - unslothai/unsloth: 2x faster 50% less memory LLM finetuning https://github.com/unslothai/unsloth 122 comments
- Gorilla https://gorilla.cs.berkeley.edu/ 121 comments
- [2310.08560] MemGPT: Towards LLMs as Operating Systems https://arxiv.org/abs/2310.08560 106 comments
- GitHub - facebookresearch/faiss: A library for efficient similarity search and clustering of dense vectors. https://github.com/facebookresearch/faiss 100 comments
- GitHub - huggingface/distil-whisper https://github.com/huggingface/distil-whisper 83 comments
Related searches:
Search whole site: site:www.latent.space
Search title: The 2025 AI Engineering Reading List - Latent Space
See how to search.