Hacker News
Linked pages
- A Little Bit of Reinforcement Learning from Human Feedback https://rlhfbook.com/ 37 comments
- [2402.19427] Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models https://arxiv.org/abs/2402.19427 32 comments
- [2309.06180] Efficient Memory Management for Large Language Model Serving with PagedAttention https://arxiv.org/abs/2309.06180 16 comments
- OLMo - Open Language Model by AI2 https://allenai.org/olmo 4 comments
- A Meticulous Guide to Advances in Deep Learning Efficiency over the Years | Alex L. Zhang https://alexzhang13.github.io/blog/2024/efficient-dl/ 1 comment
- GitHub - vllm-project/vllm: A high-throughput and memory-efficient inference and serving engine for LLMs https://github.com/vllm-project/vllm 0 comments
- [2305.13245] GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints https://arxiv.org/abs/2305.13245 0 comments
- Tulu | Ai2 https://allenai.org/tulu 0 comments
- GitHub - allenai/open-instruct https://github.com/allenai/open-instruct 0 comments
Related searches:
Search whole site: site:learnycurve.substack.com
Search title: How I think about learning - by Saurabh Shah
See how to search.