Hacker News
- TL;DR of Deep Dive into LLMs Like ChatGPT by Andrej Karpathy https://anfalmushtaq.com/articles/deep-dive-into-llms-like-chatgpt-tldr 83 comments
Linked pages
- [2501.12948] DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning https://arxiv.org/abs/2501.12948 1061 comments
- X. It’s what’s happening / X http://x.com 681 comments
- LM Studio - Discover, download, and run local LLMs https://lmstudio.ai 165 comments
- LLM Visualization https://bbycroft.net/llm 139 comments
- Let's reproduce GPT-2 (1.6B): one 8XH100 node, 24 hours, $672, in llm.c · karpathy/llm.c · Discussion #677 · GitHub https://github.com/karpathy/llm.c/discussions/677 63 comments
- https://www.youtube.com/watch?v=7xTGNNLPyMI 62 comments
- Together AI â Fast Inference, Fine-Tuning & Training https://together.ai 27 comments
- https://github.com/openai/gpt-2/blob/master/src/model.py 21 comments
- https://lmarena.ai/ 18 comments
- FineWeb: decanting the web for the finest text data at scale - a Hugging Face Space by HuggingFaceFW https://huggingface.co/spaces/HuggingFaceFW/blogpost-fineweb-v1 14 comments
- [1909.08593] Fine-Tuning Language Models from Human Preferences https://arxiv.org/abs/1909.08593 5 comments
- GitHub - openai/gpt-2: Code for the paper "Language Models are Unsupervised Multitask Learners" https://github.com/openai/gpt-2 2 comments
- https://arxiv.org/abs/2203.02155 0 comments
- Ollama https://ollama.com/ 0 comments
- [2407.21783] The Llama 3 Herd of Models https://arxiv.org/abs/2407.21783 0 comments
- Chat Templates https://huggingface.co/docs/transformers/main/en/chat_templating 0 comments
Related searches:
Search whole site: site:anfalmushtaq.com
Search title: Deep dive into LLMs like ChatGPT by Andrej Karpathy (TL;DR) | Anfal Mushtaq
See how to search.