Deep dive into LLMs like ChatGPT by Andrej Karpathy (TL;DR) | Anfal Mushtaq - discu.eu

Hacker News

TL;DR of Deep Dive into LLMs Like ChatGPT by Andrej Karpathy https://anfalmushtaq.com/articles/deep-dive-into-llms-like-chatgpt-tldr 83 comments 10/2/2025

Linked pages

[2501.12948] DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning https://arxiv.org/abs/2501.12948 1061 comments
X. It’s what’s happening / X http://x.com 786 comments
LM Studio - Discover, download, and run local LLMs https://lmstudio.ai 165 comments
LLM Visualization https://bbycroft.net/llm 139 comments
Let's reproduce GPT-2 (1.6B): one 8XH100 node, 24 hours, $672, in llm.c · karpathy/llm.c · Discussion #677 · GitHub https://github.com/karpathy/llm.c/discussions/677 63 comments
https://www.youtube.com/watch?v=7xTGNNLPyMI 62 comments
Together AI - Fast Inference, Fine-Tuning & Training https://together.ai 27 comments
gpt-2/src/model.py at master · openai/gpt-2 · GitHub https://github.com/openai/gpt-2/blob/master/src/model.py 21 comments
https://lmarena.ai/ 18 comments
FineWeb: decanting the web for the finest text data at scale - a Hugging Face Space by HuggingFaceFW https://huggingface.co/spaces/HuggingFaceFW/blogpost-fineweb-v1 14 comments
[1909.08593] Fine-Tuning Language Models from Human Preferences https://arxiv.org/abs/1909.08593 5 comments
GitHub - openai/gpt-2: Code for the paper "Language Models are Unsupervised Multitask Learners" https://github.com/openai/gpt-2 2 comments
https://arxiv.org/abs/2203.02155 0 comments
Ollama https://ollama.com/ 0 comments
[2407.21783] The Llama 3 Herd of Models https://arxiv.org/abs/2407.21783 0 comments
Chat Templates https://huggingface.co/docs/transformers/main/en/chat_templating 0 comments

