Hacker News
Linking pages
- GitHub - BlinkDL/modded-nanogpt-rwkv: RWKV-7: Surpassing GPT https://github.com/BlinkDL/modded-nanogpt-rwkv 18 comments
- GitHub - KellerJordan/modded-nanogpt: NanoGPT (124M) quality in 3.25B tokens https://github.com/KellerJordan/modded-nanogpt 11 comments
- GitHub - SalvatoreRa/ML-news-of-the-week: A collection of the best ML and AI news every week (research, news, resources) https://github.com/SalvatoreRa/ML-news-of-the-week 8 comments
- GitHub - anthonix/llm.c: LLM training in simple, raw C/HIP for AMD GPUs https://github.com/anthonix/llm.c 0 comments
Related searches:
Search whole site: site:github.com
Search title: Reproducing GPT-2 (124M) in llm.c in 90 minutes for $20 · karpathy/llm.c · Discussion #481 · GitHub