Hacker News
- Mini-R1: Reproduce DeepSeek R1 "Aha Moment" https://www.philschmid.de/mini-deepseek-r1 15 comments
- How Deepseek R1 Was Trained https://www.philschmid.de/deepseek-r1 3 comments
- Fine-tune classifier with ModernBERT in 2025 https://www.philschmid.de/fine-tune-modern-bert-in-2025 3 comments
- LLaMA 2 – Every Resource you need https://www.philschmid.de/llama-2 5 comments