Hacker News
- DeepSeek-R1-Zero learns to reason using reinforcement learning on base model [pdf] https://github.com/deepseek-ai/DeepSeek-R1/blob/main/DeepSeek_R1.pdf 0 comments