Hacker News
- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL https://arxiv.org/abs/2501.12948 1051 comments
- "DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning", Guo et al 2025 {DeepSeek} https://arxiv.org/abs/2501.12948#deepseek 2 comments reinforcementlearning
- [R] DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning https://arxiv.org/abs/2501.12948 3 comments machinelearning
Linking pages
- OpenAI Furious DeepSeek Might Have Stolen All the Data OpenAI Stole From Us https://www.404media.co/openai-furious-deepseek-might-have-stolen-all-the-data-openai-stole-from-us/ 2036 comments
- How China’s DeepSeek AI Chatbot Became an Overnight Success - The Atlantic https://www.theatlantic.com/technology/archive/2025/01/deepseek-china-ai/681481/ 602 comments
- Budget AI Model DeepSeek Overtakes ChatGPT on App Store - MacRumors https://www.macrumors.com/2025/01/27/deepseek-ai-app-top-app-store-ios/ 433 comments
- OpenAI Furious DeepSeek Might Have Stolen All the Data OpenAI Stole From Us https://www.404media.co/email/855bf870-82ce-4544-8776-2225627fa39d/ 117 comments
- ChatGPT, DeepSeek, Or Llama? Meta’s LeCun Says Open-Source Is The Key https://www.forbes.com/sites/luisromero/2025/01/27/chatgpt-deepseek-or-llama-metas-lecun-says-open-source-is-the-key/ 5 comments
- Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial https://www.philschmid.de/mini-deepseek-r1 3 comments
- Why DeepSeek Could Change What Silicon Valley Believes About A.I. - The New York Times https://www.nytimes.com/2025/01/28/technology/why-deepseek-could-change-what-silicon-valley-believes-about-ai.html 1 comment
- Novus Ordo Seclorum - by Dean W. Ball - Hyperdimensional https://www.hyperdimensional.co/p/novus-ordo-seclorum 1 comment
- DeepSeek and the Future of AI Competition with Miles Brundage https://www.chinatalk.media/p/deepseek-and-the-future-of-ai-competition 0 comments
- Mixture-of-Experts (MoE) LLMs - by Cameron R. Wolfe, Ph.D. https://cameronrwolfe.substack.com/p/moe-llms 0 comments
- AIMO Progress Prize 1 | Acta Machina https://actamachina.com/posts/aimo-progress-prize-1 0 comments
- DeepSeek vs conspiracies - Tereza Tizkova https://terezatizkova.substack.com/p/deepseek-vs-conspiracies 0 comments
- I don’t believe DeepSeek crashed Nvidia’s stock https://www.understandingai.org/p/i-dont-believe-deepseek-crashed-nvidias 0 comments
- DeepSeek R1 is good enough | Tigris Object Storage https://www.tigrisdata.com/blog/thoughts-deepseek-r1/ 0 comments