Reddit
"The Problem with Reasoners: Praying for Transfer Learning", Aidan McLaughlin (will more RL fix o1-style LLMs?)
https://aidanmclaughlin.notion.site/reasoners-problem
21/1/2025
reinforcementlearning