Hacker News
Recent reasoning research: GRPO tweaks, base model RL and data curation
https://www.interconnects.ai/p/papers-im-reading-base-model-rl-grpo
0 comments
31/3/2025