Hacker News
- SFT Memorizes,RL Generalizes: Comparative Study of Foundation Model PostTraining https://arxiv.org/abs/2501.17161 0 comments
- "SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training", Chu et al 2025 https://arxiv.org/abs/2501.17161 4 comments reinforcementlearning
Linking pages
Related searches:
Search whole site: site:arxiv.org
Search title: [2501.17161] SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
See how to search.