discu
Newsletters
Mentions
Extension
Pricing
Login
Sign Up
Reddit
John Schulman (PPO, OA co-founder, post-training/RLHF) leaves OpenAI for Anthropic
https://x.com/johnschulman2/status/1820610863499509855
7 comments
6/8/2024
reinforcementlearning