- [P] Robust Policy Optimization is now in CleanRL π₯! https://docs.cleanrl.dev/ 2 comments machinelearning
- π₯ CleanRL's paper has reached v1.0.0; Reworked documentation, JAX support, and more! https://twitter.com/vwxyzjn/status/1592246430043103232 4 comments reinforcementlearning
- CleanRL now includes a handy hyperparameter tuner π https://twitter.com/vwxyzjn/status/1564032626486382594 2 comments reinforcementlearning
- CleanRL now has a TD3 + JAX that is 2-4x faster than TD3 + Torch! https://twitter.com/vwxyzjn/status/1553902725162729472 5 comments reinforcementlearning
- CleanRL now has a DDPG + JAX implementation roughly 2.5-4x faster than DDPG + PyTorch https://twitter.com/vwxyzjn/status/1546977088653205505 13 comments reinforcementlearning