"R2D2: Recurrent Experience Replay in Distributed Reinforcement Learning", Anonymous 2018 [new ALE/DMLab-30 SOTA: "exceeds human-level in 52/57 ALE"; large improvement over Ape-X using an RNN]
