- "Top-K Off-Policy Correction for a REINFORCE Recommender System", Chen et al 2018 {G} [scaling to millions of items for YouTube video recommendations] https://arxiv.org/abs/1812.02353 3 comments reinforcementlearning
Linking pages
Related searches:
Search whole site: site:arxiv.org
Search title: [1812.02353] Top-K Off-Policy Correction for a REINFORCE Recommender System
See how to search.