- "Motif: Intrinsic Motivation from Artificial Intelligence Feedback", Klissarov et al 2023 {FB} (labels from an LLM of NetHack states as a learned reward) https://arxiv.org/abs/2310.00166#facebook