[2210.14215] In-context Reinforcement Learning with Algorithm Distillation - discu.eu

Reddit

[R] In-context Reinforcement Learning with Algorithm Distillation https://arxiv.org/abs/2210.14215 7 comments 26/10/2022 machinelearning

Linking pages

LLM Powered Autonomous Agents | Lil'Log https://lilianweng.github.io/posts/2023-06-23-agent/ 177 comments
What is AGI-hard - by swyx - L-Space Diaries https://lspace.swyx.io/p/agi-hard 125 comments
Interviewing Louis Castricato of Synth Labs and Eleuther AI on RLHF, Gemini Drama, DPO, founding Carper AI, preference data, reward models, and everything in between https://www.interconnects.ai/p/rlhf-interview-1-louis 1 comment

Would you like to stay up to date with Computer science? Checkout Computer science Weekly.

Related searches:

Search whole site: site:arxiv.org

Search title: [2210.14215] In-context Reinforcement Learning with Algorithm Distillation

See how to search.

Submit link to: