- [R] In-context Reinforcement Learning with Algorithm Distillation https://arxiv.org/abs/2210.14215 7 comments machinelearning
Linking pages
- LLM Powered Autonomous Agents | Lil'Log https://lilianweng.github.io/posts/2023-06-23-agent/ 177 comments
- What is AGI-hard - by swyx - L-Space Diaries https://lspace.swyx.io/p/agi-hard 125 comments
- Interviewing Louis Castricato of Synth Labs and Eleuther AI on RLHF, Gemini Drama, DPO, founding Carper AI, preference data, reward models, and everything in between https://www.interconnects.ai/p/rlhf-interview-1-louis 1 comment