Linking pages
Linked pages
- [2210.14215] In-context Reinforcement Learning with Algorithm Distillation https://arxiv.org/abs/2210.14215 8 comments
- [2109.10862] Recursively Summarizing Books with Human Feedback https://arxiv.org/abs/2109.10862 2 comments
- [2402.07896] Suppressing Pink Elephants with Direct Principle Feedback https://arxiv.org/abs/2402.07896 1 comment
- GitHub - FanaHOVA/smol-podcaster: smol-podcaster is your autonomous podcast production intern 🐣 https://github.com/FanaHOVA/smol-podcaster 0 comments
- GitHub - gkamradt/LLMTest_NeedleInAHaystack: Doing simple retrieval from LLM models at various context lengths to measure accuracy https://github.com/gkamradt/LLMTest_NeedleInAHaystack 0 comments
- RLHF 201 - with Nathan Lambert of AI2 and Interconnects https://www.latent.space/p/rlhf-201 0 comments
- [2402.15018] Unintended Impacts of LLM Alignment on Global Representation https://arxiv.org/abs/2402.15018 0 comments