Linking pages
- RLHF: Reinforcement Learning from Human Feedback https://huyenchip.com/2023/05/02/rlhf.html 1 comment
- Beyond human data: RLAIF needs a rebrand https://www.interconnects.ai/p/beyond-human-data-rlaif 0 comments
- GitHub - Mooler0410/LLMsPracticalGuide: A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers) https://github.com/Mooler0410/LLMsPracticalGuide 0 comments