- "Reinforcement Learning from Human Feedback: Progress and Challenges", John Schulman 2023-04-19 {OA} (fighting confabulations) https://www.youtube.com/watch?t=1098s&v=hhiLw5Q_UFg 3 comments reinforcementlearning
Linking pages
- AI Canon | Andreessen Horowitz https://a16z.com/2023/05/25/ai-canon/ 219 comments
- How RLHF actually works - by Nathan Lambert - Interconnects https://www.interconnects.ai/p/how-rlhf-works 32 comments
- RLHF: Reinforcement Learning from Human Feedback https://huyenchip.com/2023/05/02/rlhf.html 1 comment
- rl-for-llms.md · GitHub https://gist.github.com/yoavg/6bff0fecd65950898eba1bb321cfbd81 0 comments
- Beyond human data: RLAIF needs a rebrand https://www.interconnects.ai/p/beyond-human-data-rlaif 0 comments
- Modern AI is Domestification https://thegradient.pub/ai-is-domestification/ 0 comments
- Evaluating and uncovering open LLMs - by Nathan Lambert https://www.interconnects.ai/p/evaluating-open-llms 0 comments
- How instruction-tuning can encourage hallucinations https://peterjliu.substack.com/p/how-instruction-tuning-can-encourage 0 comments
- RLHF learning resources in 2024 - by Nathan Lambert https://www.interconnects.ai/p/rlhf-resources 0 comments