Hacker News
- RLHF: Reinforcement Learning from Human Feedback https://huyenchip.com/2023/05/02/rlhf.html 0 comments
- A simple explanation of Reinforcement Learning from Human Feedback (RLHF) https://gist.github.com/JoaoLages/c6f2dfd13d2484aa8bb0b2d567fbf093 4 comments learnmachinelearning
- [R] A simple explanation of Reinforcement Learning from Human Feedback (RLHF) https://gist.github.com/JoaoLages/c6f2dfd13d2484aa8bb0b2d567fbf093 15 comments machinelearning
- [R] Illustrating Reinforcement Learning from Human Feedback (RLHF) https://huggingface.co/blog/rlhf 12 comments machinelearning
- GitHub - lucidrains/PaLM-rlhf-pytorch: Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM https://github.com/lucidrains/PaLM-rlhf-pytorch 2 comments python
- ChatLLaMA 🦙 the first open source implementation of LLaMA based on Reinforcement Learning from Human Feedback (RLHF): https://github.com/nebuly-ai/nebullvm/tree/main/apps/accelerate/chatllama https://github.com/nebuly-ai/nebullvm/tree/main/apps/accelerate/chatllama 5 comments deeplearning