Linking pages
- GitHub - zhimin-z/awesome-awesome-artificial-intelligence: A curated list of awesome curated lists of many topics closely related to artificial intelligence. https://github.com/zhimin-z/awesome-awesome-artificial-intelligence 15 comments
- RLHF learning resources in 2024 - by Nathan Lambert https://www.interconnects.ai/p/rlhf-resources 0 comments
Linked pages
- Introducing ChatGPT https://openai.com/blog/chatgpt/ 296 comments
- 500'000€ Prize for Compressing Human Knowledge http://prize.hutter1.net/ 253 comments
- GitHub - microsoft/visual-chatgpt: VisualChatGPT https://github.com/microsoft/visual-chatgpt 229 comments
- IMDb http://www.imdb.com/interfaces 63 comments
- [2212.09251] Discovering Language Model Behaviors with Model-Written Evaluations https://arxiv.org/abs/2212.09251 50 comments
- GitHub - google-research/bert: TensorFlow code and pre-trained models for BERT https://github.com/google-research/bert 21 comments
- GitHub - openai/evals https://github.com/openai/evals 16 comments
- GitHub - lucidrains/PaLM-rlhf-pytorch: Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM https://github.com/lucidrains/PaLM-rlhf-pytorch 15 comments
- Illustrating Reinforcement Learning from Human Feedback (RLHF) https://huggingface.co/blog/rlhf 14 comments
- [2009.01325] Learning to summarize from human feedback https://arxiv.org/abs/2009.01325 12 comments
- [1909.08593] Fine-Tuning Language Models from Human Preferences https://arxiv.org/abs/1909.08593 5 comments
- [2112.09332] WebGPT: Browser-assisted question-answering with human feedback https://arxiv.org/abs/2112.09332 3 comments
- [2109.10862] Recursively Summarizing Books with Human Feedback https://arxiv.org/abs/2109.10862 2 comments
- https://cdn.openai.com/papers/gpt-4.pdf 1 comment
- [2204.05862] Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback https://arxiv.org/abs/2204.05862 1 comment
- [2205.13636] Quark: Controllable Text Generation with Reinforced Unlearning https://arxiv.org/abs/2205.13636 0 comments
- [1706.03741] Deep reinforcement learning from human preferences https://arxiv.org/abs/1706.03741 0 comments
- Learning through human feedback https://deepmind.com/blog/learning-through-human-feedback/ 0 comments
- Google's Natural Questions https://ai.google.com/research/NaturalQuestions 0 comments
- GitHub - salesforce/booksum https://github.com/salesforce/booksum 0 comments