Linking pages
- Research Papers in February 2024 https://sebastianraschka.com/blog/2024/research-papers-in-february-2024.html 7 comments
- Interviewing Louis Castricato of Synth Labs and Eleuther AI on RLHF, Gemini Drama, DPO, founding Carper AI, preference data, reward models, and everything in between https://www.interconnects.ai/p/rlhf-interview-1-louis 1 comment
Related searches:
Search whole site: site:arxiv.org
Search title: [2402.07896] Suppressing Pink Elephants with Direct Principle Feedback
See how to search.