Interviewing Louis Castricato of Synth Labs and Eleuther AI on RLHF, Gemini Drama, DPO, founding Carper AI, preference data, reward models, and everything in between - discu.eu

Linking pages

Interviewing Dean Ball on AI policy https://www.interconnects.ai/p/interviewing-dean-ball-on-ai-policy 0 comments

Linked pages

[2210.14215] In-context Reinforcement Learning with Algorithm Distillation https://arxiv.org/abs/2210.14215 8 comments
[2109.10862] Recursively Summarizing Books with Human Feedback https://arxiv.org/abs/2109.10862 2 comments
[2402.07896] Suppressing Pink Elephants with Direct Principle Feedback https://arxiv.org/abs/2402.07896 1 comment
GitHub - FanaHOVA/smol-podcaster: smol-podcaster is your autonomous podcast production intern 🐣 https://github.com/FanaHOVA/smol-podcaster 0 comments
GitHub - gkamradt/LLMTest_NeedleInAHaystack: Doing simple retrieval from LLM models at various context lengths to measure accuracy https://github.com/gkamradt/LLMTest_NeedleInAHaystack 0 comments
RLHF 201 - with Nathan Lambert of AI2 and Interconnects https://www.latent.space/p/rlhf-201 0 comments
[2402.15018] Unintended Impacts of LLM Alignment on Global Representation https://arxiv.org/abs/2402.15018 0 comments

