Hacker News
Linked pages
- FunSearch: Making new discoveries in mathematical sciences using Large Language Models - Google DeepMind https://deepmind.google/discover/blog/funsearch-making-new-discoveries-in-mathematical-sciences-using-large-language-models/ 103 comments
- [1611.02779] RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning https://arxiv.org/abs/1611.02779 9 comments
- [2305.18290] Direct Preference Optimization: Your Language Model is Secretly a Reward Model https://arxiv.org/abs/2305.18290 8 comments
- [2310.12931] Eureka: Human-Level Reward Design via Coding Large Language Models https://arxiv.org/abs/2310.12931 0 comments
- Evolving New Foundation Models: Unleashing the Power of Automating Model Development https://sakana.ai/evolutionary-model-merge/ 0 comments
- [2406.08414] Discovering Preference Optimization Algorithms with and for Large Language Models https://arxiv.org/abs/2406.08414 0 comments