Linking pages
- NLP Research in the Era of LLMs - by Sebastian Ruder https://nlpnewsletter.substack.com/p/nlp-research-in-the-era-of-llms 17 comments
- 📄 NeurIPS 2023 Primer - by Sebastian Ruder - NLP News https://nlpnewsletter.substack.com/p/neurips-2023-primer 0 comments
- GitHub - allenai/open-instruct https://github.com/allenai/open-instruct 0 comments
Linked pages
- [2305.18290] Direct Preference Optimization: Your Language Model is Secretly a Reward Model https://arxiv.org/abs/2305.18290 8 comments
- GitHub - BlackSamorez/tensor_parallel: Automatically split your PyTorch models on multiple GPUs for training & inference https://github.com/BlackSamorez/tensor_parallel 0 comments
Related searches:
Search whole site: site:github.com
Search title: GitHub - eric-mitchell/direct-preference-optimization: Reference implementation for DPO (Direct Preference Optimization)
See how to search.