Linked pages
- [2305.18290] Direct Preference Optimization: Your Language Model is Secretly a Reward Model https://arxiv.org/abs/2305.18290 8 comments
- GitHub - BlackSamorez/tensor_parallel: Automatically split your PyTorch models on multiple GPUs for training & inference https://github.com/BlackSamorez/tensor_parallel 0 comments