- Direct Preference Optimization (DPO) for LLM Alignment coded in Python & PyTorch from scratch https://github.com/rasbt/LLMs-from-scratch/blob/main/ch07/04_preference-tuning-with-dpo/dpo-from-scratch.ipynb 3 comments python
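The linked notebook implements DPO in PyTorch end to end. For orientation, here is a minimal pure-Python sketch of the per-pair DPO loss, -log σ(β[(log π(y_w) − log π_ref(y_w)) − (log π(y_l) − log π_ref(y_l))]); the function name and arguments are illustrative, not taken from the notebook:

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Scalar DPO loss for one preference pair.

    Each argument is the summed log-probability of a response under
    the policy or the frozen reference model; beta scales how far the
    policy may drift from the reference.
    """
    chosen_ratio = policy_chosen_logp - ref_chosen_logp       # log pi/pi_ref for preferred response
    rejected_ratio = policy_rejected_logp - ref_rejected_logp # log pi/pi_ref for dispreferred response
    logits = beta * (chosen_ratio - rejected_ratio)
    # -log sigmoid(logits): small when the policy prefers the chosen response
    return -math.log(1.0 / (1.0 + math.exp(-logits)))
```

When the policy equals the reference, the loss is log 2 ≈ 0.693; it decreases as the policy's margin for the preferred response grows.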