[2305.11206] LIMA: Less Is More for Alignment - discu.eu

Hacker News

LIMA: Less Is More for Alignment https://arxiv.org/abs/2305.11206 3 comments 22/5/2023

LIMA: Less Is More for Alignment https://arxiv.org/abs/2305.11206 9 comments 22/5/2023

Reddit

LIMA, a 65B-Param LLaMa fine-tuned with standard supervised loss on only 1,000 carefully curated prompts & responses, without any RLHF, demonstrates remarkably strong performance, learning to follow specific responses from only a handful of examples in the training data, including complex queries. https://arxiv.org/abs/2305.11206 32 comments 22/5/2023 machinelearning

Linking pages

Would you like to stay up to date with Computer science? Checkout Computer science Weekly.

Related searches:

Search whole site: site:arxiv.org

Search title: [2305.11206] LIMA: Less Is More for Alignment

See how to search.

Submit link to: