Should you fine-tune a model - discu.eu

Hacker News

Specialized foundation models fine-tuned for commerce use https://www.shoonya.computer/ 3 comments 26/12/2024
Improvements to the fine-tuning API and expanding our custom models program https://openai.com/blog/introducing-improvements-to-the-fine-tuning-api-and-expanding-our-custom-models-program 42 comments 4/4/2024
Gemma for AI Solutions with Fine-Tuning and Multi-Model Collaboration https://www.kaggle.com/code/jaguar00/gemma-fine-tuning-and-multi-model-collaboration 2 comments 4/3/2024
Recent Advances in Language Model Fine-Tuning (2021) https://www.ruder.io/recent-advances-lm-fine-tuning/ 3 comments 7/2/2023
Fine-tuning a vision model to recognize break dance power moves https://blog.bawolf.com/p/breaking-into-vision 3 comments 17/12/2024
Brev: Start fine-tuning and training models in < 10 minutes https://brev.dev/ 7 comments 27/10/2023
Show HN: I built a website where you can easily fine-tune Llama 3.1 models https://www.tunellama.com 4 comments 4/9/2024
Hermes 3: The First Fine-Tuned Llama 3.1 405B Model https://lambdalabs.com/blog/unveiling-hermes-3-the-first-fine-tuned-llama-3.1-405b-model-is-on-lambdas-cloud 67 comments 15/8/2024
MeZO: Fine-Tuning Language Models with Just Forward Passes https://github.com/princeton-nlp/MeZO 21 comments 6/6/2023
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models https://arxiv.org/abs/2401.01335 12 comments 3/1/2024
Show HN: Fine-tune llama3 model to support function calling https://github.com/michaelnny/Llama3-FunctionCalling 5 comments 11/6/2024
Distributed Inference and Fine-Tuning of Large Language Models over the Internet https://browse.arxiv.org/html/2312.08361v1 22 comments 2/1/2024

Reddit