- [R] All about evaluating Large language models https://explodinggradients.com/all-about-evaluating-large-language-models 8 comments machinelearning
- [R] Evaluating Large Language Models Trained on Code (paper on OpenAI Codex) https://arxiv.org/abs/2107.03374 4 comments machinelearning
- Check Out This DeepMind’s New Language Model, Chinchilla (70B Parameters), Which Significantly Outperforms Gopher (280B) and GPT-3 (175B) on a Large Range of Downstream Evaluation Tasks https://www.marktechpost.com/2022/04/09/check-out-this-deepminds-new-language-model-chinchilla-70b-parameters-which-significantly-outperforms-gopher-280b-and-gpt-3-175b-on-a-large-range-of-downstream-evaluation-tasks/ 4 comments artificial
- How Good is Hugging Face's BLOOM? Human Evaluation of Large Language Models [D] https://www.surgehq.ai/blog/how-good-is-hugging-faces-bloom-a-real-world-human-evaluation-of-language-models 28 comments machinelearning